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PREFACE 


Without mathematics, there is not much we can do to progress in our knowledge 
and technologies in mechanics, engineering, and science. In recent years, this 
statement has become applicable to business (such as the mathematical modeling 
and prediction of stock and currency exchange markets), or even social sciences 
(such as statistical analysis of data). The study of differential equations has been a 
very important branch of mathematics or of applied mathematics since the time of 
Leibniz, Bernoulli and Euler, and this is the main subject of the present book. Its 
evolution ties closely with engineering and scientific applications in the 18th 
century. In these two-book sequels, we focus on discussion of the theories and 
applications of differential equations in engineering and mechanics. This volume 
focuses on the theory, and the next one is on applications. 

In college, I took two “engineering mathematics” courses, one from the 
Mathematics Department taught by a mathematician, and the other from the Civil 
Engineering Department taught by Dr. Yan Sze Kwan, an MIT PhD graduate 
majoring in physics. Although I loved both the textbooks that they adopted for the 
Advanced Engineering Mathematics course (one used Kreyszig, 1979 and the other 
used Wylie, 1975), I enjoyed the teaching from the physicist much more than from 
the pure mathematician. Their emphases are different: the mathematician covered 
no background, physical meaning or applications, whilst the physicist emphasized 
application and usefulness of the technique, without loss of rigor. The syllabi of 
these subjects did cover a wide range of topics (although sometimes the coverage 
was very superficial) that is much broader than what we teach to our students today. 
For example, I learned vectors, matrices, vector calculus, Gauss and Stokes 
Theorems, statistics and probability, complex variables including the residue 
theorem and conformal mapping, special functions, calculus of variations, Laplace 
transform, and differential equations. For the differential equation part, the teaching 
mainly covered ordinary differential equations but not much on partial differential 
equations (except for the wave equation), let alone the classification of second 
order PDE as hyperbolic, parabolic and elliptic, integral transforms, integral 
equations, etc. 

Once I entered graduate school, I found that a lot of the mathematical 
techniques in solving differential equations were employed in various graduate 
courses in mechanics and engineering, like wave propagation in solids, continuum 
and solid mechanics, fracture mechanics, and wave hydrodynamics. Although my 
mathematics education at college exposed me to a wide range of topics, I still had 
to pick up various subjects in applied mathematics by self-learning or by sitting in 
on courses that I did not have the luxury to take, such as perturbation theory, 
asymptotic analysis, Fourier series expansion, integral transforms, boundary 
integral equations, Green’s function method, integro-differential equations, 
eigenfunction expansions, variational principle, and weighted residue methods. 
However, there was no single book covering my needs. After my master's degree 
studies in structural engineering from the Asian Institute of Technology and with 
the encouragement of Professor Pisidhi Karasudhi, I decided to pursue a PhD 
degree in Theoretical and Applied Mechanics (TAM) from Northwestern 
University (NU) under the supervision of Professor John Rudnicki. The TAM 
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program at Northwestern University requires students to take three mathematics 
courses before graduation. I formally took a series of three one-quarter courses on 
Differential Equations on Mathematical Physics I, II & III offered by Professor 
William L. Kath. Even though the course covered extensively on partial differential 
equations that included Laplace equation (or the potential theory), wave equations, 
Green’s functions and distribution theory, I found that it remains essential for me to 
continue self-learning on different mathematical topics in solving differential 
equations for my research work as well as for understanding technical papers by 
others. 

The most commonly raised question by educators and students alike is how 
much engineering mathematics is enough for an engineer or a scientist. There is a 
continuous debate on what needs to be covered for engineering and mechanics 
students. But one thing for sure is that you need to read a lot of books in 
mathematics for which the target audiences are not engineers. Some books are by 
far too theoretical and jammed with pages of axioms and theorems without talking 
about applications, and others covered a lot of applications but the topics are not 
advanced enough and specific enough for engineers and mechanicians. For 
example, most textbooks on differential equations contain a chapter on numerical 
methods, yet after learning that, you still have no idea on what the Newton- 
Raphson method, Newmark beta method and Wilson theta method are. They are 
the standards in solving structural dynamics problems. Another example is that 
nearly all textbooks on partial differential equations discuss solely the solutions of 
the wave equation, Laplace equation and heat equation, but engineers and 
mechanicians are typically required to obtain the solutions of biharmonic equations, 
the Poisson equation, Helmholtz equation, or nonhomogeneous wave equation. 
There is a clear gap between the traditional topics covered on PDEs and what 
engineers and mechanicians need to learn. The second volume of Selvadurai’s 
(2000) two-volume book series entitled Partial Differential Equations in 
Mechanics focuses on biharmonic and Poisson equations and is clearly motivated 
by such deficiency. 

A few years back, I was given the opportunity to teach Engineering Analysis 
and Computation for undergraduate civil engineering students. I find that there is 
no single book that can cover what I want to teach, with focus on engineering and 
mechanics applications. About one sixth of the materials in this two-book series 
resulted from preparation of the lecture notes for the course. The remaining content 
is clearly beyond the undergraduate level, and targets graduate students and 
researchers. 

When I was a graduate student, I was tempted to ask those “big shots” with 
superb mathematical skills what “mathematics books” they studied when they were 
young. Nowadays, many of my graduate students are asking me the same question. 
They seem to believe that there exists a single “secret book” that I studied seriously 
when I was young. To be frank, there is no such book. You have to learn your way 
step by step. With this in mind, the present book, hopefully, can partially satisfy 
some of my students. If the present book will be the first book being recommended 
to students by professors as their “secret book,” my time in preparing the present 
book would not be wasted. One main feature of the present book is that detailed 
step-by-step analyses and proofs are given such that most college students with 
basic training in engineering mathematics can follow through many advanced 
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topics. Hopefully, seemingly difficult and advanced topics will be made accessible 
to everyone. Following the advice of Morris Kline (1977), whenever possible, we 
will introduce a topic through its historical background, intuitive reason, physical 
motivations, potential applications, and relevance to the real world. Interesting 
stories of mathematicians and their problems will be given (either in the text or in 
the biography section). Illustrative examples and workable problems are included 
for each topic, hoping to provoke readers to read, learn, think, and ask. 

The main purpose of these two companion-volume books on differential 
equations is to provide readers with most of the essential topics on differential 
equations that one is likely to encounter in solving engineering and mechanics 
problems. The target students are those studying engineering and mechanics, but 
science students may find the present work useful. However, substantial amount of 
material in these book volumes will form useful reference materials for researchers 
and practitioners in dealing with differential equations. There are twenty-eight 
chapters in these two volumes, fifteen for volume one and thirteen for volume two. 

Volume one consists of fifteen chapters and focuses on reviewing the 
essential mathematical techniques in solving differential equations. Some of these 
chapters discuss classical techniques for undergraduate students, some are mainly 
for graduate students, and there are also some exclusively for researchers. We are 
going more for the breath rather than the depth in our coverage, hoping that 
students can find and learn the essential mathematical skills in a more 
comprehensive single volume. Many of the topics covered by these chapters can 
easily evolve into an independent book. Chapter 1 provides an overall summary of 
mathematical preliminaries, Chapter 2 gives an overall view on differential 
equations, Chapter 3 deals with ODEs, Chapter 4 considers series solutions for 
second order ODEs, and Chapter 5 discusses systems of first order ODEs. These 
four chapters can easily fill up lectures for an undergraduate course on 
“introduction to ordinary differential equations” or more. Chapter 6 covers first 
order PDEs, which is normally not covered in engineering courses, Chapter 7 
considers higher order PDEs, Chapter 8 summarizes the idea of Green’s function 
method, Chapter 9 deals with the classical topics of wave, diffusion and potential 
equations in mathematical physics, and Chapter 10 discusses solution of boundary 
value problems in terms of eigenfunction expansions. These four chapters can form 
a graduate course on “partial differential equations in engineering and mechanics.” 
Chapter 11 covers integral and integro-differential equations, Chapter 12 serves an 
introduction to the asymptotic expansion and perturbation, Chapter 13 deals with 
calculus of variations, Chapter 14 summarizes the concept of variational methods, 
and finally Chapter 15 examines numerical methods. Each of the six chapters can 
be a graduate course on its own right. These chapters are expected to provide 
general knowledge on more advanced topics in engineering and mechanics. 
Readers interested in numerical analyses probably should study Chapters 13-15 
together. Chapter 12 is an important introduction to the domain of studying 
nonlinear differential equations. 

Volume two consists of thirteen chapters focused more on applications of 
differential equations on variable topics. The choices of this topic are shaped by my 
former background and interests in engineering and in mechanics. 

The unfailing and continuous support from my wife Lim, my son Magnum, 
and my daughter Jaquelee is what keeps me going when I face difficult times. 
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Regular training with the PolyU swimming team and interactions with swimmers 
from masters swimming allows me to refresh my mind and my body from being 
drained by this ambitious book project. 

After I finished my first book Analytic Methods of Geomechanics, 1 told 
myself that I was not going to write another one because it is just too demanding 
both physically and mentally. However, most first-time marathon finishers will 
swear that they will never run another marathon again in their lives right after the 
race. It is just too demanding. Yet many return to marathon training after only a 
few months. Marathon running is an addictive behavior. As a former marathoner, I 
know too well this feeling and here I go again. 

This book project was encouraged by Mr. Tony Moore, a senior editor of 
civil engineering at CRC Press (imprint of Taylor & Francis). The expert assistance 
from Production Editor Michele Dimont and Editorial Assistant Scott Oakley is 
highly appreciated. A special thank you goes to my former undergraduate students, 
who through their comments shaped some of the contents of this book. 


K.T. Chau 
Hong Kong 
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CHAPTER ONE 
Mathematical Preliminaries 


1.1 INTRODUCTION 


The main focuses of the present book are the application on differential equations in 
engineering and in mechanics, and on the associated solution techniques used in 
solving these equations. Readers are expected to have general education and 
training in engineering mathematics, including differentiation and integration, 
vector analysis, matrix operation, and basics in algebra and trigonometry. Indeed, 
engineering and science students would have taken engineering mathematics before 
they embark on taking a course on differential equations. It is, however, impossible 
to cover all preliminary mathematics in a single chapter before we continue on our 
discussion on differential equations. This chapter will not only cover some 
elementary topics that we need for later development in this book, but also touch 
upon some less familiar, yet important, topics. 

We will in this chapter review binomial theorem, differentiation, integration, 
Jacobian, complex variables, Euler’s polar form for complex variables, elementary 
functions like circular functions (e.g., sine and cosine) and hyperbolic functions 
(e.g., sinh and cosh), differentiation of complex functions, integration using the 
Cauchy integral formula and residue theorem, Gauss and Kelvin-Stokes theorems, 
series expansions (including Taylor series expansion, Maclaurin series expansion, 
Laurent’s series expansion), and vector calculus. Most of these topics are typically 
covered in the first two years of engineering mathematics courses. In addition, we 
have added some more advanced topics that are unlikely being covered in 
elementary engineering mathematics courses, and they are the Frullani-Cauchy 
integral, Ramanujan’s master theorem, Ramanujan’s integral theorem, Darboux’s 
formula, Mittag-Leffler’s expansion, Borel’s theorem, and tensor analysis. Both 
differential and integral forms of gradient, divergence, and curl are discussed. 
Helmholtz’s representation theorem is also discussed. Some of these advanced 
topics can be skipped in the first reading but they form the essential basis for more 
advanced analysis in applied mathematics and in engineering mechanics. 

Before we summarize some essential mathematical backgrounds needed for 
later analysis of differential equations, we recall here the definitions of algebraic 
functions and transcendental functions. If w(x) is a solution of the following 
polynomials, 

Pi(x)w" + B(x)w"" +...4.P_ (w+ P,(x) =0 (1.1) 
where Po # 0, Pi(x),..., Pn(x) are polynomials of x and x is positive integer, w = 
f(x) is called an algebraic function. On the other hand, any function which cannot 
be expressed as a solution of (1.1) is called a transcendental function. 
Transcendental functions include exponential functions, trigonometric functions (or 
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circular functions such as sine and cosine), hyperbolic functions (such as sinh and 
cosh), and logarithmic functions. 


1.22 BINOMIAL THEOREM 


Typically, the binomial theorem has been covered in high school or in a first-year 
college mathematics course. It is, however, so useful that we will employ it 
repeatedly in our analyses on differential equations. To start with, we all learn in 
high school that the square of a sum can be expanded like 


(a+b) =(a+b)(a+b) =a? +2ab+b" (1.2) 


where a and b are any real numbers. This expansion can easily be generalized to 
other integer (whole number) powers as 


(a+b) =1 

(a+b)! =a+b 
(a+b) =a’ +2ab+b? 
(a+b) =a° +3a°b+3ab? +b 


(a+b)* =a* +.4a°b+6a>b’ +4ab> +b* 


(1.3) 


Clearly, there is a pattern in the coefficient of this expansion as the value of the 
integer power increases. It can be checked in a straightforward manner that the 
coefficients can be put in a triangular pattern as shown Figure 1.1. 
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Figure 1.1 The Pascal triangle or Yang Hui/Jia Xian triangle (the figure on the right is adopted 
from Zhu, 1303) 
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The triangle shown in Figure 1.1 expressed in terms of coefficients of power 
expansions is normally called the Pascal triangle in western literature and was 
discovered by Pascal in 1654, but it later was discovered that it has been known to 
other mathematicians well before Pascal. For example, it was known to German 
mathematician Petrus Apianus in 1527 and to Yang Hui in 1261 (his last name is 
Yang). The triangle on the right of Figure 1.1 is the extraction from Zhu’s book in 
1303 (Zhu, 1303). In China, the term “Yang Hui triangle” gets its popularity 
because of the excellent book by Hua Loo Keng (Hua, 1956), the most influential 
mathematician in modern China (see biography section at the end of this book). In 
its introduction, Hua (1956) remarked that Yang Hui mentioned that this triangle 
was reported earlier in a book by Jia Xian (last name is Jia). Based on the fact that 
Jia’s book is lost today, Hua decided to called it the Yang Hui triangle in his book 
and even use it as the title of his excellent book. Being the authority in mathematics 
in China, it was not challenged. This use or misuse of terminology by Hua (1956) 
has been recently criticized by historians. Thus, in China, both the names of “Yang 
Hui triangle” and “Jia Xian triangle” have been used in the literature. 
Mathematically, the binomial theorem can be expressed as: 
a(n — i) x2 fy 
2 

Although the Jia Xian triangle has been known since at least AD 1200, the 
binomial formula given in (1.4) was believed discovered by Isaac Newton. The 


coefficient on the right hand side of (1.4) is called the binomial coefficient. It can 
be defined as 


(x+h)" =x" +nx""h+ +etnxh" | +h" (1.4) 


ee ae) n! (1.5) 
r} ri(n—r)! 


so that 

(xh) =x" + Cox" ths Ch" OW? 4-48 xh” 1 4 Ctn" (1.6) 
In (1.5), the factorial function n! has been used (1.e., 5! =5x4x3x2x1 = 120). This 
binomial coefficient is found extremely useful in probability analysis, which 
however is out of the scope of the present book. For example, the binomial 
coefficient given in (1.5) can be interpreted as the number of different ways of 
taking r objects from n different objects. Taking 1 object at a time (or r =1), we 
can clearly see that there are exactly n different ways. Putting r = 1 in the right 
hand side of (1.5) gives exactly n (note that 1! = 1). It is clear from Figure 1.1 that 
the Pascal triangle or Jia Xian triangle can be generated by two adjacent terms to 
form the binomial coefficient of the next row as shown in Figure 1.2. This summing 
tule can also be proved analytically in Example 1.1. 


Example 1.1 
Cl Ces EN Plea) 
-1)\(n-r)! r\(n-1-r)! 
on - | (1.7) 
——_ [r + (n-r)] = —— =C) 


“rian! r\(n-r)! 
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Figure 1.2 Summing rule in generating the coefficients on next row 


Example 1.2 illustrates the series expansion of the function e* with e being the base 
of natural logarithms (i.¢., e = 2.71828 18284 59045 23536 0287...). The definition 
of e will be given in Example 1.4. 


Example 1.2 
e* = lim(1+2)" = ienfie yD pS ree 
no on noo llon 2! n 3! n 
= 2 _ _ 3 
tees e 1/n)x _f 1/n)Q-2/n)x ud (1.8) 
noo |! 2! 3! 
2 3 
x xX xX 
=]+—+—+—+ 
I! 2! 3! 


Actually (1.8) can be viewed as a Taylor series expansion, and this will be covered 
later in Section 1.13.2. 

In fact, if we set x = 1 in (1.8), we also have the definition of the e. This was 
first considered by Leonard Euler, who was the most prolific researcher in the 
history of mathematics. There are 886 publications of Euler listed in the website of 
the Mathematics Department at Dartmouth University. Interested readers can 
download them from http://www.math.dartmouth.edu/~euler/tour/tour00.html. We 
will see later in this book that Euler’s name will be mentioned repeatedly. Euler is 
actually one of the main pioneers of differential equations and nearly all of his 
theories on differential equations are related to the solving of real problems. Most 
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of the notations that we use today were proposed by Euler, such as z e, i (=(-1)"), 
lx), sinx and cosx (Havil, 2003). Laplace once said “Read Euler, read Euler, he is 
the master of us all” (Dunham, 1999). 

We will next review the essence of calculus, which consists of differentiation 
and integration. They are the bread and butter of differential equations. 


1.3 DIFFERENTIATION 


To the general public, calculus is believed to be invented by Isaac Newton. In fact, 
it was invented independently by British physicist and mathematician Isaac Newton 
and by German mathematician and diplomat Gottfried Wilhelm Leibniz. The 
currently adopted notation for differentiation and integration are all due to Leibniz. 
In a sense, we owe more to Leibniz than to Newton. The following disputed story 
between Newton and Leibniz was told by the most popular physicist of our time, 
Professor Stephen W. Hawking (Hawking, 1988) in the appendix of his celebrated 
book A Brief History of Time. 


Isaac Newton was not a pleasant man. His relations with other academics were 
notorious, with most of his later life spent embroiled in heated disputes... 

A more serious dispute arose with the German philosopher Gottfried Leibniz. 
Both Leibniz and Newton had independently developed a branch of mathematics 
called calculus, which underlies most of modern physics. Although we now 
know that Newton discovered calculus years before Leibniz, he published his 
work much later. A major row ensued over who had been first, with scientists 
vigorously defending both contenders. It is remarkable, however, that most of 
the articles appearing in defence of Newton were originally written by his own 
hand—and only published in the name of friends! As the row grew, Leibniz 
made the mistake of appealing to the Royal Society to resolve the dispute. 
Newton, as president, appointed an “impartial” committee to investigate, 
coincidentally consisting entirely of Newton’s friends! But that was not all: 
Newton then wrote the committee report himself and had the Royal Society 
publish it, officially accusing Leibniz of plagiarism. Still unsatisfied, he then 
wrote an anonymous review of the report in the Royal Society’s own periodical. 
Following the death of Leibniz, Newton is reported to have declared that he had 
taken great satisfaction in “breaking Leibniz’s heart.” (pp. 181-182, Hawking, 
1988) 


In fact, the initial relation between Newton and Leibniz is not as Hawking (1988) 
described above. The following extraction from Newton’s first edition of Principia 
(1687) gives another side of the story: 


In letters which went between me and that most excellent geometer, G.W. 
Leibniz, 10 years ago, when I signified I was in the knowledge of a method of 
determining maxima and minima, of drawing tangents, and the like ... that most 
distinguished man wrote back that he had also fallen on a method of the same 
kind, and communicated his method which hardly different from mine, except in 
his form and words and symbols. (Gjertsen, 1986) 


6 Theory of Differential Equations in Engineering and Mechanics 


However, in the third edition of Principia (1726), which was published 10 years 
after the death of Leibniz, the above reference to Leibniz work was removed. 
Probably, we all should learn something from this notorious story. 

We now return to the technical side. Differentiation of a function f(x) can be 
defined as a limit process of looking the ratio of the change of f versus the change 
of x: 


df = "(x)= lim f(x+h) f (x) 

dx h>0 h 
If we plot f versus x, differentiation of f with respect to x is actually the slope of the 
curve. Figure 1.3 illustrates this geometric meaning of differentiation. It is also clear 
from Figure 1.3 that when f(x) = 0, the point x is either a maximum or a minimum. 
At the maximum point, clearly the slope of the curve is changing from positive to 
negative whereas the slope at the maximum point is zero. Thus, we must have f"(x) 
<0 at a maximum point, and a similar argument shows that we must have /"(x) > 0 
at a minimum point. Because of these, differentiation is useful in maximizing or 
minimizing certain functions. In addition, for a derivative to exist, the curve shown 
in Figure 1.3 must be continuous and smooth. In mathematical terms, we require 
that the same differentiation is obtained at point a whether we approach the point a 
from the left or from the right 


lim f'(x)= lim f'(x) (1.10) 


x—-a-€é 
where ¢ is a small number. 

In Figure 1.3, we also demonstrated the mean value theorem for 
differentiation. In particular, if f(x) is continuous and differentiable in the interval 
(a, ath), there is a value a+@h (0 < 0< 1) such that 

S(ath)= f(a)+hf'(at+@h) (1.11) 
Before we discuss more rules of differentiation, let us consider a very important 
formula related to taking a limit. It is called L’H6pital’s rule. In particular, when 
fix)/g(x) tends to the indeterminate form of 0/0 or 0/co as the limit of x > a, then 


we can evaluate the limit as: 
fy ie 2) (1.12) 


xa g(x) xa g(x) 
If the right hand side of (1.12) remains as in an indeterminate form of 0/0 or 0/c, 
we can reapply (1.12) repeatedly. Note that a finite limit of any indeterminate form 
may exist or may not exist. The so-called L’H6pital’s rule is actually discovered by 
Johann Bernoulli (also known as John Bernoulli), but his student Gillaume 
Francois Antoine de L’H6pital published this rule in his first textbook on calculus 
in 1696. As a result, this was mistaken as the result by L’ H6pital (Maor, 1994). 


(1.9) 


1.3.1 General Formulas 


The most important formula for differentiation is probably the power law rule, 
which is defined as 


f(x") =n" (1.13) 
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a+Oh 


Figure 1.3 Geometric interpretation of differentiation 
To see the proof, we employ the binomial theorem given in (1.4). In particular, we 
can expand the following limit by using the binomial theorem: 


i (x+h)" -—x n(n-1) yy 


li = lim : [x” +nx"'h+ +etnxh" +h" —x"] 
h>0 h ho>0h 
| = =1) a _ 
= lim —[nx” ig MED 2h? +--+ nh" +h" | 
h->0 


: = ee = ei . 
=nx" "+ ra ) x” he tnxh”? +h" "| = nx"! 
h>0 2 
(1.14) 
Without going into the details, we report the following commonly used formulas of 
differentiation: 


Constant rule: 


—(c)=0 1.15 
ze (c) (1.15) 
Sum rule: 
Sag (1.16) 
dx dx dx 
Product rule: 
! w= (1.17) 
dx dx dx 
Quotient rule: 
du dv 
{(t)--## (1.18) 
dx\ v y , 


Constant multiplier rule: 
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d 
—(cu) =c— (1.19) 
where c is a constant, u and v are both function of x. These formulas were known to 


Leibniz in 1675, but they were not published until 1684. The proofs of them are 
straightforward and will not be reported here. 


1.3.2 Chain Rule 


Among different rules of differentiation, the chain rule is probably the most 
powerful. Mathematically, it can be written as 


dy _ dy du (1.20) 
dx du dx 
where y(u) and u(x). This can be proved by noting that 
dy _ lim y(u + Au) — y(u) du _ ‘ieee u(x + Ax) —u(x) (1.21) 
due 40 Au dx  Ax>0 Ax 
Then, the differentiation on the left of (1.20) can be written as: 
dytu(x)} _y Yulx+Ax)}~ yu} 
dx Ax>0 Ax 
ti yi{u(x + Ax)} — y{u(x)} -u(x + Ax) - ue), 
Ax>0 u(x + Ax)—u(x) Ax (1.22) 
= tin y{u+ Au} — y{u} lim u(x + Ax)—u(x) 
Au—>0 Au Ax0 Ax 
_ du 
du dx 


In deriving (1.22), we note by definition that Au= u(x+Ax)—u(x), and Au > 0 as 
Ax— 0. This completes the proof of (1.20). 

The mastery of the chain rule is essential for later analysis of differential 
equations. For example, we can apply the chain rule to the power rule of an 
arbitrary u(x) as (putting y(u) = ie in(1. on 


d ata 
mal ')-< (u na mu (1.23) 


Differentiation of elementary ae forms the basis for the analysis of 
differential equations. The commonly used differentiation formulas for circular or 
trigonometric functions are 


meena a cos(w) =—sinu Le @ tan(w) = sec? w 
dx dx dx dx ax 
du d st 1 du d -] 1 du 
ne las u , tan u= 
— 2 dx’ l-y2 de dx l+u? dx 


The differentiation formulas for hyperbolic functions are 
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cs (sinhw) = coshu a a (coshu) = sinhu a ae (tanh w) = sech7u a 
dx dx dx dx dx d. 


x 
4 eae u) = : da a (cosh ed 
dx l4u2 & dx y2—| ax 
(1.25) 
(anh! 4) =“ 
dx l-u~ dx 
Differentiation of logarithms and exponential functions are 
d ldu d 1 du d;, 7 du 
Inu = ; log, u= : (a )=a Ina—, 
dx udx dx ulna dx dx dx 
(1.26) 
d , du 
~ ot = et 
dx dx 


1.3.3 Leibniz Theorem on n-th Order Differentiation 


The Leibniz rule for taking the n-th differentiation of a product of two functions f 
and g is: 


k d* f d"*g 
dx" — (2) = re ax* dx * (1.27) 


where the binomial coefficient appears in (1.27) has been defined in (1.5). For 


ae a n=4, we have 
3 2e 42 3 4 
Ue vf oo gt tee age (1.28) 
dx” dx dx” dx dx dx dx 
This ie is ae in dealing with higher order differentiation. For small n, the 
validity of (1.27) can be checked easily. The proof of (1.27) for general n can be 
by using ae induction. For n = 1, we have 


- I! df 1! dg df dg 
Cg=—+C = 1.29 
“= eg ee ool? ax api! ae eae a 
_ we have ae (1.5). The validity for n = | is established. Next, we assume 
(1.27) is true for the case of n = k. That is, we have 


W cpy-0 Ff ga cit f Bats as 
dxt 8 dk de® dk? ay? 


re +4 


X g+Cy 
(1.30) 


Taking differentiation with respect to x once more time, we obtain 
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a ay AF de 


k k-1 2 
(fe) = €%( yeti Gs S48 sas 


ae gt ak de a dx de ae 


+E dx! dx? 
af dig , of dig a ag 
+.. AC +C, —_ —. 1.31 
1 ae axt baat de k —)+ ae Ae 8 vee ax 7) ( ) 
ae = el a d°g 
= Cp par 8 (Ck + CG + (Ch + ps Co 
d d ie 
Hop +cpy £48 5 fat 


By employing the definition of the binomial coefficient given in (1.5) and the result 
established in Example 1.1, we can easily show that 


Cl =c?,, =Cr=Ce =1, cCh+cP =Cl4+1=%41=Cl,,, 


; (1.32) 
Ce@eCy, Cee ae aeial=C,, 
Thus, substitution of (1.32) into (1.31) gives 
qk ges 1 a’f dg 5 aly d’g 
Af (fe) = Chas atl Bt Cus at dk + CK ae de 
(1.33) 


df d' @ 8 aks d hg 
+Ch ade act —- Fa ae 
This agrees with (1.27) for the case of n = k+1. Therefore, if (1.27) is true for the 
case of k, (1.27) is also true for A+1. By mathematical induction, (1.27) is valid for 
all values of n starting from n = 1. 


1.3.4 Leibniz Rule of Differentiation for Integral 


Another ‘icauiala formula by Leibniz is the differentiation of integral: 

d phx h(x) 

Sf feed =f FED ae sx.) fs, 1 2 1.34) 
X # g(x) 


where both lower and ‘cm “on are functions Ps x. The formal ides of 
integration will be deferred to a later section. To prove (1.27), we first write the 
integral as 


h(x) 
aan=] fdas (1.35) 
The change of this integral with ere to an increment of Ax : considered: 


Ams) = Hr dx) —9o)= J" frranede-[" sends (1.36) 


The first integral on the right hand das ‘ (1.36) can be split i three integrals as 
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Amay= fi" forraxeydes [s+ axe)de 
(1.37) 


h(x+Ax ; x 

+ Gasset «flee ydé 
h(x g(x 

The second and fourth integrals on the right hand side have the same limits and are 

grouped together, and the limits of the first integral on the right hand side are 

reversed. Thus, (1.37) can be rewritten as: 


h(x) h(x+Ax) 
Ama)=[ fr+ And) fe ae+ [fet ax edé 
g(x) h(x) 


(1.38) 
g(xt+Ax) 
fi, Pet an ede 
The first term on the right of (1.38) can be expressed as: 
h(x h(x) Ax. 
J. Uree+ ang) feeds = af ESO ye; (139) 


whereas, by the mean value theorem for cic See 1963), we can express 
the last two integrals on the right of (1.38) as 


[jo Ferbane nde = fr ax. g hCa+ Ax) ACH) ie 


[oo Fer an didg = fort Ax Neer Ax) — 800) (41) 


where A(x)<é\<h(xt+Ax), and g(x)<&<g(x+Ar). Substituting (1.39)-(1.41) into 
(1.38) and dividing the whole expression by Ax yields 
A A(x) + Ax 

A) (MOU EEN 464 possan gt poor anes) 

Ax g(x) Ax 


. 42) 
Finally, we can take the limit that Ax goes to zero to ae 


dg(x . Ap(x AOD Of (x, 

AO) = tim SEO PTE ge 4 posmoyL-SegyZ (43) 
dx Ax>0 Ax g(x) Ox 

which is the Leibniz formula given in (1.34). This a i the conn of (1.34). 

This formula will be used repeatedly in the later part of this book. 


1.3.5 Partial Derivative 


When a function f depends on more than one variable (say x and y), differentiation 
has to be modified as partial differentiation. In particular, two partial derivatives 
are defined as, depending on differentiating with respect to x or y 


es = lim re = ae y) = Le, y) = (H y=const. — I sx 
Ox = Ax>0 Ax Ox - 

F = tim LOyt Ay fy) _ Fy 
oy Ay>0 Ay oy : 


(1.44) 


= Ts 
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Note that the partial differentiation with respect to the variable x or y is represented by 
the comma-subscript convention (see the last terms in (1.44)). When we take the 
partial differentiation with respect to x, we will keep y constant. Similarly, in taking 
partial differentiation with respect to y, we keep x as constant. This idea can easily 
be extended to the partial differentiation of function with three or more variables 
(say, x, y, and z) derivative. The symbol 0 is not a Greek letter, and it was invented 
by Legendre and has been adopted since (Cajori, 1993). Some people pronounce 0 
as “partial differentiation,’ some call it “partial d,’ but it was introduced as a 
rounded “d” so as to distinguish it from the normal “d.” Therefore, it seems easier 
to call it as “round”. 

An important consequence of partial differentiation lies in the calculation of 
the total change of a function or its total differential, which is defined as: 


=O a4 @ 
df = ee dx + ay (1.45) 


This fundamental result will repeatedly be used in the analysis of differential 
equations (e.g., the exactness of Ist ODE and solving Ist order PDE). To show the 
validity of (1.45), we first note that the total change of f when x increases to x+Ax 
and y increases to y+Ay is 


Af = f(x+ Ax, y+ Ay)— f(x,y) 


= f(x+ Ax, yt Ay)— f(x, y+ Ay) + f(x, y+ Ay)— f(xy) (1.46) 
(ARS REE yg EE Ey 
x Ay 
ie Fin fim (2 AI 
Ax>0 Ax Ay>0 Ay 
a7 tea 
Ox oy 


(1.47) 
This completes the proof of (1.45). When fdepends on n variables (xj, i =1,2,..., 1), 
the total differential becomes 


df = of dx, 4 of dX, +...4 aE dx 

Ox, Ox, Ox, 

The chain rule discussed in Section 1.3.2 has to be revised for the case of multiple 
variables; and, in view of (1.45) we have: 


df _ of dx of dy 
dt Ox dt oy dt 
where fis a function of both x and y, and both x and y are functions of ¢ Equation 
(1.46) is of particular importance in fluid mechanics and it is the basis of the so- 


called convective derivative or material derivative. We will return to this in Section 
2.8.2 of Chapter 2. 


(1.48) 


(1.49) 


Example 1.3 Consider the following partial derivative of a product of two functions 
u and v: 
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(1.50) 
du Ov Oy Oy Ou Ou 
= +u + +V 

Ox, OX, — Ox,OX, Ox, OX, = OX, OX, 


The geometric meaning of partial derivative Of/ox and Of/oy are demonstrated in 
Figure 1.4. They are the corresponding slopes for function f(x,a) and f(b,y). 


of 


Slove of f(%.) 


x 


Figure 1.4 Geometric meaning of 0f/ox and df/oy 


1.3.6 Commutative Rule for Partial Derivatives 


If a function has a continuous and smooth second derivative, and the second 
derivative is commutative as: 


Of _ af 
Oxdy Oyox 


This is known as Clairaut theorem. To prove this, let us consider the following 
function: 


(1.51) 


G= f (Xp +h, Vo +k)— F(X. Yo +H) — (Xp +45 Yo) + F(X Vo) (1.52) 

Next, let us further assume that 
O(x,y) = f(xth,y)— f(xy) (1.53) 
W(x, y)= f(x y+k)— f(xy) (1.54) 


With the definition given in (1.53), we can consider x being xo and y being yo or 
yotk 


P(Xq Vo +k) = (Xp +A, Vo +k) — F(X Yo +*) (1.55) 
Pq V0) = FX +45¥0) —F(%Vo) (1.56) 
Then, (1.52) can be rewritten as 
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G = P(X, ¥0 +k) — G(X, Yo) (1.57) 
Similarly, using the definition of (1.54) we have 
W(X9¥o +k) = F(X +h, Vy +k) — F(X +4, Yo) (1.58) 
W(X. Vo) = F(X. Yo tk) — f(%,%0) (1.59) 
Thus, alternatively the function G defined in (1.52) can be expressed as 
G=Y(X% +h, Vo)- W(X, No) (1.60) 
Recall the mean value theorem that 
ae TO Hs. GR (1.61) 


Application of this mean value theorem to (1.57) leads to 


0 
C= 950.34 +4) B90) =F LC. 39 + 4H) 0<4<1 


~ ss (1.62) 
= Kia, Go +h, ¥q + Ak) ay oe Yo +t Ok)} 
Application of this mean value theorem to (1.60) gives 
0 
G=Y(X% +h, yo) -W(X>Yo) = bh (x, +0,h,Vy) 0<@ <1 
ox (1.63) 


0 0 
= WZ (9 + Oh Jp +k) — L(y + Oh, Yp)} 
Ox Ox 


Applying the mean value theorem for the second time to both (1.62) and (1.63), we 
obtain 


G= ty CFG th Yo tOk) 0<O;<1 (1.64) 


OL (+ Oh Vo t Qk) 0< A <1 (1.65) 


G=kh 
Oye. 


Thus, we obtain 
2 


Fy + Oph Yo + 04k) =~ FL (ay Osh Vo +t Ak) (1.66) 


Consideration of the limiting case that h > é and k > 0 gives 
af af 
Xo, Xo, 1.67 
Bax 0 Yo) = Axdy ———(%.¥9) ( ) 


Thus, the order of partial differentiation can be reversed if the second derivative is 
a smooth function of x and y at point (xo, yo). 


1.4 INTEGRATION 


When a function f(x) is plotted against x as shown in Figure 1.5(a), the area under 
the curve f can be approximated by summing the areas of n columns. 
Mathematically, we can write 
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A=[f (x )Ax + f (xz )Ax +... + f(%, JAX] (1.68) 
The actual area under the curve can be written as an integration if we take the limit 
of n approaching infinity 


(i f(x)dx = lim sy f(x,)Ax (1.69) 
a no i=l 


If the integration is not integrated along the x axis from a to b but instead along a 
curve C in space or in a plane as shown in Figure 1.5(b), we have the contour 
integral. A curve C is depicted in Figure 1.5(b) from Point A to Point B. The 
direction of a curve C can be defined from Point A to Point B in Figure 1.5(b) or 
vice versa. Similar to (1.69), the line integration along curve C is defined as a sum 


(a) Ax (b) 


Figure 1.5 Integration as a sum of areas and a sum of lengths 


[fess = lim De /einyAs (1.70) 


where the length of each segment along the curve C is As, and point (x;, yj) is the 
point at the center of the segment i. The value of the line integral will be negative if 
we integrate from Point B to Point A. Formula (1.70) can be extended easily to 3-D 
space. If the curve C forms a closed contour (Point A overlaps with Point B), the 
integral sign will be changed to 


fC yds = lim" f.y,)As (1.71) 
i=l 


When a function f depends on more than one variable, we can extend the idea 
of a single integration to multiple integrations. For example, using the polar 
coordinate shown in Figure 1.6(a), we can define area integration as: 


I | f(r,0)dA = i {. f(r,0)drdO (1.72) 
i 


Similarly, we can extend the area integration to volume integration, which is 
formed by the revolution of a curve y(x) about the x axis as shown in Figure 1.6(b) 
or about the y axis as shown in Figure 1.6(c). 
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Figure 1.6 Area evaluation in terms of double integral 


1.4.1 Integration by Parts 


One of the most important formulas of integration is integration by parts: 
fu dv=w-—[v du (1.73) 


This technique will be used repeatedly in solving differential equations, and the 
mastery of this technique is essential. This formula can be generalized to the case 
of an integrand involving the n-th derivative of a function f multiplying another 
function g: 


[ree ax = fOVg— fe%ery fO% "1" [ fede (1.74) 


When upper and lower limits of integration are given, these formulas should be 
modified accordingly by assigning the limits to the non-integral parts in (1.73) and 
(1.74). 


1.4.2 General Rules of Integration 


There are a lot of mathematical handbooks on integration (e.g., Gradshteyn and 
Ryzhik, 1980; Zwillinger, 2012; Spiegel, 1968). Carrying out integration may not 
be an easy matter and, in general, integration is much more difficult than 
differentiation. Instead of summarizing all known integration formulas here, we 
only report some useful formulas that we would encounter repeatedly in solving 
differential equations 
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n+l 


futar=" 
n+l 
1 u u 
[—au= Inu ; fe du=e 
u 
gine a! 
[atau = [ered = = 
Ina Ina 


In addition, we report two general rules for carrying out integration 


| f(axdx = “| f (udu 


F(u) du 


[re f(o}de = [Fo = aa as 


1.4.3 Some Transformation Rules 


(1.75) 


(1.76) 


(1.77) 


(1.78) 


(1.79) 


There are some well-established rules for integrands containing certain groups of 
arguments. Here are some of them and their associated change of variables should 


be used: 


[ Flax+b)ax - * | Fudd , usaxtb 
[ PolarByde == fur udu u=Jax+b 
[ PGlax+b)ax = ne Fwd, u=ax+b 
[ Fle? =? )dx =a F(acosu)cosudu x =asinu 
[Pole +2? )dx =a F(asecu) sec? udu, x =atanu 
[Pol? =a? )dx = af F(atanu) secu tan udu, x =asecu 


| F(e™)dx = . [- “ du, u=e™ 


[Fannax = [Peta , u=lnx 


[ Fein" dx = af F(w)cos udu, u=sin! = 
a a 
Qu 1-u?. du x 
F(sin x, cos x)dx = 2 F( : , u=tan 
J l4+u? 14+u? 14u? 2 


(1.80) 


(1.81) 


(1.82) 
(1.83) 
(1.84) 
(1.85) 
(1.86) 
(1.87) 


(1.88) 


(1.89) 


Note, however, that there is no guarantee that these changes of variables will 
always work. It depends on the particular functional form of F involved. In Section 
1.7.4, we will see that Cauchy’s integral formula for complex variables is a very 
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powerful technique even for integrating real functions. Another less known 
technique is Ramanujan’s master theorem and the related Ramanujan’s integral 
theorem, and these will also be discussed in Sections 1.9 and 1.10. 


1.4.4 Mean Value Theorem 


Similar to the mean value theorem in differentiation given in (1.11), the mean value 
theorem also exists in integration. We have actually used the mean value theorem 
in the proof of the Leibniz rule of differentiation for the integrals in (1.40) and 
(1.41) in Section 1.3.4. In particular, there exists a point c within the upper and 
lower limits such that: 


[ seoa= 6-0/0). ee: (1.90) 


[ Peog@ae= (Of gear, a<c<b (1.91) 


where f(x) is a less rapid changing function comparing to g(x). This idea can be 
extended to double integrations. Similarly, the mean value theorem for the double 
integral can be established as 


{| f(r,0)dA = Af (1,9) (1.92) 
R 


where the point (70, ) lies within the domain of integration. 


1.4.5 Improper Integral 


When the integrand is not defined at its end point of integration, the integration can 
be modified as 


b b 
[ fepar=lim] sede (1.93) 


where f(a) tends to infinity. The integral given in (1.93) must exist in order that the 
integral is well defined. Another typical improper integral involves infinity as the 
upper limit 


| " f@de= lim | * ie (1.94) 


Again, the limit on the right hand side of (1.94) must exist in order for (1.94) to be 
well defined. 


1.4.6 Laplace/Gauss Integral 


Let us consider an integral that we often encounter in engineering applications. The 
integral was known as the Laplace integral or Gauss integral. It is defined as 


I = [eax (1.95) 
0 
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Note that (1.95) is an improper integral in the sense defined in the last section. As is 
normally done in engineering, we will omit the limit process but simply assume the 
limit exists. To evaluate (1.95), we will first note that the integrand in (1.95) is an 
even function with respect to x = 0. Thus, we can extend the integral to 


1-3" e? dx (1.96) 


Next, a clever mathematical trick is proposed. We can expand the integration to an 
area integration of an infinite domain as 


Pa [eras f et aa Tf fee aay (1.97) 


This integral can easily be evaluated by using the polar form, as shown in Figure 
LF. 


Figure 1.7 Area evaluation in terms of the double integral 


In particular, we assume 
x=rcos@, y=rsind, Pax +y? (1.98) 
The limits of integration for —o < x, y < 0 become 0 < r< w and 0 < 0 < 2. The 


integral actually represents an area integral. From geometric consideration given in 
Figure 1.7, we must have 


dA = dxdy = rd@dr (1.99) 
Thus, the integral becomes 


co pCO 2 po 
pal J } eV dedy = } } e” rdOdr (1.100) 
4 Jw Jo 0 0 
The integration can be conducted separately for @and r as 
1 1.” _2 a 
P= [0 (->)| ear?) == 1.101 
{lols (-5)], a7) = 4 (1.101) 
Finally, taking the square root of (1.101) gives 


[oo*a- (1.102) 
0 2 


This result will be used repeatedly in the later part of this book. 
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1.5 JACOBIAN 


In (1.100) of the last section, we have seen that when a change of variables is 
applied to an area integral, an additional factor needs to multiply the increment of 
the new variables. In general, we can cast the integral as 


[[ 1. naedy = | Em|y\dzan (1.103) 


where ly | is the additional factor. The symbol Bi | is normally used to represent 
this factor. The factor J was derived by Jacobi and is referred to as Jacobian. It is 
an important quantity for any mapping of a function of two variables. 

Following Hardy (1944), let us consider the following function in terms of wu 
and v: 


d(u,v) =0 (1.104) 
We assume there is an existence of a mapping for wu and v as functions of x and y: 
u=u(x,y), V=Vv(x,y) (1.105) 


Substitution of (mapping (1.105) into (1.104) and taking the partial differentiation 
with respect to x and y gives 


Op _ 0p Ou | Op Ov _ 
Ox Ou ox Ov Ox 
Op _ Op Ou OP Ov _ 
Oy Ou oy Ov doy 
In deriving (1.106) to (1.107), we have applied the chain rule for partial 


differentiation given in (1.49). Equations (1.106) and (1.107) can be rewritten as a 
homogeneous system in matrix form as 


(1.106) 


(1.107) 


ou ov |(d¢g 
ox ax || au 0 

= 1.1 
a — 
oy oy j|lav 


If the partial differentiations of @ with respect to u and v exist, the determinant of 
the square matrix in (1.108) must be zero. The determinant is called Jacobian and 
is defined as: 


ou av 
Gea ae OT ast a, (1.109) 

Ov Ov) Ox dy ax Ox _ 

Ox Oy 


If (1.104) exists, it means that uw can be expressed in terms of v (or vice versa). 
That is, w and v are not independent. In a slightly different way of saying this, if J = 
0, 0b/du and 0d/dv are nonzero. Then, consequently (1.104) exists. Conversely, if 
J #0, 0o/du and 0o/dv are zeros. Then, uw and v are independent, and consequently 
the mapping given by (1.105) exists. Therefore, for a mapping to exist, we must 

have its Jacobian defined in (1.109) not equal to zero. 
We now return to (1.103) and see why the Jacobian is involved. Referring to 

Figure 1.8, we consider a small diagonal element dr as 
r=xe,+ye,, dr=edxt+e,dy (1.110) 
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Now consider a mapping: 
x=x($,7), Y= V5.7) (1.111) 
Therefore, (1.110) becomes 
r=x(S,me+y(S.me,, dr=—ldo+——dn (1,112) 
0g ON 
The two vectors on the right hand side of (1.112) are physically shown in Figure 


1.8 as the components of dr in the é-7 plane. They can be determined using the 
first equation of (1.110) as 


or ag ee. or aca 
0g OF 06 On On On 
The small rectangular element with diagonal dr in the x-y plane in Figure 1.8 is 
dA = (dxe,)x (dxe, ) = dxdye, (1.114) 
whereas the area of the curvilinear element with diagonal dr in the &-7 plane is 


(1.113) 


dr 


ey 


Xx 


Figure 1.8 The mapping of a small vector dr 


ee ig ne a ee 
dé” On aé | ae dn | an 


eo Oy Ov Ox 
0€ On 0€ On 
These areas must be equal before and after the mapping, thus we have 
demonstrated the validity of (1.103). 
Although formula (1.103) was given in nearly all textbooks on engineering 
mathematics (e.g., Kreyszig, 1979; Wylie, 1975), its proof is seldom given. 


(1.115) 


)dEdne; =|J|dEdne, 
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1.6 COMPLEX VARIABLES AND EULER’S FORMULA 


The analysis of complex variables is an important area in applied mathematics. It is 
not possible to review such a huge topic in a chapter, let alone a subsection like 
this. Nevertheless, for later analysis we need to review some essential ideas here. 

The idea of introducing imaginary number can date back to 1545, when 
Italian mathematician Gerolamo Cardano considered the solution of the cubic 
equation. All real numbers can be considered as special cases of complex numbers. 
Setting the imaginary part of any complex numbers to zero, we get real numbers. 
When both the real and imaginary parts of a complex number are replaced by 
changing variables, we have a complex variable 


Z=x+iy, i= V-1 (1.116) 
where x = Re(z) and y = In(z) are the real and imaginary parts of z. When a 
function depends on the complex variable z, we have a complex function 
f(z) =u(x, y) + iv(x, y) (1.117) 
The summation, subtraction, and multiplication of complex numbers are similar to 
that for real numbers. The major difference of the arithmetic for complex numbers 
from that for real number lies in division: 
Zt _ +O)? =D.) _ Xt M2 , V2 ~U2 
22 Xt. (t+ I)M-dy) x + yy Xp + yy” 
In obtaining the result of (1.118), we have used the identity i? = —1. The conjugate 
of a complex number z is defined as: 


(1.118) 


Z=x-iy (1.119) 

We can easily show the following identities 
Z+z=2x=2Re(z), 2-7 =2iy =i2Im(z) (1.120) 
lf’ = = (x+)e-H) =x? -Py? = +9? (1.121) 


The |z| is called the modulus of a complex number. 
The polar form of a complex number is defined in Figure 1.9, and is given as 


z=rcos@+irsing = re” (1.122) 


Z=x+y 


x=rcos@ Xx 


Figure 1.9 Polar form of a complex number 
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where r= |z| and 0 is the argument of the complex number. Equation (1.122) is 
known as Euler formula. By applying the Euler polar form, we immediately obtain 
DeMoivre’s Formula 


z" =r"e” =r" (cosn0 +isinnO) (1.123) 
Let us consider the special case that 9 = zin (1.122), we obtain 
e’” =cosa+isinz =—1 (1.124) 
Rearranging (1.124) gives the well-known Euler’s formula 
e'7 +1=0 (1.125) 


Many mathematicians and scientists called this the most beautiful formula in 
mathematics. It links five fundamental constants in mathematics: the base of natural 
logarithms (or Euler’s number), imaginary constant, ratio of circumference to 
diameter of a circle pi, one, and zero; otherwise, these constants are seemingly 
unrelated. Nobel prize laureate in physics, Richard Feynman, called this the most 
remarkable formula in mathematics (in the cover page of Nahin, 2006). Among 
these five constants, m and e are the most fundamental and appear in all branches of 
mathematics and engineering applications. Appendix F compiles some important 
formulas for 1, whereas the definition for e is given next. 


Example 1.4 Let us consider the origin of the base of natural logarithms e. If a sum 
of money P is deposited at a bank, and the bank offered an annual interest rate of r, 
thus, at the end of one year, we gain an interest of Pr or the total return sum S as 

S=P(+r) (1.126) 
Then, what happens if we take out the money from the bank at the end of one-half 
year and redeposit it into the bank with the new initial sum. Mathematically, the 
return sum at the end of one year becomes 


= r r _ F<9. 
S=[P+ M+ 5)= P+) (1.127) 


This formula is actually the compound interest formula that we should have learned 
in high school. Clearly, the sum given in (1.127) is more than that of (1.126). If 
after each 1/n year, we take out the money and redeposit it again, when we do this n 
times in a year, we ultimately get 


Sa(pd4+5.40s5=Pa4+5" (1.128) 
n n n 


One may ask the ultimate question, what would be the largest gain in a year, if we 
decrease the deposit period to an infinitely small period and redeposit again and 
again. Thus, mathematically, we have the sum at the end of one year as 


S=Plim(1+2)" = Pe" (1.129) 
no n 
The last of (1.129) gives the definition of e, and this gives birth to natural 
logarithms. Clearly, we have 
nao 


e= lim(1+4)" = 2.71828... (1.130) 
nN 


The origin of e is from the deposit and interest problem, yet we will see that it is of 
central importance to mathematics as well as to the solution of differential 
equations. 
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The amazing properties of Euler’s formula do not stop at (1.125). Let us consider 
another example of (1.122). 


Example 1.5 What is 7 to the power 7? 


Solution: Consider the special case that 9 = 7/2 + 2nzin (1.122); we find 


gree. cos(5 + 2nm) +isin(S + 2nz) =i (1.131) 
Then, let us consider the amazing form of i to the power i 

i! = ehi(a/2+2n7) =e (#/2+2nz) (1.132) 
Surprisingly, i to the power 7 is actually real. There are infinite answers to it, the 
“simplest” one being e*? or 0.207879576... Some authors also called this the 
principal value among the infinite solutions. This amazing result was obtained by 
Euler in 1746. It has been reported unofficially that some mathematics departments 
will ask this question at the interview of potential candidates for their bachelor 
degree program. In the problem section at the end of the chapter, we will consider 
more amazing formulas resulting from (1.122). 


1.7 ANALYTIC FUNCTION 


Let us now consider the differentiation of a complex variable. In analogy to the 
definition given for a real variable, we can define 


r= 2) _ jin SEF) SE) (1.133) 
dz Az—>0 Az 

If this derivative exists, we call this complex function analytic. Actually, it is just 
the complex counterpart of a real function being differentiable. Therefore, let us 
reiterate that, in essence, analytic is just another word for differentiable in the 
complex variable context. We have learned that a function must be smooth for it 
being differentiable (or left hand limit equals right hand limit shown in (1.10)) and, 
similarly, we also have certain conditions that a complex function needs to satisfy 
in order to be analytic. This required condition is called the Cauchy-Riemann 
relations and is considered next. 


1.7.1 Cauchy-Riemann Relations 


Let us assume that a complex function can be written in terms of a real part 
function and a complex part function as given in (1.117). Then, the complex and its 
incremental change due to the change of z are 


f(z) = u(x, y) +iv(x, y), 
f(z+Az) =u(x+ Ax, y + Ay) +iv(x+ Ax, y+ Ay) 
Substitution of (1.134) into (1.133) yields 


(1.134) 
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i a. J(2th2)=f) 
Fe)= jm fea 
ae u(x + Ax, y+ Ay)—u(x, vy) +i v(x + Ax, y + Ay) — v(x, y)] 
Az>0 Ax +iAy 
Since Az = Ax + iAy, we have two independent paths of taking the limit. First, we 
can take Ay — 0, then Ax > 0: 
u(x + Ax, y) —u(x, vy) +i[v(x + Ax, vy) — v(x, y)] 
Ax 


(1.135) 


f'(z)= lim 
Ax>0 


Ou .Ov alee) 
— ee ee 
ox Ox 
Secondly, we can take Av —> 0, then Ay > 0. 
FG) = lim u(x, yt Ay) —u(x, y) 1 i[v(x, yr Ay) = V(x, y)] 
Ay>0 iAy 
(1.137) 


_10u , ov _ ov ; Ou 
idy dy oy oy 
The resulting limits by following these paths must be the same if the differentiation 


is unique. Equating the real and imaginary parts of (1.136) and (1.137) gives a pair 
of equations: 


ou _ ov Ov Ou 


> = (1.138) 
Ox Ov Ox oy 
These are the Cauchy-Riemann relations. 
Example 1.6 Check whether the following complex function is analytic: 
f(y=a2 (1.139) 
Solution: Expansion of the square term gives 
f(2=(xt py =x +2ixy—y? =utiy (1.140) 
Thus, u and v are 
u=x?-y*, v=2xy (1.141) 
Substitution of (1.141) into (1.138) gives 
OF ag, a gy (1.142) 
Ox Oy Ox Oy 
Thus, (1.139) is analytic or the differentiation of (1.139) exists. 
Example 1.7 Check whether the following complex function is analytic: 
f(z)=F° (1.143) 
Solution: Expansion of (1.143) gives 
f(2) =(x-iyy =x? -2ixy-y’ =utiy (1.144) 


Comparing the real and imaginary parts, we obtain 
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u=x°-y*, v=—2xy (1.145) 
Substitution of (1.145) into (1.138) gives 
Oe i ie SV thy aay (1.146) 
ox oy ox oy 


Thus, (1.143) is not analytic. 


Without going into details, we simply claim here that all functions involving only z 
are analytic, whereas any function containing the conjugate of z is not analytic. In 
addition, if uw and v depends on Im(z) or Re(z), we can show that its resulting f is 
not analytic (see Problem 1.11). 


1.7.2 Liouville Theorem 


Suppose that for all z in the entire complex plane: (i) f(z) is analytic and (11) f(z) is 
bounded (i.e., | Kz) | <M for some constant M). Then, f(z) must be a constant. In 
layman’s language, we can rephrase this as “if f(z) is analytic and bounded 
everywhere, f(z) is a constant.” This theorem was actually derived by Cauchy in 
1844. The term “Liouville theorem” was coined by Borchardt in 1880, who learned 
about this theorem in one of Liouville’s lectures in 1847 and unfortunately named it 
after Liouville. Some mathematicians called it the Cauchy-Liouville theorem. 
There are 16 concepts and theorems associated with Cauchy’s name, and this is the 
most among mathematicians (see biography at the back of this book). 


1.7.3 Cauchy-Goursat Theorem 
We now review the most important formula for complex variable—the Cauchy 


integral formula. First, let us consider an associated result called Green’s lemma or 
Green’s theorem in the plane. Figure 1.10(a) shows a plane domain R with 


boundary C. 
Green’s lemma can be stated as 
[[(2-Sraray =¢ (Pax + Ody) (1.147) 
i ox Oy C 


To prove this, we first consider the left hand side of (1.147). As shown in Figure 
1.10, the lowest point of the curve C is yi, whereas the highest point of C is y2 and 
the arcs 123 and 143 are single value functions represented by gi(y) and g2(y). The 
first term on the left of (1.147) can be reduced to 
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x 
Figure 1.10 A plane region and its boundary 
HNO 
J [ Qoavay = J e” Saacdy = J (Ole. tei I} 4 
= J Oteoy14--[ Ole nid (1.148) 
= [- Ogr(y)]dy + | " Ole (v)ldy = f Ody 
yi V2 c 
Similarly, the a term can be simplified as 
J [ Qeteay = [fr Seay = [PLAC PLC} 
fix) Oy x 
(1.149) 


= [ i PL fy(x)ldx— ) i PUf,(x)]ax = -$, Pdx 


Combining (1.148) and (1.149) gives (1.147). 
Now consider the closed contour integration of an analytic function 
f f (Zaz =p (u + iv)(de + idy) = f (udx —vdy) + ich (vdx+udy) (1.150) 
€ c c c 


Application of Green’s lemma given in (1.147) to (1. ne yields 


p.s@de ia = ao dy+ill eS 5 (1.151) 


By virtue of the Cauchy-Riemann relation, we arrive at the well-known Cauchy 
integral theorem as: 


p. f(2az=0 (1.152) 


which is true when f(z) is continuous. Figure 1.10(b) shows the modification of the 
proof to a multi-connected region. We first construct a cross-cut (dotted line in 
Figure 1.10(b)) such that the domain becomes simply connected as Figure 1.10(a). 
The closed contour integral of an analytic function becomes 
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f f(z)dz =0 (1.153) 
C+C3+C,+C, 


However, line integrals for curves C3 and C4 are the negative of each other. That is, 
we have 


} f(2dz =- I f(2dz (1.154) 
C; CG, 
Thus, (1.152) is reduced to 


{p 
p. + 


f(2)dz =f f(2)dz =0 (1.155) 
1+C, Cc 


where contour C is interpreted as a complete boundary of R, consisting of C; and 
C2, traversed in the sense that an observer walking on the boundary always has the 
region R on his left. As shown by Spiegel (1964), this proof had been extended by 
Goursat to the general situation that no Cauchy-Riemann relation is needed (i.e., 
the condition that f(z) is continuous is relaxed). This is the Cauchy-Goursat 
theorem. 


1.7.4 Cauchy Integral Formula 


If f(z) is analytic within and on the boundary C of a simply connected region R 
whose boundary C is sectionally smooth and if zo is any point in the interior of R, 
then 
f)=— > 122 (1.156) 
2mi IC z—-Zy 
where C is integrated in positive sense. 

To prove this Cauchy integral formula, we note first that /(z)z-zo) is analytic 
everywhere in R except at point zo. Thus, similar to Figure 1.10, we can construct a 
circle centered at zo with radius p (see Figure 1.11). Then, a cross-cut can be made 
to make it simply connected. Eventually, following the argument used in the last 
section or applying the Cauchy-Goursat theorem given in the last section, we obtain 


Figure 1.11 An analytic region for f(z) containing a point zo 
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f £02 _f [OX (1.157) 
Jc Z—Zo I Z—-Zo 
with p > 0. Note also that we have set both C and J’ contours counter-clockwise, 
which is different from that of Figure 1.10(b). On the circular contour 7} the 
following change of variable can be applied 


z—z) =pe, dz=ipe’d0 (1.158) 
The right hand side of (1.157) becomes 
f f(z)az _ {" f (zo + pe”) peidO _ 


0 pe’ 


2n 7 
lim | f (Zo + pe” Vid 
iT Z—Zo p00 


(1.159) 
2n . ; 
= f(2)| "id = 22f (2) 
Substitution of (1.159) into (1.157), we obtain the Cauchy integral formula 
flm)=—f £42 (1.160) 


2ni JC z—2Zy 
The proof is now completed. 
Equation (1.156) is actually the zero order of the more general Cauchy 
Integral formula, which is given by 


| 

fp) = faye (1.161) 
2mi 4C ( Z—Zo ) 

Mathematical induction will be used to prove this general formula. The zero order 

of (1.161) has just been proved. 7 pies that (1.161) is valid for n = k that 


f= =f. a (1.162) 


+1 


Taking differentiation of (1.162) one more we obtain 
£© (+ beg) - fF © (20) 
Azo 


al: i a ab _ fa f Sf (z)dz 1.163 
Emo] f, oe k+l ae a ( ) 


fF?" @ j= tim 
Az)—>0 


! 
ats samara ae, 
The square bracket term inside the integration sign can be simplified as 
ee 1 1 _ (2%) “(2-25 ~heo 
(z=25-Az) (z-z))"" (z- Zo - Azy)"* "(z-2Z 
(2—29)""{1-(1- dey (2-20) "| 
 (z=2)-Azy A (2=2))" 


Then, we can apply binomial theorem discussed in Section 1.2 to get 


ao 


yo 
(1.164) 
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(Azy)” K(k +1) 


(2-29) 2 


L= : aqii-ll za (k+1)+ 
(z—zy —Azy) z~ 29 
Back substitution of (1.165) into (1.163) we have 


kl. k+l 
fOEVG j=— lim (Pp “z) dz! 
O° Oni Az) 0 oF ) age 


ral (1.165) 


O(Azy)] (1.166) 


Finally, we have 
(k+1) (k+I! g _ f(z)dz _ 
GN tae 


Therefore, (1.161) is also true for n = k+1 if it is true for n = k. The proof is 
completed. 

The Cauchy integral formula is remarkable because it shows that if a complex 
function f(z) is known on the simple closed curve C, then the value of the function 
and all its derivatives can be found at all points inside C. It also implied that if a 
function has a first derivative (i.e., analytic) in a region R, all its higher derivatives 
exist in R. This is a major difference between a complex variable and real variable. 


(1.167) 


1.7.5 Residue Theorem 


Finally, we will review the most powerful technique in complex variables—the 
residue theorem. Let us first consider the Laurent theorem about a singular point a 
(i.e., the function is not analytic at point a). If f(z) is analytic inside and on the 
boundary of the ring-shaped region R bounded by two concentric circles C; and Cz 
with center at a and respective radii 7 and 72, then for all z in R 


f(2= = re Cons ee + dy +a,(z—a) +.a)(z—a)y* 
is (z-a) (z-a) (1.168) 
= >. a,(z—-a)* 
k=—00 
where 
ay a IG) gy k= Opel, 22... (1.169) 


2ni Sc(¢ —a)" 

The negative power terms of (z-«) are the principal part, from which the singularity 
comes, and the positive power terms of (z-a) are the analytic parts of the 
expansion. If the principal parts have infinite terms, the point a is an essential 
singularity. Note also that Taylor series expansion contains only positive power, 
and hence it is only a special case of the Laurent expansion. 

We will not provide a rigorous proof of the Laurent theorem, but we only 
sketch the main ideas. Applying the Cauchy integral formula given in the last 
section, we can write 


fo-—§ 2M wt Mw (1.170) 


) ay 
2ni Jo, (w- al 2ni JC, (w-z) 
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Figure 1.12 Annular region around a point a 


First, we can expand 1/(w-z) as 


1 1 
w-z (w-a)[l—(z-a)/(w—a)] 
St ga Sy, ae eS (1.171) 
(w-a) w-a w-a w-a 
— lo, z-a (za)! (27 4yn 1, 
(wa) (w-a) 7 (w-a)" wae won 


We have applied Taylor series expansion in obtaining the result in (1.171). 
Similarly, we can also expand —1/(w-z) as 


1 I 
w-z (z-a)[l-(w-a)/(z-a)] 
2! i ee a (1.172) 
(z-a) z—-a z—-a zZ—-a 
_ lo, wea, (w-a)"! Waa aa 
(z-a) (z-a) 7 (z-a)" Z-ad z—a 
Substitution of (1.171) into the first term on the right of (1.170) gives 
1 fp fm) #2 
ta teGea t a,(z a) Pvt a, 4(Z a) | | U,, (1.173) 
where 
ager ape Ad elf oer oay (1.174) 


tani Sc, (waar 2ni JC wa w-a 
with k = 1,2,....—1. Substitution of (1.172) into the second term on the right of 
(1.170) gives 

1 fw) a_y a 


— dw a1 4 _F2 4 Sn sy, (1.175) 
Qni IC, w-z zZ-a (z-a) (z-a)" 


where 
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a__, = 


sah cro ron, Yas af Cy LOR 1.176) 
2ni 2mi JC, z-a Z- 

with & = 1,2,...n. a n — ©, both U, and V, can be shown ae Zero 
provided the value of the line integral \cflz)dz i is bounded (Spiegel, 1964). This 
completes the proof. 

We are now ready to consider the residue theorem. Let f(z) be single valued 
and analytic inside and on a circle C except at the point z = a, which is chosen as 
the center of C (i.e., a is a singular point or so-called a pole). We now consider the 
closed line integral of f(z) which is given in (1.168) in Laurent series: 


p., See= oP. (z- ade ra if oa (1.177) 


We drop all positive power ak because all ne power terms are analytic and 
by the Cauchy-Goursat theorem, the closed contour integral must be zero. The 
remaining job is to evaluate the negative powers. Use the result of the Cauchy- 
Goursat theorem again or refer to (1.157); we have 
p au =f s (1.178) 
C (z-a)" r (z-a)" 
where the contour /“is given in Figure 1.11. When n = 1, we can again adopt the 
change of variables given in (1.158) and obtain 
2n id; 2a 
f id = ea ee = =| idO = 2ni (1.179) 


Cz—Z Tz-zy 40 pe 9 


For n > 1, we use the same change of variables given in (1.158) and obtain 


f dz _ f dz _ 2x peid = 1 ["emeao 

Cc (z—zy)" T(z—zy)" 0 pren? eo (1.180) 

= 1 — [e277 =i] a) 
(l-n)p 
Substitution of (1.179) and (1.180) into (1.177) gives 
= dz 

DY f(2gdz=) a_,~ mm =2z7ia_ (1.181) 
pf 2, p. (z- a)" : 


where a-, is called the residue of f(z) at z = a. 
Thus, we can find the residue as 


a_, = lim(z-a) f(z) (1.182) 

When z = a is a singularity of order 4, we find the residue by the following formula 
k-l 

a ~a) f(z)} (1.183) 


= fF f 
ee (k-1)! dz*4 @ 
The formula can readily be verified by the Cauchy integral formula. Let us consider 
that a function f(z) has a pole of order m at z = a, then we can write it as 

F(z 
fe)= 7 @ 


(1.184) 
(z —a)" 
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where F(z) is analytic inside and on C, and f(a) # 0. Then by Cauchy’s integral 
formula given in (1.161): 

L Lop F(a FF %V\(a 

—f [(2az = fh ( ) = ( ) 

2mi JC 2ni JC (z—a)" (m-1)! 


m-1 


(1.185) 


lim pig e- 9" FO} 


Combining (1.185) and (1.181) gives (1.183) and this completes the proof. If there 
is more than one singular point within contour C as shown in Figure 1.13, we have 


psd Iria (Aba dey Fad (1.186) 


The residue theorem will be used later in Section 1.7.7. 


Figure 1.13 Annular region around points a, b, and c. 


1.7.6 Branch Point and Branch Cut 


One major problem with complex analysis is the multi-value properties of complex 
functions around a certain point, which is called a branch point. Let us consider the 
function that 


w=zll", z= pe® (1.187) 


where n is a positive integer. Use the polar form given in the second equation of 
(1.187) we have 


wep e (1.188) 

If we go around the origin starting from Point A in Figure 1.14, following a 
complete counterclockwise circuit, we have 

we pinelOr22)/n _ lin gi8ln gidxin (1.189) 


which is different from (1.188). Thus, we end up with a different value for the same 
Point A. However, if we go around the origin 1 times, we should obtain the same 
value of w as in (1.188). We can describe one branch of the multi-valued function 
z'" for 0 < 6 < 27 as long as we do not exceed 27, and similarly we can find the 
other branches of the solution for larger values of @. Grouping all the answers, we 
obtain n branches of solution 
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Ja 


y 


z— plane 


Figure 1.14 A function of w =z!" 


O<z<2n7, 2n<2<A4nz,...,(4n—-2)a <z<4nz (1.190) 


Each branch is single valued, but to keep the function single valued, we must set up 
an artificial barrier that joins the origin and infinity. This artificial barrier is called 
a branch line or a branch cut, and this barrier should not be crossed for each branch 
of solutions. For this case, the origin, for which a multi-valued function appears 
when we go around this point a complete circuit, is called a branch point. As 
illustrated in Figure 1.14, we have chosen the branch cut along the positive x-axis, 
but actually any other branch cut connecting the origin and infinity along any 
orientation can be selected. 


1.7.7 Titchmarsh’s Contour Integral 


In this section, we consider the following integral: 


o yp-l 
I a a ee (1.191) 
0 1l+x sin px 


To get this result, we will first generalize the integral in (1.191) to a complex 
variable with the contour integral shown in Figure 1.15: 


zPt 
f dz, 0<p<l (1.192) 
cl+z 


The contour shown in Figure 1.15 was proposed by Titchmarsh. The contour 
ABCDEGHAA is a closed integral containing the pole or singular point at z= —1. 

On the numerator, z’~! is a multi-valued function, as we discussed in the last 
section. That is, when we make a circuit going around the origin, the argument will 
increase by an amount of 27 such that the value of 2?! changes its value. Thus, the 
branch point is at z = 0. By using the residue theorem, the residue of the integral in 
(1.192) is 


ze 


_ (er = ePDai (1.193) 
+2 


lim (1+ z) 
z>-1 
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On the left hand side of (1.192), we can divide the contour path given in Figure 
1.15 into 


(pla 
> =} ..4 wet ft |= 2c? 1.194 
$ aE dz } | I 2nie ( ) 
AB BCDEG = GH ~~ WA 
We have substituted (1.193) into (1.192) to get the last of (1.194). The contours 
given in (1.194) can be written as 


il xP _ i (Re!?)? 1 iR? dd A ik (xe? P| dx . ii (re? ir!’ do 
r lt+x 0 1+Re” R 1+xe?™ 22 = 1+ rel? 

in which we have used dz=iR'd@ and z = xe*™ along GH (i.e., 8 = 2x along GH). 
Note that the Titchmarsh contour excludes the branch point and a branch cut is 
formed along the positive x-axis. As long as the pole z = —1 is kept within the 
closed contour ABCDEGHJA, we can deform the closed contour arbitrarily such 


that the integral remains the same. Consequently, we can let r > 0 and R > o such 
that 


(1.195) 


dt (Roi \P-l- pid 
I Rey Rd? 50 as R>0,p<1 (1.196) 
0 1+Re’ 
0 (ppiO\p-1 4.10 
| ed _ Ge hs + SO (1.197) 
22 | + re’ 


Thus, (1.194) becomes 
a xP 0 e2t(PDi Play 
J 0 14+x J 
Or equivalently, we have 


=2nie?" (1.198) 


oo l+x 


Figure 1.15 Titchmarsh contour for integral given in (1.194) 
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1 
(l agri, aes — Iie’ (1.199) 
0 1+x 
Rearranging (1.199), we get 
00 Pol 2nie'? V™ 2ni 2ni 
J, jaa k= (et D) = oP-Dai _ gx (p-Di = [ete Pte tigtriy (1.200) 


= 2ni _ 
ett — eo Pm sin pT 
Finally, we complete the proof for (1.191). This formula will be used later relating 
to the discussion of the gamma function in Chapter 4. 


1.8 FRULLANI-CAUCHY INTEGRAL 


The following integral was first mentioned in a letter by Italian mathematician 
Frullani in 1821 and was later published in 1828 


i‘ JOT) 5 Heys” (1.201) 
0 Xx a 


This so-called Frullani’s integral is, however, exact for the case that (0c) vanishes. 
Cauchy in 1823 and again in 1827 provided the following exact form as: 


[PP a= - foo} (1.202) 


provided that the derivative of f(x) is continuous and the integral converges. The 
integral given in (1.202) is commonly known as the Frullani-Cauchy integral. In a 
sense, the Frullani integral given in (1.201) is correct only for the special case that 
fiw) = 0. We will see in Section 1.10 that there is a more general integral than 
(1.202) called Ramanujan’s integral theorem. 
Consider a special case that f(x) = e* and a = 1, we have an integral 
representation of In(x): 
ops — 9 XS 
} ea ee (1.203) 
0 ¢ 
This result could be employed later to derive the integral representation of the 
digamma function in Chapter 4. 


1.9 RAMANUJAN MASTER THEOREM 


The Ramanujan master theorem is considered in this section and this result could 
be useful to derive the integral representation for the digamma function. Although 
nearly all mathematicians should have heard of the legendary story of Ramanujan 
and the re-discovery of his lost notebook by George Andrews in 1976 in one of the 
boxes left behind by G.N. Watson. The coverage of Ramanujan’s discovery in 
mathematics textbooks, especially in engineering mathematics, is non-existent. In 
this section, we will show how his so-called master theorem leads to down-to-earth 
results that we have used regularly in engineering mathematics. Ramanujan was a 
self-taught Indian mathematical genius, and his story started with his 
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correspondence with Hardy. Upon invitation of Professor G.H. Hardy of 
Cambridge University, Ramanujan spent five years in England before he returned 
to India in 1919. Unfortunately, he passed away in 1920 at the age of thirty-two 
(Kanigel, 1991). Ramanujan’s brief biography is given at the end of this book. 
Ramanujan also published a number of formulas for generating accurate results for 
m. Some of these amazing formulas are reported in Appendix F. 

Ramanujan considered the following improper integral (Berndt, 1985): 


ls } "xR x) (1.204) 
0 


where n is not necessarily an integer. Ramanujan asserted that this integral J can be 
evaluated if F(x) is assumed to be expandable in the Maclaurin series. In particular, 
it is assumed that 


ce k 
F(x) = = (1.205) 


where (k) is the coefficient of the Maclaurin series expansion of F(x). Then, 
Ramanujan found that 

T=T(n)g(-n) (1.206) 
where I(x) is the gamma function (Abramowitz and Stegun, 1964). Conversely, if 
the integral is known, the coefficients of the Maclaurin series can be found. This 
was called Ramanujan’s master theorem by Berndt (1985). The proof of this 
theorem involves our knowledge of the gamma function and Laplace transform, 
and this knowledge is yet to be covered in this chapter. Thus, it will not be given 
here but is reported in Appendix Cl. Readers can refer to the proof in the 
Appendix after we introduce the gamma function and Laplace transform in later 
chapters. Ramanujan’s master theorem was reported in the first quarterly report of 
Ramanujan submitted to Presidence College in Madras, which admitted Ramanujan 
upon the recommendation of Hardy. It was published before Ramanujan’s 5-year 
visit to England. 


1.10 RAMANUJAN INTEGRAL THEOREM 


A natural consequence of Ramanujan’s master theorem is Ramanujan’s integral 
theorem, which can be obtained for the limiting case of nm —> 0 in (1.204). In 
particular, Ramanujan’s integral theorem states that 


[L@=ee) x= tf) ree ae me) : (1.207) 


provided that /(0) = g(0) and (00) = g() with a, b > 0. In addition, functions u and 
v are defined as the coefficients of the Maclaurin series of functions fand g as: 


f(x)- fle) = ae  g)-2(~)= ae aa 4.208) 


This is a power Seen of Frullani’s integral me given in (1.201). The 
proof of (1.207) is given in Appendix C2. For the special case that f= g, we have 
the last term in (1.207) vanishing and giving 
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[ tf (ax) — f(bx)} 
0 x 


de = (£0) fCe)} (2) (A.209) 


which is the Frullani-Cauchy integral given in (1.202). Thus, Frullani-Cauchy’s 
integral is a special case of Ramanujan’s integral theorem. Ramanujan’s integral 
theorem is a highly original and powerful result. Here is an example. 


Example 1.8 Prove the following infinite integration formula of the Bessel function 
by Weber (see p. 391 Watson, 1944) by using Ramanujan’s master theorem: 


1 
- he) 
| S10 a (1.210) 
one Qing — see) 


Solution: By using the definition of the Bessel function (Watson, 1944), the left 
hand side can be written as: 


pte = —v+u-l1 (-1)* t 2k+v 
[= [,' J, (t)dt = [ve > nee) dt (1.211) 


By applying a change of variable ¢ = y% to (1.211), we obtain: 


iy 
T=| yee > C one 1.212 
, - Ene +Ea 7 ( ) 


In order to apply ane s master theorem, we set a =1/2 and obtain 


(-y)* WC y 
j= [,» >) = Com [o» ne peas HAE hy (1013) 


Lare+kD 
where 
1 
k)= (1.214) 
alt) BT ek 41) 
Ramanujan’s master theorem gives 
7 
ce 
S ys Hy _# 2 
I= J, » urs y =I 1.215 
> (SoD (1.215) 


pam allie Cee . +1) 


This completes the proof of the Weber infinite integral for the Bessel function. 


1.11 CIRCULAR FUNCTIONS 


Trigonometric functions are also known as circular functions. As we all learned in 
high school, sine, cosine, and tangent are ratios of the sides of a right angle 
triangle. In this chapter, we will look at sine and cosine from the viewpoint of 
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differential equations. More specifically, let us consider the first differential 
equation of this book: 


@ 
Sth y=0 (1.216) 


Let us assume that the solution of y(x) can be expressed in exponent function with 
an unknown constant a: 


y(x) =e (1.217) 
Substitution of (1.217) in (1.216), we get the so-called characteristics equation for 
the differential equation: 


(a? +A*)e™ =0 (1.218) 
The roots for o are 
a=tis (1.219) 
Thus, the solution for y(x) becomes 
y(x) =Ce* +Ce (1.220) 


Note that the constant C; is complex and since (1.216) is real, we have y(x) real. 
Recalling that a complex number adding to its complex conjugate is real, we must 
set the unknown constant for the second term as the complex conjugate of Ci. 
Applying the Euler formula, we have 


e’** =cosx+isinx, e'** =cosx—isinx (1.221) 
Substitution of (1.221) into (1.220) leads to the standard solution form for (1.216): 
y(x) = D, cos Ax + D, sin Ax (1.222) 


The validity of (1.222) can be shown easily by substituting (1.222) into (1.216). 
The unknown constants D; and D2 need to be determined by appropriate boundary 
conditions of (x). The reader should bear in mind the solution form of (1.222) 
because we will encounter this differential equation many more times in this book. 
Sine and cosine are the only functions for which their second derivatives are equal 
to the negative values of themselves. 


1.12 HYPERBOLIC FUNCTIONS 


Hyperbolic functions appear naturally in our daily lives. The shape of a hanging 
chain, a spider web, or a hanging power line (catenary problem solved by 
Huygens), optimum shapes of arches (like the Gateway Arch at St. Louis, Missouri, 
USA or the largest vault Taq-i Kisra in Iraq), and the optimum shape of a soap 
sheet between two rings (catenoid problem solved by Euler). Apparently, it links 
naturally to the deformed shape of a flexible body under gravity. It also appears 
naturally in the solutions of waves in solids, the motion of falling objects with air 
resistance, and the solution of the temperature distribution in cooling solids. Its 
importance in physics, engineering, and applied mathematics should not be 
undermined. They are also regarded as the most commonly encountered elementary 
functions, probably only second to circular functions (e.g., sine, cosine etc.). Even 
the “M” logo of McDonald’s is also a double-inverted catenary. By definition, 
hyperbolic functions are defined as 
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x 77 =X x ne —xX x _ =X 
sinh x = © . ; coshx=2 7" tanh x = © = (1.223) 
2 2 e+e~* 
inh 1 1 h 
tanh x = noms , eschx=— , sechx= , cothx so (1.224) 
cosh x sinh x cosh x sinh x 


The close resemblance of circular and hyperbolic functions is obvious. These 
functions were first defined by J.H. Lambert, who proved the irrationality of x and 
was known to contribute to map projection and stereonet projection related to his 
works on cosmology. A brief biography is given at the end of this book. It is 
straightforward to show that 


sinh(—x) =—sinhx, cosh(—x) = cosh x (1.225) 

cosh? x—sinh? x = iF 1-tanh? x = sech*x, coth? x—csch?x =1 (1.226) 
sinh(x + y) = sinh xcosh y+ cosh xsinh y (1.227) 

cosh(x + y) = cosh xcosh y+sinh xsinh y (1.228) 


These will be left as exercises for the readers. 

Physically, the argument of the circular functions equals twice the gray area 
of the unit circle on the left of Figure 1.16, whereas the argument of the hyperbolic 
functions equals twice the gray area on the right of Figure 1.16. The curves of the 
circle and hyperbola are given respectively by: 


P+y=1, xy =1 (1.229) 
The gray area of the unit circle is clearly proportional to 8 or 
Ay x(I)° : -5 (1.230) 


The proof for the argument of the sete functions is less obvious. We can 
rewrite the gray area as the difference of AOPB and sector APB. The area of 
triangle OPB is clearly 


1 . 
Aopp = 4 oan? (1.231) 
The area of the curved sector can be integrated as 
cosh@ cosh@ 2 
dass =[ ydx =| x? —lde (1.232) 
1 1 
We can make a change of variables x = cosh u and obtain 


0 0 
Aipp= } Vcosh? u —1sinhudu = | sinh? udu (1.233) 
0 0 


By virtue of the ee of (1. oes ae ae, 
9 et — =e" 8 (ee 2u-1 
Asp =] (S ee ee Mu = f° EMH ay 


Tso 8 
4 2 


(1.234) 


by noting from (1.231) that 


l it 2 
Aopp = cosh Osinh 6 = ste’ +e "ye? —e") = : (27? 69) = inh 26 (1.235) 
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Figure 1.16 Physical meaning of the argument of circular and hyperbolic functions 


This can also be obtained from (1.227). 
Therefore, we finally have 


1 
Aops ~ Aare = oo (1.236) 


Again, the gray area equals half of the argument. 
We here only report some of the commonly used formulas of differentiation 
of hyperbolic functions without proof: 


# cited 9 eolinedihs (1.237) 
dx dx 
d 2 2 
—tanhx=sech*x, —cothx =—csch*x (1.238) 
dx dx 
d d 
—sechx =-—sechx tanhx, —cschx =-—csch xcothx (1.239) 
dx dx 
| 1 d = +] 
—sinh x= » —cosh x= (1.240) 
as Vl4x? & Vx? -1 


where the plus sign is for cosh"!x > 0 and minus when cosh"!x <0. In addition, we 
have 


a |x| >1 (1.241) 
x -1 


(1.242) 


where the minus sign is for sech"'x > 0 and plus when sech"!x <0. Some useful 
formulas of integration are 
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| a = sinh }—+C 
a’ +u? o 
du -1 
} =cosh #40 (1.243) 
ua a 
1 
-tanh {4 ifu’ <a’ 
f du a a 
Pane 1 1u Pe eae: 
—coth —+C ifu’>a 
a a 
d -1 _ 
| = = gech! Soc O0<u<a 
= a a 
“ — “ (1.244) 
u = -1 
[4 = Ssh —|+C u#0 
uju-t+ae 4 
The series expansions of sinhx and coshx are 
a 2 14 
sinhx=x+~-42~-4.., coshx =14+~—~4~ 4... (1.245) 
3! «5! 2! A! 


These series expansions can be obtained by Taylor series expansions, which will be 


considered in Section 1.13.2. 


Equation (1.237) suggests that the general solutions of the following second 


order ordinary differential equation are sinh and cosh functions: 


(1.246) 


Let us assume that the solution of y(x) can be expressed in an exponential function 


with an unknown constant a: 
y(x) =e 


(1.247) 


Substitution of (1.247) in (1.246) gives the so-called characteristics equation for 


the differential equation: 


(a? —A?)e** =0 (1.248) 
The roots for @ are 
a=tA (1.249) 
Thus, the solution for y(x) becomes 
y(x) = Cie +Cye* (1.250) 
Note from (1.223) that 
e* =coshAx+sinhAx, e 7* =coshAx-—sinh Ax (1.251) 


Substitution of (1.251) into (1.250) leads to the standard solution form of (1.246): 


y(x) = D, sinh Ax + D, cosh Ax 


(1.252) 


The unknown constants D; and D2 need to be determined by appropriate boundary 
conditions of y(x). This is the second most commonly encountered ODE in this 
book. Readers should familiar themselves with this differential equation. 
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1.13 SERIES EXPANSIONS 


In various areas of applied mathematics, functions are often expanded around 
certain points and their behaviors are considered around these selected points. 
Asymptotic expansion and perturbation are typical examples of series expansion 
(see Chapter 12). In this section, we will consider series expansion for some well- 
behaved functions. 


1.13.1 Darboux’s Formula 


We first consider the most general series expansion of a function, the so-called 
Darboux’s formula. According to Whittaker and Watson (1927), this formula was 
derived by Darboux in 1876, who was an expert in applying differential geometry 
to differential equations (see biography). Darboux’s formula relies on the following 
identity of the differentiation of a product of a power series of (z—a) of order m, a 
polynomial (t) of degree n, and an analytic function f(t) at all points from a to z 


~ {(-” (z _ a)” go) (t) fa + t(z = a)]} 


m=1 


(1.253) 
=—(z-a)p ()f Tat e(z—a)]+(-1)"(z- a)" GOS [a t1(z-a)] 

where the superscript bracket “(m)” indicates m-th differentiation of the function. 

This identity is remarkable in the sense that the differentiation of a series of n-terms 


will result in only two terms on the right hand side. To demonstrate the validity of 
(1.253), let us consider 1 = 1 such that the left hand side is 


“(-@ — a) p(t) fla +t(z—a)]} 


=—(z—a)$'(t) f'[a+t(z—a)|-(2-a) (fla + (z-a)] 
The right hand side of (1.253) is 
RHS =-(z-a)¢'(t) f'[a+t(z-a)]-(z- ay g(t) f"[a+t(z-a)] (1.255) 
Thus, the validity for n = 1 is demonstrated. For n = 2, the left hand side of (1.253) 
is 


(1.254) 


£{-@-aeWfla+te-a))+(2-a) Of" ta +12 ~a)| 


=~(z-a)p"(t) f [at t(z-a@)]- (2-4) #' OS a+ t(z-a)] (1.256) 
Hz-ay POS "lat uz -a)]+(z-ay POS [a +1 -a)] 
=-(z-a)p"(O)f la tt(z-a)]+(z-ay (OF [a+ 4 -a)] 
Note that the middle two terms cancel out one and another, and thus only two terms 
were left and they are equal to the right hand side of (1.253). Thus, the validity of 


(1.253) for n = 2 is demonstrated. This procedure can be easily extended to n = k; 
we have the left hand side of (1.253) as 
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“He —a)p" (1) f [a+ t(z -a)]+(z—a)’ 6% (1) f"[a + t(z—a)] +... 


+(-1)*"(2-a) gO DFE [a+ (z-a)] 

+(-I)(z-a) pf [a+t(z—a)]} 

=-(z-a)6 ) f a+ t(z-a)]-(z-a) 6S [a+ Hz -a)] 

+(z-a) gO f"[at t(z—a)] + (z—-ay po" &) fa + t(z —a)] +... 

+(-1)*"(z-a)F 1 gO Ms [a+t(z-a)] 

(“Da PO OFOla+tz-a)] 

+I (2-4) GO DFO la +z —a)]+ (Yh (2 - a HOSE Lat t(z—a)] 

=(2-a)p() f [a + (2 -a)] +(I (2 -a) OO SO [a+ (z—-a)] 

(1.257) 

Note that all intermediate terms will cancel one another except for the first and the 
last terms. The last of (1.257) is the right hand side of (1.253). Thus, the validity 


for n = k is demonstrated. 
We now rearrange (1.253) to get 


(z-a)b () f [a+ t(z-a)] 
“ Da i" (z a)” g(t) f ™ [a ra t(z —a)]} (1.258) 


m=| 


+(-1)"(z-a)" Of"? [a +t(z-a)] 


Recall that ¢(2) is a polynomial of degree n, and we have 


G (= dO) =c (1.259) 
where c is a constant. Note that 
(e-a)fla+te-a))=5 flare) (1.260) 


We then integrate (1.258) from 0 to | to get 


n | d | d . m m n-m m 
MO) hee ae= | CO Gee MOP laste-ala 
HD e-ay | go fr[a +i a) 


(1.261) 
The first integral on the right hand side of (1.261) can be evaluated by reversing the 
order of integration and summation such that 
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[2d {cvre-ay" om Oa +u(z—aylt 
odt m=l 


, (1.262) 
= Vey te- ay" GoM DF (D-6" OF} 


m=1 
Substitution of (1.262) into the first integral on the right hand side of (1.261) and 
reversing of the order of integration and summation gives 


OE OLFE)-—F@}= YY" =a)" GOMOD ()- GO DF} 


m=1 
HD e-ay" | go fr"P[a + za) 


(1.263) 
This is Darboux’s formula of the series expansion of function f(z). 


1.13.2 Taylor Series Expansion 
Two different proofs for Taylor series expansion formula are given here. 


Proof | 
We now consider a special case of Darboux’s formula given in the last 
section that ¢(f) = (t-1)”. Consequently, we have 


GQ) =n-1)"", 
g2) ()= n(n—l\(t-1)"?, (1.264) 
b(t) = 9 (0) =n! 
go” (t) = n..(m+I\(t-1)”, 
gr" (0) = n(n—1)..m-+ 1-1)" =", (1.265) 
Mm. 


g"™ (1) =( 
Substitution of (1.264) and (1.265) into (1.263) results in 


al{f(z)- f(a)} = yi y@-ay= us yn" f(a) 
m=1 (1.266) 


+D"E=a)""[ OF Mla tea) 


Rearranging (1.266) gives 
(m) 
f= 1+ Oe - a" 
m=l (1.267) 
o"e-o" fone teen ait 
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Now consider the case that (f) is an infinite series or n oo. Since fis an analytic 
function, all its higher derivatives exist such that f"*?[at+d(z—a)] is finite. The term 
(-1)"(z-a)""! is an oscillating function and remains finite compared to n > 
Thus, the last term will approach zero as n > «. We finally obtain 


f(2)= (a+ @ \z-ay" (1.268) 
m=1 
This is the Taylor series expansion which was derived by Brook Taylor in 1715, 
which is a special case of Darboux’s formula of function expansion. Note that the 
last integral in (1.267) actually gives the error in the Taylor series expansion if a 
finite number of terms is used. 


Proof 2 

We can also prove the Taylor series expansion formula by starting with 
Cauchy’s integral formula. Recall from Section 1.7.4 that any analytic function f(z) 
can be written as 


L(G)do 
fa=s-f C O-z ee 


where C is integrated in the positive sense. We recognize that (1.269) can first be 
si as 


oe > S(o)de -—-> I(Qdg 
Oni C G-Zz 211 an ies ( 
=f I(Qde —f. 2 2-2)" fag 
2ni Jc(€-z)l-(z-z)/(E zl Oni = (¢-z,  (f-n)" 
In obtaining (1.270), we have applied the following expansion 
1 = n 
= ee 2s: (1.271) 


This expansion can be obtained as a special case of the binomial theorem by setting 
n=-—1 in (1.4). We now interchange the order of summation and integral to obtain 


f Lae » SFG) , 
f@= = E eae Lode Zo) -> a" (z-z) (1.272) 


The last of (1.279) j is obtained by employing the general Cauchy’s integral formula 
given in (1.167). This completes another proof of Taylor series expansion formula. 


1.13.3. Maclaurin Series Expansion 
When we set a = 0 in (1.268), we obtain the so-called Maclaurin series expansion: 


(m) 0 
= f+ Om (1.273) 
m=l 
This formula was actually derived by Sticting in 1717 but was published by 
Maclaurin in 1742. 
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1.13.4 Laurent’s Series Expansion 
p 


One main assumption is that we have tacitly assumed the Taylor or Maclaurin 
series expansions are analytic, and only the positive power of the expansion exists. 
A very important theorem was published in 1843 by Laurent who considered that 
the point of expansion a is not analytic and in such case, by virtue of (1.168) or 
Cauchy’s integral formula, we can expand a function fas 


f= ya, (¢-a)" + (1.274) 
n=0 n=l (z-a) 
where 
o -1 ID gy, ga 5 f a (1.275) 
271 IC (w-a)"" 2mi JC, (w=a)" 


The contours C; and C2 are given in Figure 1.17. This theorem was actually 
contained in a paper by Weierstrass in 1841, but it was not published until 1894. 
The proof of this theorem has already been given in Section 1.7.5. If the principal 
part (second summation in (1.274)) of the Laurent’s series is zero, Taylor series 
expansion is recovered. 


1.13.5 Lagrange’s Theorem 


In the previous section, we have seen that functions can be expanded in a power 
series, either by Laurent’s series or by Taylor series. A more general theorem was 
derived by Lagrange in 1770, which deals with the expansion of a function in terms 
of another function. Let both f(z) and ((z) be analytic functions and expansions can 
be done about a point a as: 

0 f” ye 
{O=s@+>, re 


n=l 


[f'(a{ea}"] (1.276) 


where 


q 


Figure 1.17 The function fis analytic within the annular ring between r; and r, considered in 
Laurent’s series expansion 
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C =a+t¢(C) (1.277) 
A special form of (1.276) is 


a ye ‘aa 46a)" (1.278) 
The proof of this theorem can be found in Whittaker and Watson (1927) and will 
not be given here. 


1.13.6 Mittag-Leffler Expansion 


Some functions can be expanded in rational functions by applying the residue 
theorem. Such expansion is known as Mittag-Leffler’s expansion, which was 
derived by Swedish mathematician G. Mittag-Leffler in 1880. Although Mittag- 
Leffler is not particularly famous among applied mathematicians and engineers, it 
is probably because of Mittag-Leffler that we do not have a Nobel Prize in 
mathematics today. This story is given in the biography section at the back of this 
book. 
Let us consider the following identity: 
Be gl (1.279) 


Z-xX Zz 2z(z-x) 


which can be shown in a straightforward manner. Next consider a function f(z) 
containing singular points at a1, a2, ... with corresponding residues 51, bz, ... and 
choose a sequence of circles C;, of radius R,, with center at the origin, such that it 
does not pass through the poles and f(z) is bounded on the circle cm. Multiplying 
(1.279) with f(z) and considering is contour integral along cm, we obtain 


Lp faz _ Lj fee. x ¢ f(@dz (1.280) 


2mit z-x = 2ni 2mi 4 z(z-x) 
Cn Cn Cn 


Applying the residue theorem to the left hand side and to the first term on the right, 
we obtain 


J z(z—x) 


m d: 
Foy = f(0)4 a f(2)dz (1.281) 


in which we have assumed that there is no ee at z = 0 for f(z). Now 
consider the limit that m — 0; we have 


f= 10+ Dut (lia (2 (1.282) 


= n Ay me QT . z(z—-x) 


m 


Noting that f(z) is analytic (1.e., f(z) is finite as R, — 0) and x is finite, we have 


PONE gas ee ide (1.283) 
z(z—-x) R 


: m 
Cn 


where O(1/Rm) implies that (1.283) is of the order of 1/R. Therefore for a function 
with infinite poles, we have 
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~ 1 
= 1.284 
fe) =f) pa rere } (1.284) 


n 


This is the so-called Mittag-Leffler expansion in terms of rational functions, which 
was derived by Mittag-Leffler in 1884. 


Example 1.9 Show the validity of the following expansion by using Mittag- 
Leffler’s expansion: 


1x 1 1 
cosecz =—+ > eine ! (1.285) 
2. zZ-nq nt 
Solution: Let us consider f(z) as 
f() _eeeee (1.286) 
Z 


By noting that cosec z =1/sin z, we have infinite poles at z= nz for n = 1,2,3,... For 
this function we can consider a circular contour on which |z|=(n+1/2)z . The 
residue at singularity z = 0 is 
haa, sim Chie ea (1.287) 
20 z>0 sinz Z z>0 =«sinz 
The last limit of (1.287) is the indeterminate form of 0/0, and thus we can apply 
L’Ho6pital’s rule to get 


by = lim, # = lim 
z->0 Qa (sin z) z->0 
dz 


ca eee 
d ee (1.288) 


Again by applying L’H6pital’s rule, the residue at singularities z = tna with n = 
1,2,3,... can be evaluated as 


jn {c We) aoe ii Aaacleceetl weet 
zonTt zsinz zon sinz+Zcosz (1.289) 
= (-1)" 
Substitution of (1.288) and (1.289) into (1.284) and by noting that /(0) = 0 gives 
1 oy 1 1 
cosecz =—+ >» eh (1.290) 
A. Z—-nq nt 


This form can be rewritten in a slightly different form by grouping the —n and +n 
terms as 


a a ae oe 2z _ dz (1.291) 
z—-nq na ztnm na (z-na\(ztna) (z?-n'n’) 


By virtue of (1.291), the first few terms of (1.290) can be written as 


1 -1 1 1 
cosecz = +22{ — yt ns ; +} (1.292) 
Zz z—-nm z°-An” 2° -9n 
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This formula agrees with that given by Spiegel (1964). 


Following a similar procedure, one can show the validity of the following 
expansions using Mittag-Leffler’s expansion (Spiegel, 1964): 


secz = 7 7 5 = al = 5 te (1.293) 
(a/2)y-z GBa/2)-z2 (2/2) -2z 


1 
tanz = 2z ae = as 7a (1.294) 
(a/2)°-z° (a/2)-z° (5m/2)°-z 

1 1 1 1 
cotz=—+2z + + +... (1.295 
Z \z ~7 7? -4n? 2? -9n? 

1 1 1 1 
eschz =—-—2z —-—.—_~ + ——_ t... (1.296 
Zz is +77 244m? 27497? 


sechz = L : — ad = Gn (1.297) 
(m7/2)+z° (30/2)? +z° (5m/2) +z 


tanh z = 2z L as = F ze (1.298) 
(7/2) +z2° (3a/2) 4+z° (50/2) +2 


1 1 1 1 
nea ree ae (1.299 
Z ree 2 +4r? 2 49772 } ) 
2 ee 
é 1 
— cothzacoth zb = (1.300) 
ae pps (m? +a°\(n? +B") 


1.13.7. Borel’s Theorem 


In this section, we introduce Borel’s theorem. If a function $(z) is expandable in an 
infinite series as 


a,2" 
#)= (1.301) 
which is called Borel’s function, then the following integral can be evaluated as 
f= | e"Wendt = 4,2" (1.302) 
0 


n=0 
This is Borel’s theorem, which can be proved easily by reversing the order of 
integration and summation and by observing the definition of the gamma function. 
In particular, we can substitute (1.301) into (1.302) and reverse the order of 
integration and summation: 
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{(D)= [, etoenar = red on ay dt = y oot [eta 
Su = x! 


(1.303) 
=> “Tin +)= > 6,2" Zz 
n=0 n=0 
where /(z) is Euler’s gamma function and is defined as 
T(n+1)= I “ett"dt (1.304) 
0 


Note that for n being an integer, we have 7(n+1) = n!. In view of the importance of 
the gamma function in applied mathematical analysis, the properties of the gamma 
function will be discussed in more detail in Chapter 4. This completes the proof of 
Borel’s theorem. Comparison of Borel’s theorem and Ramanujan’s master theorem 
given in Section 1.9 shows that there is a close resemblance between these two 
theorems. The idea behind their proofs is exactly the same. 


1.14 FUNCTIONS AS INFINITE PRODUCT 


In this section, we will employ Mittag-Leffler’s expansion formula to expand 
certain types of functions as infinite products. Let us consider that a function f(z) 
has simple zeros at points a1, a2, ..., and in addition 

lim |a,,| > 00 (1.305) 


no 
Other than this, f(z) is assumed to be analytic for all values of z. Thus, f(z) must 
also be analytic. Consequently, f(z)/ f(z) can have singularities at the points a1, a2, 
.. Use Taylor series expansion for f(z) such that 


f@)=(-a,)f"(a,)+ 22 pra). (1.306) 
Note that we have used f(a,) = 0 in sien (1.306). Then, the differentiation of 
(1.306) gives 
I'(@2=f'(a,)+(2-4,) f(a, ) +. (1.307) 
Then, we have 


LR {f'G@)+@-4,)f"G,)+.3 


IO Gay f'ta)+ 2 pq +3 me 
The residue for the pole z = a, can be ore as 
ieee 7a “a 2a aoe (1.309) 
_ (2-4, ){f'(a,)+-—= f'"(a, 03} 
Let f(0) # 0; we can apply Mittag-Leffler’s oes to f(z)/ fiz) 
£E_fO, I 
FO FO Dee a, 4, Ss 


We can integrate both sides with respect to z as 
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1 FO, pan Ea = 
le [=n fle) Fe atime a+ 7146 (1.311) 
This can be inverted as 
LOS t(e-a,e2} LO, 
f(z)=Cef™ 2 % =Cef T[e- a, )e7/% (1.312) 


n=l 
where the product function is defined as 


[[- = AA A,A4°** A, (1.313) 
n=l 
Equation (1.312) can further be simplified by grouping constant terms as 


ea LO, w LO. w 
f@)= [IIs le 


(0) Tic- je ace! | [ido je) (1.314) 
n=l ay n=l 


Note that (0) = C , and thus (13 14) can be 
LO, © 
f@)= fel Tl-z =e} (1.315) 
This formula was reported in Section 15 - Whittaker and Watson (1927) and is a 
direct consequence of Mittag-Leffler’s expansion formula. 


Example 1.10 Show the 7 of the following expansion: 


sme Elo HlavSee| (1.316) 
nn 


Solution: Let us consider 40) = sinz/z, and the zeros of f(z) are z = +nz Thus, the 
limits are 


lim f(z) = lim tin (1.317) 
z>0 Z z>0 «1 
tim #"@) = tim(—= Sine 0082 lim( = **) = in| m2 = 0 (1.318) 
z>0 Zz z—>0 2 


Grouping terms for z = ad z=+na, we obtain from (1.315) 
sinz 7 z= z= 
= |———)e"* j+—)e ™ 
4 Ze Hlan oe] 
a (1.319) 


This completes the proof. 
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1.15 VECTOR CALCULUS 


The vector calculus considered here can be considered as a special case of tensor 
calculus, which was developed by Ricci and Levi-Civita and motivated by a paper 
by Riemann in 1854 on Riemann geometry. Subsequent development on tensor 
calculus was made by Weyl, Eddington, and Schouten. Other main contributors of 
vector analysis include Gibbs, Heaviside, Féppl, Wilson, and many others (e.g., 
Crowe, 1993). 

Let us consider vector calculus using the example of a moving particle in 
space. The position vector of the particle is a function of time ¢ and is represented 
by 

r(t) = x, (fe, +x, (te, +x; (He; (1.320) 
The time derivative of this position vector is the velocity vector v 
pee eg ae, (1.321) 
dt dt dt dt 
If a particle moves along a curve C and the arc length along curve C is s, that is, r= 
r (s), we can use chain rule to rewrite the velocity vector as 
dr _drds __,ds 


dt ds dt dt 
where r’ is the unit vector along the curve C of the moving particle and ds/dt = v is 
the speed of the particle along the curve. Let us call the unit vector along v as T. 
The acceleration of the particle is 


2 2 
dv_d gy err : _ NE age : 
dt dt dt ds dt dt p dt dt 
The direction of vector T is parallel to velocity v as suggested by (1.323) and N is a 
unit vector perpendicular to 7 or v as shown in Figure 1.18. The last of (1.323) can 
be proved as 


(1.322) 


a= 


(1.323) 


= le (1.324) 


ds As As| ¢ 


where p is the radius of curvature. The change of 7 can only be in the direction of 
the magnitude of a unit vector fixed as unity. Thus, | A7|=|A®@| is the radius of the 
circle shown in Figure 1.18 is fixed at 1. Equation (1.323) can be rewritten as 


= lim 
As—>0 


= lim 
As—0 


a=—T+—N (1.325) 
Therefore, there are two components of acceleration, one along the tangential 
direction and the other along the direction perpendicular to tangential direction. The 
second term is clearly the centrifugal force of circular motion (French, 1971). 


1.15.1 Gradient 


More generally, let us consider a scalar function @ = (x1, x2, x3), which is a 
function of position, such that the total differential of it is 
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dgo= a dx, 4 ae dx, + ay dX; (1.326) 
Ox, OX Ox, 


This is a scalar, but the structural form of (1.326) suggests that it can be viewed as 
the dot product of two vectors as: 


dp=Vo-dr = e+ ae e,4 < e,)-(dxje, +dx,e, +dx,e,) (1.327) 
X3 


x; OX 
C\,T+AT 
__ AT 
oe, 
‘ V 
As 
T 
A 


Figure 1.18 The change in the direction of the unit vector along the moving path C 


where e; (i = 1,2,3) are the base vectors of a particular coordinate system for which we 
want to consider the changes of the function ¢ with respect to the variables x;. In 
(1.327), we have used the vector differential operator that is expressed as V. It is 
commonly called “del” or “nabla” and in Cartesian coordinates it can be written as 
gradd = Vd =e, ad +e, ag +e, ae (1.328) 
Ox, OX Ox; 


In physics, it is sometimes referred to as the Hamilton operator as this symbol was 
introduced by Sir Hamilton (we will talk about his Hamilton’s Principle in Section 
14.5). His brief biography is given at the back of this book. The termV¢ is also 
termed the gradient of a scalar function ¢. Physically, the gradient of a scalar can be 
interpreted as the normal to the surface defined by @(x1, x2, x3 ) = C1, as shown in 
Figure 1.19. It is also the direction of the greatest rate of change of the function ¢. 
By considering the change of ¢ along the path C, we can rewrite (1.327) as 


P(X 5X75) = C, Vo 


Figure 1.19 Physical meaning of the gradient of a scalar function ¢ 
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do dr 

—=V¢:-—=V¢:T (1.329 
ds é ds é ) 
Thus, d@/ds 1s the value of V¢@ projected along the direction of T. If the direction of 
T is along a direction perpendicular to the normal of the surface @(x1, x2, x3) = Ci, 


we have d@/ds = 0. 


1.15.2 Divergence 


Since V is a vector operator, its operation on a vector can be expressed through 
either a dot product or a cross product. Indeed, the dot product of V on a vector 
function v is called divergence (a term introduced by Heaviside) or div of v as: 


5 Oe (1.330) 
Ox, OX, OX; 

The physical meaning of divergence is best understood in terms of fluid flow. For 
example, a control volume shown in Figure 1.20 for a flowing fluid can be used to 
derive the condition of continuity. The outflow subtracts the inflow along the x2 
direction and gives: 


divv=Vev= 


O(pv. O(pv 
[Pv + MOV) ee he — pv,dx,dx, = OI) ati (1.331) 
Oxy OX, 
Similarly, the net outflow along the x; and x3 directions are 
vee dx, dxdx3, pe dx, dx dx; (1.332) 
1 


The net outflow per unit volume from all three independent directions is 
(ory) | (eva) | Ors) _ 
Ox, . Ox, Ox 
Physically, the left hand side is the amount of “diverging” fluid from the control 
volume and the left hand side is the change of density to accommodate such 
changes. By now, the physical meaning of so-called divergence is clear. If density 
of the fluid does not change, we have zero divergence or 
Vev=0 (1.334) 
Similarly, we can also consider the divergence of electric or magnetic flux through 
a control volume. 


op 
Ve(pv) = 1.333 
(py) a ( ) 


1.15.3 Curl 


The cross product between V and a vector function v can be expressed as the so- 
called curl or rot as 
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e 2) e3 


curly =Vxv= é e we eG is ea" v2 iss 
Ox, OX, 0X3) OX, Ox, Ox, = OX, 
Mi Ne M3 
(1.335) 


Physically, the curl of a rotating fluid is related to the circulation or the strength of 
a vortex. If the curl of v is zero, the flow is irrotational (i.e., no vortex can be 
formed). 


1.15.4 Physical Meaning of Gradient, Divergence and Curl 


Alternatively, we can also consider the integral form of the definitions of gradient, 
divergence, and curl as: 


=a as 1. 
de a oi 
Vev= lim a vedS (1.337) 

MW ->0 AV 

OV 

1 
one ii — ‘ 1.338 
[Vxv(r)]en 1 as vedr ( ) 


The surface dS is the surface pointing outwardly from the volume AV. The point of 
evaluation for Vo and V-v is within the interior of the small volume AV. The 
integration in (1.336) to (1.338) is carried out over a closed surface or around a 
closed path as indicated by a small circle through the integral sign. The integral 
definition of divergence given in (1.337) provides another physical meaning that 
divergence is a measure of the strength of sources or sinks within the element AV. 

The integral form of the divergence can be interpreted from Figure 1.20 again 
by defining the flow (or flux) through a control volume AV 


=| (pv) ds (1.339) 
AY 
X3 


PV, 


x 


Figure 1.20 Interpretation of divergence through the fluid flow problem 
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However, by (1.333) the total flow per unit volume through the element is 
O(py, ) 7 O(PV2) 4: O(Pv3) 


=Ve(pyv 1.340 
Ox Ox, Ox; o 
Balancing the flow per unit volume, we have 
1 
lim —— (pv) «dS =V- 1.341 
im yy (pv) (pv) (1.341) 


av 
For constant density, (1.341) is the same as (1.334). 

The physical meaning of (1.338) can be interpreted through Figure 1.21. Let 
us consider a rotating fluid element ABCD as shown in Figure 1.21. The tangential 
velocities on the sides of the element are shown. If the fluid element ABCD is 
rotating, a resultant peripheral velocity exists around the element counter- 
clockwise. However, the center of rotation is not known; it has been proposed that 
rotation be expressed in terms of the sum of the products of circulating velocity and 
distance around the contour. This sum is called “circulation” /and is defined: 


P= Gvedr (1.342) 


Taking the integration counter-clockwise as positive, we can evaluate the closed 
contour integral as the summation 


LT apcp = Vyjadx, + (vy + ae dx, )dxy —(v, + ai dX )dx, —Vv,dx> 
Ox, OX, 
je. ee (1.343) 
=( -)dx,dxy =(V xv); - (dS); 
Ox; = OXy 


This expression is true for the configuration shown in Figure 1.21 (ie., for a 
constant value of x3), but it can be readily extended to the cases of more general 
non-planar and non-rectangular contour. The circulation can be generalized to 

I =(Vxv)-(dS) =(Vxv)-ndS =(V xv)-nAS (1.344) 


Figure 1.21 The circulation for a rotating fluid ABCD 
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Note that since J" is a scalar and thus independent of the actual directions of the 
velocity and normal vector, therefore it is also independent of the choice of the 
coordinate system. Equating (1.344) and (1. — we obtain 


fine hs 1.345 
(Vxv)-n lim vedr (1.345) 


This completes the proof for (1.338). 


1.15.5 Vector Identities 


A number of identities of vector calculus will be found useful and are reported 
here. 
The two most important formulas in vector calculus are probably 
Vx(V¢) =0 (1.346) 
V-(VxA)=0 (1.347) 
Because of these identities, we can see later that any vector can be represented by 
an irrotational part and a solenoidal part. This is Helmholtz’s representation 
theorem or Helmholtz’s decomposition theorem for vectors as discussed by Chau 
(2013) and will be presented in Section 1.16. If the curl of a vector vanishes or 
mathematically 
VxA=0, (1.348) 
it is called irrotational and it has an important consequence on fluid mechanics. 
There will be no effect of viscosity. If the divergence of a vector vanishes or 
mathematically 
V-A=0, (1.349) 
it is called solenoidal (a term that came from electromagnetism on wire coil) or 
divergence-free. More discussions on this will be given in the next section. 
Some basic formulas of the time differentiation of vectors are 


4 ace) tae al (1.351) 
t 
<(aBc)= c+ AAC+ 4B (1.352) 


There are many important and useful identities related to the properties of gradient, 
curl, and divergence, and some identities in Cartesian coordinates are 


V-(VéxVy)=0 (1.353) 

V «(¢A) =(V$)*A+gVeA (1.354) 

V (Ax B) =(Vx A)e B—- Ae(V x B) (1.355) 
V x(¢A) =V¢x A+GVXxA (1.356) 
V:Vb=V'¢ (1.357) 


V"(V¢) =V(V"¢) (1.358) 
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V+(V7A)=V7(Ve A) (1.359) 
Vx(V7 A) =V7(Vx A) (1.360) 

V(A- B)=A-VB+B-VAt+ Ax(Vx B)+Bx(Vx A) (1.361) 
Vx(Vx A)=V(V+A)-V°A (1.362) 
Vigy)=Wy+wWeo (1.363) 
Vib!) =(WWo-Wy) ly" (1.364) 
V(¢") =nd" Vo (1.365) 

V> (dy) = PV" +2(VO) (Vy) +w'd (1.366) 
Ve(Vy) =e v+VOeVy (1.367) 

V? (Aer) =2V*AtreV"A (1.368) 

V? (gr) =2Vb+rv'¢ (1.369) 

V(Atr) = A+(VA)*r (1.370) 


where ¢, ware scalar functions of position x, A, B are vector functions of position 
x, and r is the position vector. There are also a number of important integral 
identities in vector calculus and some of them are 


pas =o (1.371) 
[var = fdas (1.372) 
V oV 
[wv +Ve-Vyav = ov yas (1.373) 
V oV 
[wy -w’* dav = Gov y -wd)-ds (1.374) 
V OV 
[v-4ar = [ Aas (1.375) 
V oV 
[ovx4)-ds =| 4-dr (1.376) 
S C 


Green’s first and second identities are given by (1.373) and (1.374), which will be 
discussed again in Chapter 8, whereas Gauss’s divergence theorem and the Kelvin- 
Stoke’s theorem are given in (1.375) and (1.376). They will be discussed in 
Sections 1.17 and 1.18. 


1.16 HELMHOLTZ REPRESENTATION THEOREM 


The Helmholtz representation theorem for any vector is a natural consequence of 
(1.346) and (1.347). Mathematically, it can be stated as (Chau, 2013) 
v=V¢4+VxA (1.377) 
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The first term on the right hand side of (1.377) is called the irrotational field of a 
vector, whereas the second of the right hand side of (1.377) is called the solenoidal 
field of a vector. The function ¢ is called the scalar potential of a vector, whereas 
the vector function A is called the vector potential of a vector. In fluid mechanics, 
Vx<A is called the vorticity of A, whereas V¢ is called the gradient field describing 
the irrotational fluid. Whenever a vector can be represented by the first term of 
(1.377) only, the field is called conservative (such as a gravitational or electrostatic 
field). 

By identity (1.346), the first term is evidently irrotational, and by identity 
(1.347) the second term is evidently solenoidal because 

Vx(V¢)=0, V-(Vx A)=0 (1.378) 

Rearranging (1.377), we have 


v-V¢=VxA (1.379) 
Taking the divergence of (1.379) gives 
V-(v-V¢)=V-(Vx A) =0 (1.380) 
Thus, we can express the divergence of v as 
V-v=V'¢ (1.381) 
Similarly, rearranging (1.377), we have 
v-VxA=V¢ (1.382) 
Taking the curl of (1.382) gives 
Vx(v-VxA)=Vx(V¢) =0 (1.383) 
Consequently, the following identity is obtained for the curl of vector » 
Vxv=VxVxA=V(V-A)-V7A (1.384) 


The last of (1.384) is a result of the application of identity (1.362). As discussed by 
Chau (2013), we have the freedom of setting one constraint for the four unknown 
potentials (one scalar potential and three components of a vector potential). When 
VxA decreases with distance r more rapidly than 1/r, the term V(V-A) vanishes. 
Thus, without loss of generality we have 


Vxv=-V°A (1.385) 
In conclusion, the divergence of v is expressible in a scalar potential and the curl is 
expressible in a vector potential (see (1.377) and (1.378)). 


1.17 GAUSS DIVERGENCE THEOREM 


The Gauss divergence theorem can be visualized by using the integral form of 
divergence given in (1.337). In particular, consider a finite volume V in Figure 1.22 
being subdivided into a number of smaller volumes, and its finite closed surface S is 
subdivided into a number of smaller closed surfaces. By applying (1.337) to each of 
the small closed surfaces we have 


4 I ‘ : 
pee ss Die. a7 9 (pv) +dS) “Qian ae (1.386) 
Vj V 


This is the so-called Gauss divergence theorem and it transforms the volume integral 
to the surface integral or vice versa. This equation also applies to the case where v is 
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an n-th order tensor (e.g., see Segel, 1987). The mathematical proof for the 
divergence theorem can be found in standard textbooks for engineering mathematics 
(Kreyszig, 1979), which somewhat differs from the current approach using a physical 
argument. 
In electrostatics, the electric field intensity E is related to the charge density p as 
V-E=p(r)/& (1.387) 
where & is the electric permittivity of free space. Applying the Gauss divergence 
theorem, we have 


f £-nas=—| pray = 2 (1.388) 
av “oy £0 


This is the Gauss law of electrostatics and it states that the total flux of electric field 
intensity coming out across a closed surface is proportional to the total charge O 
enclosed by the closed surface S. 


dS 


Figure 1.22 The domain for the divergence theorem 


1.18 KELVIN-STOKES THEOREM 


Let S be a smooth surface bounded by a simple closed curve C, which does not 
intersect itself as shown in Figure 1.23. Similar to the argument used in deriving the 
Gauss divergence theorem, we can subdivide a closed loop C into a number of 
smaller closed loops C; and the surface S into a number of small surfaces S;. By 
applying the integral form of the curl given in (1.338) we have 


fiv<v@]enas = Slim, <—G var) =v «dr (1.389) 
7 F i iG, c 


This is the so-called Kelvin-Stokes theorem. 
In electromagnetism, Ampere’s law relates the magnetic induction B to the 
current density J by 
VX B= LJ (1.390) 
where jo is the permeability of the free space. Applying the Kelvin-Stokes’ 
theorem, we have 


P Bear = ch JT ear = ppl (1.391) 
Cc CG 
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This states that magnetic induction is proportional to the total current passing 
through the surface enclosed by C. 

As discussed by Chau (2013), Lord Kelvin played a fundamental role in its 
development, so it is also known as the Kelvin—Stokes theorem (see biography of 


G.G. Stokes). 


Figure 1.23 The domain for the Stokes theorem 


1.19 VECTORS AND TENSORS 


Certain quantities, like temperature and pressure, are independent of direction and 
are known as scalars. Displacement, velocity, acceleration, and force are quantities 
that depend on direction, and they are called vectors. The term vector was coined 
by Hamilton in the nineteenth century. In the three-dimensional domain, a vector 
has three physical components, as shown in Figure 1.24. For example, a vector v 
can be expressed as a sum of three quantities and each one is the projected value 
along certain directions and these directions are indicated by their base vectors e; (i 
= 1,2,3). In particular, v can be expressed as 


3 
V = Vie, + V7€y +133 = Ye =Ve; (1.392) 
i=l 


¥(V1,V2,V3) 


Figure 1.24 A vector in Cartesian coordinates 
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where v; is also called the indicial notation or index notation of a vector. The 
physical component of the vector is denoted as vj; physically, it is the length of the 
vector v projected along the i-th coordinate of the system. In the last part of (1.392), 
the summation sign X is usually neglected; thus, repeated indices imply summation 
automatically. This is usually referred to as the Einstein convention for summation. 
The index i becomes a dummy index (i.e., it can be replaced arbitrarily by /, k, etc.), 
and it is no longer a free index (i.e., 7 cannot be set to 1, 2, or 3 arbitrarily). Instead 
of using physical components, vectors can be denoted by the symbolic or Gibbs 
notation (1.e., v). 

Certain quantities like stress and strain depend on two directions; first the 
direction of the surface where stress and strain are measured, and second the 
direction of the stress and strain acting on this surface. For example, stress can be 
represented by 


“> 


t 


where the indicial form is written in terms of Cartesian coordinates. Again, the 
Einstein convention is used in the last of (1.393). Or more explicitly, we have 
O = O71 Ce; + On7€7€y +.033€3€3 + 01 3€1€3 +. 03103€, +01 72€)€> 


3 3 
YD oyee; = oye1e) (1.393) 
=1 j=l 


(1.394) 
+ On €,€; +.073€7€; + 0370385 

The symbolic form o is most general and is independent of any coordinate system, 
whereas the indicial form oj is the physical component corresponding to a particular 
Cartesian coordinate system. Therefore, second order tensors can be written in 
terms of two vectors side-by-side called dyads, such as e1e3. Note that dyads are in 
general not commutative, i.e., e1e¢3 # e3e;. This dyadic form was proposed by Gibbs 
in the 1880s. Gibbs was the first USA PhD graduate in engineering in 1863 from 
Yale. There are also tensors of order higher than second. For example, a typical 
fourth order tensor is the stiffness tensor 


3.3 3 3 
C= YD Cee exes = Cy @je exe (1.395) 

i=l j=l k=l [=1 
In explicit terms, there are 81 terms (= 3x3x3x3), but we are not going to write 
them out here. The scalar can be considered as a zero-th order tensor, a vector is a 
first order tensor, stress is a second order tensor, and so on. In other words, all 
physical quantities can be considered as tensors. This is like saying that all numbers 
are complex numbers, since real numbers are just complex numbers with zero 
imaginary parts. The tensor is a very powerful concept and its analysis allows very 

tedious steps to be simplified. 

To see the power of tensor analysis, let us note that the dot product between 

two base vectors can be expressed as 


ee) = 8) | Ped (1.396) 
a 0 i¥ 7 
where i, 7 = 1, 2, 3; and 6j is called the Kronecker delta function. Therefore, if we 
write 

u =uje;tu,e,tu,e,, Vv =Ve,+,e,413e;, (1.397) 
the dot product between uw and v is clearly 
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u-v =(uje;)-(Uje;) =Uujuj,O, =Ujy, = UY +UyV, +U3V3 (1.398) 


Again, Einstein’s notation for summing over all possible i and 7 is implied. The use 
of the Kronecker delta function suggests that 
uju ;O; =u;v, =u 


a (1.399) 


If i # 7, we must have zero values. Thus, we must substitute 7 into / or j into i, 
otherwise this quantity is zero. In addition, there is no free index (index only 
appears once) on both sides of (1.399) because both 7 and j appear twice or are 
repeated (recall from the Einstein convention that a repeated index means 
summation). The rule of this simple “substitution” allows a very efficient and 
effective way to carry out a dot product using tensor analysis. 

Another important operation between tensors is the double dot product 
between two tensors. Hooke’s law will be used to demonstrate the double dot 
product, and it can be expressed as 

o=C:é (1.400) 
Using the symbolic form, we have 
ao=Ciée= (Cie? jen) : Ca a Cit Emn@e j (ex 6e iy Ye, °e,) al 401) 


= Crna Emn km ini’ ii = Cra Ep Ge J 


Note that the result is a second order tensor with a pair of base vectors for each 
component. Note that any index in a tensor equation cannot be reused, and the 
index only appears once or twice and cannot be more than two. Mathematically, 
fourth order tensor can be considered as a mapping function between two second 
order tensors. 

Equation (1.401) can also be written using the component form (instead of 
dyadic or polyadic forms) as 


O7 = Cit Ex (1.402) 


The cross product between base vectors can also be conducted using algebra of a 
symbolic operation similar to the Kronecker delta in the dot product. In particular, 
cross products between any combinations of two base vectors are: 


€,X@, =@3, €)X€; =e, €;Xe, =e, exe, =0, e,xe, =), 


(1.403) 
@ X€, =—@3, @3 X@, = —€), @, X@, =—@>, e, xe, =0 
By applying (1.403), the cross product between two vectors is 
UXV = (UU, TU,e, Tue; ) X (Ye, TV7e, +13e 
( ue, ture, tu3e; ) x (Ye, +V,e, +133 ) (1.404) 


= (Un V3 — U3Vp ey + (U3V, —UyV3 ey + (Uy V2 —UQV, Jes 
The cross product is sometimes easier to remember using the following expansion 
of determinant: 
Qe & & 
UXV=|My Uy Uz] = (U_Vz —U3Vq Je; + (U3V, — UyV3 Jy + (UV —UyV, Jez (1.405) 
My Ya V3 
Now, we attempt to generalize the cross product analysis using algebra. Consider 
the special case of (1.405) that 
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Q ep & 3 
Q Xp =|1 0 0) = ein, + ejn@ + €123€3 = > e24ee =,27€, =€3 (1.406) 
0 1 0 ft 
where ej (in general, i, 7, k = 1,2,3) is a symbol to represent the different 
components of the cross product for base vectors. This was introduced by Levi- 
Civita and thus is known as the Levi-Civita symbol or permutation symbol. Since 
the cross product of two vectors must itself be a vector with three components, 
clearly, we obtain from (1.406) 
C121 =9, 22 = 9, en3 = 1 (1.407) 
Consider another special case of (1.405) that 


Qf 8 & 3 
€3X@, =|0 0 1) = e351, + €379€o + €393€3 = > esse = €39,€, =—e, (1.408) 
0 1 O ia 
Similarly, we must have the following values for the permutation symbols as 
C39, =—l,  ey9 = 0, €y73 = 0 (1.409) 
Let us consider another example of (1.405) 
Q 8 & 3 
e€3x€3;=|0 0 1] = e338; +3328) +€33383 = Y esses = @33,€, =0 (1.410) 
0 0 1 i 
Comparison of components on (1.410) gives 
233, =0, 632 =0, e333 =0 (1.411) 


Repeating this procedure for the rest of the combination of cross product between 
base vectors, we get 
€931 =1, p37 = 0, C933 =0, 311 = 0, 312 = 1, C313 = 9, 


4111 =9, 12 =9, 13 =0, p21 = 0, ey99 = 0, ey93 = 0, (1.412) 
1, =9, p12 = 9, €y13 =—-1, 3, =9, S32 =-1, e433 =9, 


By now, we have obtained all 27 components of the permutation symbols. We 
observe that 

€193 = €031 = 2312 =1, 132 = 13 = 2321 =—1 (1.413) 
Note that only nonzero values for ej, with indices i #7 # k, whereas all other 
components with repeated indices are zeros. We can summarize the values of the 
permutation tensor in a cyclic sequence shown in Figure 1.25. The magnitude of ej 
is either —1, +1, or 0, which is determined by the following rules: (1) ej equals 0 if 
any two indices are equal; (2) ejx equals +1 when i, j, k are 1, 2, 3 or an even 
permutation of 1, 2, 3; and (3) ej equals —1 when i, j, k are 3, 2, 1, or an odd 
permutation of 1, 2, 3. As shown in Figure 1.25, if the different subscripts follow an 
even or clockwise permutation, we have the values being 1; whereas for anti- 
clockwise or odd permutation, we have the values being —1. We also find that 
moving subscripts in a cyclic order (say the first subscript is moved to the last while 
all other indices shift forward) yields the same value. However, if the positions of 
any two indices are reversed, we will have the original value being multiplied by 
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—1. This permutation tensor was proposed by Italian mathematician Levi-Civita. In 
summary, we have 


Odd 


2 


Figure 1.25 The odd and even permutations for 1, 2, and 3 in the Levi-Civita symbol 


Ci =e jki = kay © ik gi Ci (1.414) 
By now, we can rewrite the cross product of any two base vectors as 
€; XO 5 = Cy ly = Sin lK (1.415) 
This formula can now be applied to the cross product of any two vectors 
W = UXV = UE; XV jE; = UV pyjlx (1.416) 


In terms of tensor notation, the physical components w; can be given by 
W, = Cp U Ve (1.417) 


In addition, the permutation tensor also finds application to the calculation of the 
determinant in tensor form: 


det| 45 


= ejjx Ait Aj2 Ak3 (1.418) 
More amazing formulas for ejx are given Problems 1.37-1.41. 


1.20 e-6 IDENTITY 


The following e-6 identity has been found extremely useful in tensor analysis: 
CijkCirs = O jrO ks — O sO kr (1.419) 
There is more than one way to prove this formula. The straightforward but tedious 
way is to exhaust all possibilities of j, k, r, and s. Instead, here we will present a less 
obvious approach. 
First, we will establish the following identity of the delta function and 
determinant 


Omp Omq Oms 
CsaCin=\Om Ong On (1.420) 
Or Orq Ors 
Consider the determinant of A; to be given as 
An Ai Ais 
det A=|491 Az An (1.421) 


A, A32 A33 
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When we interchange any two rows or any two columns of the determinant, the sign 
of the determinant changes sign. That is, 


An A22 Az3} |Ai2 An Ai3 
Au Ai2 Ai3{=|A22 Ar A23}=—det A (1.422) 
Ax, Ax. A33}_ [432 A313 

For an arbitrary number of row changes, we can write it as 


Am Am2 Am3 
Am An2  An3|=@mnr det A (1.423) 
A, Ar. Ar 

Similarly, for an arbitrary number of column changes, we can write it as 
Alp Aig Als 
A2p Arq A2s|= pgs det A (1.424) 
A;, A3q A3s 


Thus, for an arbitrary number of row and column change sequences, we can 
combine 


Amp Amq Ams 
Anp Ang ns |= € pqs@ mnr det A (1 425) 
Arp Arg Ars 


Take the special case that A; = 6), and detd = 1; we have (1.420). Next, (1.420) can 
be expanded as 


2 pqs@ mar — 5 mp6 ng rs ~ Onis) ~ Ong On ~ OnsO mp) 


(1.426) 
HG mslO apo ~ OngO rp ) 
Now we can set s =r in (1.426) to get 
e pqs@mns — 6 mp(6 ng ss ~ Ons0 sq) a 6 ml npO ss a OnsO sp) 
FO wislOri 6 sq Ongd ) 
cia Manas (1.427) 


= 365 mp6 ng ad O mp0 nq a 36 mqO np + 6 mqO np + 6 mqO np _ Ong mp 
= (Omso ng > 5 mqO np) 
Thus, the e-d identity is established. 


1.21 TENSOR ANALYSIS IN CARTESIAN COORDINATES 


The formulas of expressing the dot product by the delta function, of expressing the 
cross product by permutation tensor, and of e-6 identity are useful tools for tensor 
analysis. We will demonstrate the validity of (1.362) as follows: 


a aA, 
Vx A) =(e€, —)x (Ae) =(e; — =e,, A, e; 1.428 
( “ ) (e; — ex) (e; xe;) ax, ik k, je: ( ) 


é 
Vx (Vx A) = (€y, =) (eu 61) = (Cn * 1) At, = Ste At, nn (1.429) 
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Now, we apply the e-6 identity 
Vx (VX A) = Cnini@i¢ Ag, in€m = (Ong nk — Omk nj At, in@m 

= (Oj Onk At, in — Omk nj Ak, jn Ym (1.430) 

= (Ay mn — Arn,nn Em 


On the right hand side, we have 


jee ae 4 =8)4524424,, 030 


n,n 


OA 
V(V-A)=e, ee = thy anil (1.432) 
Xj 


rae mem — =6 An, jk© mm = An je, i&m (1.433) 


Summing (1.433) from (1.432), we a the right hand side of (1.362) equal to 
viv ° A) -_ VA= Ay nmem — An, ie im =(A, wm ~ ee Jen =Vx (V x A) (1 434) 


The last of (1.434) is obtained by using the result of (1.430). Note also that the order 
of partial differentiations is interchangeable, that is, 
Anam = An, (1.435) 


Similarly, other vector identities given in (13 53) to (1.370) can also be proved similar 
to the above tensor analysis. 


1.22 TENSOR ANALYSIS IN CYLINDRICAL COORDINATES 


Tensor analysis in polar cylindrical coordinates is more complicated than those in 
Cartesian coordinates. The main reason is that the base vector is no longer fixed in 
the space. Any position vector r in a Cartesian coordinate system can be written in 
terms of a cylindrical coordinate system (p, ¢, z) as shown in Figure 1.26: 

r= xe; +x,e, +x3e; = pcosde, + psin ge, + ze; (1.436) 


Figure 1.26 Cylindrical coordinates 


The new set of base vectors in cylindrical coordinates is defined by 
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pees (1.437) 
ha Ga 
where 
a be (oe) Toe 
fe= || =f 22) oe 22 | | = (1.438) 
Ody Oda Oda 04a 


and ga equals p, ¢, or z. In particular, we have for cylindrical coordinates 


2 2 2 
ooo l= ff 2] 4 2] af S2) = feos? prsin2¢ =1 (1.439) 
(eye) (eye) (eye) (eye) 
2 2 2 
or Ox, OX Ox, Jp? cos? b+ p' sin? b 
hg= = + = cos’ @+ p* sin” g = 1.440 
. Fi eB Gy e a a 
2 2 2 
poe (cl cai i ica en ce (1.441) 
“ | Oz Oz Oz OZ 
Thus, substitution of (1.439) to (1.441) into (1.437) gives 
1 a ‘ 1 a : 
ep > 7 cos ge; t+singer, eg= ; 5 = —sin ge; + cos Pez 
a Ks (1.442) 
eee 
e he e3 
It is obvious that we must have the following identities: 
oe, 0 
—— = — (cos @e,+sin =—sin cos =e 1.443 
ag oe pe ger) pe per =ey ( ) 
6 = © (sin ge,+008 $02) =-(cospe\+sin der) (1.444) 
sin + COS = —(cos de; +sin =-e, . 
ab og e1 e2 | e2 f 


while all other derivatives of the base vectors vanish. For example, the displacement 
gradient tensor can be formulated in dyadic notation as 


0 0 0 
Vu=(e, ay ee +e, Bp Moen * HH ep +u,e,) 


pop 
0 0 Ou 
a e.e,t+ d (u,+ “6 jee e,e,+ Lt aT is é e,e, (1.445) 
op ep 0¢ oz poo ** 
Ou, 9 


u., : Uu 
e 
a 7? Gr ? 7 & 
In obtaining the above equation, we have already used the coordinate variation of the 
base vectors obtained in (1.443) and (1.444). For example, the derivative taken with 
respect to is 


Ou, 
Se ad Ug Je ge p 
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0 
(up€, tug ey +u,e,) = eg—— (Une, ) +eg—_ (Uy &g) +eg—_,€ 


€4 5g Ho 7 ey pop ** 
Ou, p oe oy Oug Ug e, ou, er 
= 86 Tag? tet ap pag? py ap °F? pa¢ Sm 


Ps ee P + eye Oy de eee 
This result explains why two extra terms appear in the bracket terms on the right hand 
side of (1.445), and they are the results of the direction change of base vectors with 
respect to the change in variable ¢. Clearly, Vu is a second order tensor since every 
component is accomplished by two base vectors. This result can readily be used to 
obtain the strain tensor, which is defined as 


E= 5 (Vatu) (1.447) 


where uV = (Vu)! or the transpose of Vu. Substitution of (1.445) into (1.447) yields 
the following physical components in cylindrical coordinates 


a a 
Paes pe, poe eee (1.448) 
OZ fa) Pp poo 
1 Oug 1 Ou, Ug 11 Gu, Oug 
°e6 2 Sp po p & 2% ag Bz 
(1.449) 
ed 


These equations are the same as those obtained by Timoshenko and Goodier (1982), 
starting from the kinematics of compatibility in deformations. Thus, the tensor 
equation (1.447) provides a concise and elegant form for the strain-displacement 
relation, and, more importantly, it is independent of any coordinate system. 

Similarly, the following identities in cylindrical coordinates can be obtained: 


Ou, Le 4 1 Cus  6u, 


Veu= 1.450 
6p p” pod & ( ) 
1 Ou é p 64g 1 OU, 
Vxu=e 7 e 2\+e + 1.451 
a ap ors ap? are : > od? ' ) 
, 2 10 ] 2 2 
vifavevp=2L yi Wt OF of (1.452) 
dp” pop pag Oz 
2 Ou Uu 2 Ou 
V-u=e,(V'u, ———_°-—)+e,(V"u tog, oe Vu, (1.453) 
P Pp p° od a ¢ @ ar) a 


The three components of the equilibrium ee V-o = 0, can be written explicitly 
as 


(1.454) 
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00 
18 (p¢,,,)+ = +4 Ct <9 (1.455) 
p op opp og 
Oey 804 0 
1 90H 5 Ppp | 924 4 Prd 9 (1.456) 


p o¢ 0p Oz p 
These equations are the same as those obtained by Timoshenko and Goodier (1982) 
by considering force equilibriums along p-, ¢-, and z-directions for an infinitesimally 
small element. However, the present approach is more efficient and systematic in 
obtaining the component form of the equilibrium equations in terms of stresses. 


1.23 TENSOR ANALYSIS IN SPHERICAL COORDINATES 


The development of this section follows closely the discussion in the previous 
section. Any position vector r in a Cartesian coordinate system can be written in 
terms of a polar spherical coordinate system (7, g, @) as shown in Figure 1.27: 

r= Xe, + Xe, +x3e, = Psind cos~e, + PsinO singe, + pcosOe, (1.457) 
For spherical coordinates, ga equals either p, g, or 0. Thus, 


or Ox, ° OX ; Ox, ‘ 
= + + 
(eye) (eye) 0p (eye) (1.458) 


= sin? Acos” p+ sin” 0 sin” ) +cos’ @ =1 


or Ox, : Ox, : OX; ; 
- + + 
ol) 0p 0p 0p (1.459) 


= Ve sin? O sin” pt p” sin” @ cos” yg =psind 


Or|_ |{ ox, a OX im Ox3 ; 
00 00 00 00 (1.460) 
= Ve° cos” cos” p+ p* cos’ Asin” p+ p* sin”? 6 = p 
Substitution of (1.458) to (1.460) into (1.437) results in the following base vectors 
e, = (sin @cos g)e, + (sin Osin g)e, + cos Ge, 


hp= 


hom 


€9 = (cos cos ~)e, + (cos Acos ~)e, — sin Ge, (1.461) 
€, =—SiN Ye, + cos pe, 


The variation of base vectors along coordinate directions is more complicated than 
that for cylindrical coordinates; in particular, the following nonzero terms are 
obtained: 


Oe 
a =ey, S: = -€,; o = cos Oe, , 

ig (1.462) 
Oe 


oe, 
= —sinOe,—cosGe,, —— =sinde 
P @ Op Q 
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Figure 1.27 Spherical coordinates 


while all other derivatives of the base vectors vanish. The differential operator in 
spherical polar coordinates is 
V=|e, g +g Le +€, l e (1.463) 
0p poe psin 8d 0p 
Without showing the details here, we quote the following physical components for the 
strain-displacement by specializing (1.447) to the spherical coordinates: 


Ou, Uu Ou u 
eam : OP 4 MO cota, Epp =. Ey = ae Cus (1.464) 
psind 0p p op (eye) Pp poe 
Ou Ou, Ou, Uu 
Bag I (e.g 1 Mo ee Te 0) Ab) 
2 0p po op 2 psind 0g Oop op 


eee | dug , 1 Up Ug 
2 psind 0g p oO 
Again these results are the same as those obtained by considering the kinematics of 
compatibility in deformations considered in Timoshenko and Goodier (1982). 
The following identities can be obtained in polar coordinates as 


up 2 1 Oug 1 OUg 


cot 0) (1.466) 


£60 


u= +—wu 4 + cot uy) + =e 1.467 
ap p?” 00 0) psin@ do ( ) 
e du, 0 
Vxu=e 4 (Oe 5 cot Ou, a U6) 4 09( ! Up, Uy Ug 
’p 00 sin? 0g psind 0g dp -—p 
(1.468) 
(ove ug 1 Op 
Op pp 00 
2 
V2f= re) (p? of % 1 (6) ne 1 of (1.469) 


Zs : re: 
opp’ sing 0d 08 p’sin*0 0p 
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2u, 2cotO 2 OUug 2 Uy 


Vu=e Vu u 
pl p 2” 2” 0 Fe 00 p’sind do 
Ou Ou 
ee a (1.470) 
p 08 psin’d p'sind Og 
Ou, Uy 2cotA Cus 


+e (V-u + 
es? pp’ sind 69 p’sin’?@ _p*sind Op 

In spherical coordinates, the three components of the equilibrium equations, V-o = 0, 

can be written explicitly as 


Oo 00 Oo 
po, 1 Lae | 00 | (2G |, — O99 Typ + Fy9 COLO) =0 (1.471) 


op p 00 psin@d Cp Pp 


1 
+—[30 7 +(Op, —9,,,)cot@]=0 (1.472 
ap ? a0 psind ap Fa po ( 00 oo) | ( ) 


Oo (eley 
Pe Op + 1 


ea Fe 130 426, cotéy=0 (1.473) 
0p p 00 psin@d 0p pep Opp ss , 


Readers are advised to work out the details in obtaining these equations. 


1.24 SUMMARY AND FURTHER READING 


In this chapter, we have summarized a number of important theorems and formulas 
that will be useful for our later determination of the solution of differential equations 
in engineering and mechanics. Due to the space limitation, we only sketch their proofs 
and demonstrate their applications through examples. For elementary mathematical 
analysis, we recommend the books by Hardy (1944) and Tranter (1957). For more 
discussions on classical methods and analyses, we refer readers to the advanced 
textbook by Whittaker and Watson (1927). Stories of two Nobel Prize winners should 
be mentioned here. Lars Onsager, 1968 Nobel Prize winner in chemistry, worked out 
all “difficult problems” in Whittaker and Watson (1927) when he was a teenager, and 
this laid down the mathematical skill for his latter achievements. In 1933, Onsager was 
awarded the Sterling and Gibbs Fellow at Yale University, but an embarrassing 
situation occurred, when it was discovered that he had never received a PhD. His 
colleagues advised him to try for a Yale PhD, and Onsager wrote a thesis on Solutions 
of the Mathieu Equation of Period 41 and certain related functions. Nobody in the 
chemistry and physics could understand it and it was sent to Prof. E. Hille of 
Mathematics Department, who was an expert in the area. Prof. Hille was so impressed 
that he suggested to Prof. Hill (head of Chemistry Department) that the Mathematics 
Department at Yale would be happy to recommend him for PhD. Not wishing to be 
upstaged, the Chemistry Department awarded Onsager a PhD in Chemistry. Mathieu 
equation was covered in Chapter 19 of Whittaker and Watson (1927). The second 
story is on Subrahmanyan Chandrasekhar (1983 Nobel Prize winner in physics). 
During his 1930 voyage to England, Chandrasekhar derived his celebrated 
“Chrandrasekhar limit” for white dwarf stars, and studied Whittaker and Watson 
(1927) seriously on board. The mastery of this “old” book is not easy but clearly 
would be helpful in understanding more advanced methods of mathematical analyses. 


74 Theory of Differential Equations in Engineering and Mechanics 


Spiegel (1963) summarized a lot of essential results in calculus. For the analysis of 
complex variables, Spiegel (1964) covered a lot of classical results in a concise 
manner, and it is a good reference book to start with. Other references on complex 
variables include Watson (1914), Silverman (1974), Copson (1935), and Forsyth 
(1893). For results related to Ramanujan’s master theorem, Ramanujan’s Lost 
Notebooks by Berndt (1985, 1989) are the authorities. For the discussion on tensor 
analysis, we refer to Segel (1987), Wong (1991), and Chau (2013). 


1.25 PROBLEMS 


Problem 1.1 Use the binomial theorem to find the following sum of binomial 
coefficients: 


SH14C 4 Co 4-40 49 CP 41 (1.474) 
Ans: $= 2” 


Problem 1.2 Find the following sum 
S,=1=C)+4C +...4(-1)" °C?" 4+C1)" (1.475) 
Ans: S;=0 


Problem 1.3 Use integration by parts to prove the formula 


oO F | — '! 
I Pt Qa (1.476) 
0 


grt 


where n is an integer and (2n—1)!!=1-3-5---(Qn—1). 


Problem 1.4 Prove the formula 
«2 2 ! 
} tte ax = ne (1.477) 
0 2 


where n is an integer. 


Problem 1.5 Show that the Jacobian for the change of variables given in Section 
1.4.6 is indeed r. 


Problem 1.6 Find the value of the following number: 


Vi (1.478) 


Ans: Infinite answers and the principal one is 4.810477381... 


Problem 1.7 Show that 


eee =| age (1.479) 
2 2 


Problem 1.8 Find the value of the following number: 
. (1.480) 
Ans: Infinite answers and the principal one is 0.08858... 
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Problem 1.9 Find the value of the following number: 
pt (1.481) 
Ans: Infinite answers and the principal one is 0.15787... 


Problem 1.10 Find the roots of 


i (1.482) 
Ans: 1, (-1+iV3)/2, and (-1-iv3)/2 


Problem 1.11 Check whether the following complex function is analytic: 
f(z) = F\[Re(z)]+iF5[Im(z)] (1.483) 
Ans: No 


Problem 1.12 Show the following identities: 
cosh(iy)=cosy, sinh(iv) =isin y (1.484) 


Problem 1.13 Show that: 
zo =e8 OY 2") eosin Ino + E(w + 2mz)|+isin[7n Ino + E(w + 2mz)]} (1.485) 


where 
6 =€+in, z=o(cosy+isiny) (1.486) 
and m is an integer. 


Problem 1.14 Use the result of Problem 1.13 to show that: 
x' +x =2cos(Inx)cosh[{2z(n+m)]—2isin(In x)sinh[2z(n+m)] (1.487) 


where x is real and m and » are integers. 


Problem 1.15 Show that the modulus of the result in Problem 1.14 is 
xi + | = J2{cos(2 In x) + cosh[4z(n + m)]}" (1.488) 


where x is real and m and n are integers. 


Problem 1.16 Find the real part of 
Re fiMDy = e4e+D2"/8 ogg G (4k +1) In 2} (1.489) 


where & is an integer. 


Problem 1.17 Find the value of 


2in{ 4) (1.490) 


1+i 
Ans: There are infinite solutions, and the simplest one of it is 7 
Note that this is the definition of z given by Wronski! No wonder he is interested 
in metaphysics (see biography section)! 
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Problem 1.18 Show by chain rule that if @ = o(w) and u = u(x1, x2, x3), the gradient 


of is 
d 
Voe= EN, 
du 
Problem 1.19 If A is any vector function, show that 
2 
(Ax a = Ax dA 
dt dt dt? 
Problem 1.20 Find the time derivative of 
2 
(4, 
dt dt 
3 
Ans: A ca ails 
dt dt 


Problem 1.21 Show the validity of the formula 
V-(Vy)—V (WV 9) =gV'y -W'd 
Problem 1.22 Show the validity of the formula 
1 
vx(Vxv)= Pe a 


Problem 1.23 Evaluate the divergence and curl of the position vector r 
Vxr, Vor 

Ans: 0 and 3 

Problem 1.24 Show that 


v4) =0 
Fr 


(1.491) 


(1.492) 


(1.493) 


(1.494) 


(1.495) 


(1.496) 


(1.497) 


Hint: Chau (2013) called 1/r the granddaddy of all solutions of Laplace equations. 


Problem 1.25 Find the value of 1 such that 
V2 n a 0 
Hint: More than | answer 
Problem 1.26 For spherical coordinates, prove the following identities 


é 
a0 ( €,€p) = € £0 + Coe > 


0 
39 (2080) =~ Soko +€p€g 


(1.498) 


(1.499) 


(1.500) 
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0 
39 (6080) =— “oko (1.501) 
0 : 
Bg (er) = Coen + Epp) sind (1.502) 
0 ; 
Bop (6068) = Cnt a SING + ply COSE (1.503) 
a) : . 
ap ene eee re geone) (1.504) 
6 
30! Clg) = (Cglg + Cgly) COSO (1.505) 
0 : 
Bp (2920) = Colo —Epep)SIN G— lop COS (1.506) 


Problem 1.27 For cylindrical coordinates, prove the following identity 
au ou, u au a) 2 
V(V-u) = e,( ane p_—£ ta L i re? 
app opp popod p> dp Opec 
2 a a a 
2 (1.507) 
p add p’ d¢ pdpog p’ d¢ 


au, 1 Ou, 1 Ou, Ou, 


+e, + + + 
““ézdp p Oz pdz0d az? 


Problem 1.28 For spherical coordinates, prove the following identity 
Ou, 20u, 2u 10 1 0 
V(V-n) = e,( 2 4 0 p Ug : Ug cot OUug 
dp 6p Op op pp Op08 pp’ 08 p Op 


+e, ( 


) 


_ cot ee 1 Ou, 1 Wy 
p ” psind dpe9 p*sin@ dy 
ere au, 2 Ou, 1 up cot dug Ug 


+ 
d6p p? 00 p? a6? p’ 26 p’sin’ A 
re . - (1.508) 
1 Ou,  cotO OU, 
+ 
p’sin@ 0@gp p’sind 0 
Ou Ou 2 
+e,,( : Bigg ae O"Uug 
psind 0gop p*sin@ OP p*sin@ 06g 
cotO Oug 1 Ou, 
26 a ee 2 
p sind 0p p*sin’ 0 dy 
Problem 1.29 For cylindrical coordinates, prove the following identity 
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1 Oru, | Ou, 1 eu, Ou, Ou. 
p oped p* od p? d¢? a2? ~— Opa 
2 2 2 
l Ou, Oug O'Us | OUuy . uy 1 Ou, ; 1 0 py (1.509) 
paz0p a dp pap pp’ p’ oO p dpo¢d 


au, 1 Ou, Ou, 1du, 1 Ou, 1 OU, 
+ ae 


Vx(V xu) = e,( 


+ey( 


p 
““éz0p p oz ap pop pp’ dg’ p dzd0¢ 


4 


+e 


Problem 1.30 For spherical coordinates, prove the following identity 
1 Ouy 1 Og 1 OU,z cotO dug ,_ 00t 8 


Vx(Vxu)=e 
( ) 0K 0000 p 00 ea 067 fe) op p 0 


cot@ Ou, 1 au, 1 au, 7 1 OUg 
p 00 p’sin’ @ ay’ psind dp0y_p’sind Op 
he 1 Ou, cotO Ou, 1 OUuy aug 
p’sind 0609 p’sind 09 p’sin? 6 dy" ap” (1.510) 
2 Oug :: 1 Ou, 
p Op p opoe 
a. 1 au, uy 2 Ou, iT Ou, cot @ Ou, 
**psind dp0p ap? pop p? dae pp 06 
Up 1 @'uy  cotO Guy 
"2D a” Seen ain alent Oe 
psin’O p° sind g p° sind CP 
Problem 1.31 Show the validity of the following equations 
Re[(1+a)!*#] = et 2-7/4" cos(in J2 + 2/44 2nz) (1.511) 
Im[(.+a!*!] = el ¥2-#/4-2" sindin J2 + 2/44 2nz) (1.512) 
Problem 1.32 Find the value of 
(2° +27)/2 (1.513) 
Ans: 0.769238901... 
Problem 1.33 Find the value of 
en (1.514) 


Ans: e7sinh() 


Problem 1.34 Prove the following identity 
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(1+ iz)" 
-e= ) ——— 1.515 
2, i (1.515) 
Problem 1.35 Prove the following identity 
in2 
sesb reo e:.dok Ones (1.516) 
2sin@ 


Hint: Use Euler’s formula and note the following identity: 


x2" aye =(x—y)(x2"" gee age ye +..42997"? idles | (1.517) 


Problem 1.36 Solve for z in the following equation 
cos(z) =2 (1.518) 


Ans: z=iIn(2+~3) 
Hint: cos(ix) = cosh x 


Problem 1.37 Show the following tensor identity for the permutation tensor: 


emjkenjk = 28 nn 


(1.519) 
Hint: Use the 6-e identity. 


Problem 1.38 Show the following tensor identity for the permutation tensor: 
&kmj©kinenpg = ‘ml © jpg =O ing (1.520) 


Hint: Use the 6-e identity. 


Problem 1.39 Show the following tensor identity for the permutation tensor: 
mj &kinenpm a e ipl ( 1.52 1) 


Hint: Use the 6-e identity. 


Problem 1.40 Show the following tensor identity for the permutation tensor: 
Emjk©kinEnpg&gim - 6,5 p) + 61 6; (1.522) 


Hint: Use the 6-e identity. 


Problem 1.41 Show the following vector identity: 
emjk &xinenpg©gim = oj Onl + 61 Oni (1 .523) 


Hint: Use the 6-e identity. 


Problem 1.42 Show the following vector identity: 
Vx(uxv)=u(V-v)+(v-V)u-[v(V -u) + (u-V)v] (1.524) 


Problem 1.43 Show the following vector identity: 
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Viwv ) =(v-V)u+(w-V)v+yv x (V xu) +ux(V xy) (1.525) 


Problem 1.44 Show the following infinite product for the sine function: 
2 2 2 
sinx =x(1-~,)(1 val 
ya 


es (1.526) 


2 x 3x" 
Hint: Recall from Taylor series expansion that 
305 7 
Re ae alee a a (1.527) 
3! 5S! 7! 
Can you factorize the infinite polynomials as products of their infinite roots? Note 
also that this result agrees with formula (1.316) given in Example 1.10, but in this 


problem, without using Mittag-Leffler’s expansion formula. 


Problem 1.45 Following the similar idea used in Problem 1.44, show the following 
infinite product for the cosine function: 


Ax” Ax? Ax? 


cos x = (1 re yd Jee (1.528) 
x 3° x" ea 
Hint: Recall from Taylor series expansion that 
2 14 6 
ele ye oe (1.529) 
2! 4! 6! 


Can you factorize the infinite polynomials as a product of their infinite roots? 


Problem 1.46 This problem considers the Basel problem. Following Problem 1.44, 
we can rewrite (1.526) and (1.527) as 


: 2, 2 2 
sin x x x x 
=(1 1 1 in 1.530 
ts (1.530) 
: 2 4 6 
acai (ae en ler (1.531) 
x 31° 5! 7! 


(i) | Show that the coefficient for the x* term on the right hand side of (1.530) is 


1 1 1 1 1 
(—+ + +... =- (1.532) 
mg? 3x7 a 2, n 
(ii) Compare this coefficient for the x? term on the right hand side of (1.531) and 
prove that 
1 1 1 41 1 a 
= $$ peep (1.533) 
2, nm PP 2 32 we : 


The quest for the exact sum of (1.533) was first posed by Mengoli in 1644 and 
again by Jacob Bernoulli in 1689 to the boarder mathematical community. This is 
known as the Basel problem because of its association with the Bernoulli family 
who lived in Basel, Switzerland. This result was first obtained by Euler in 1734 at 
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the age of twenty-eight. The closed form sum in the last part of (1.533) brought 
Euler instant fame in the mathematical world. This result also led to his later 
analysis of the Euler-Riemann zeta function and the well-known Riemann 
hypothesis (see Problem 4.56 in Chapter 4). 


Problem 1.47 Show the following series expansion of sine integral Si(x) defined as 
(Abramowitz and Stegun, 1964): 


; _ sin x _ oO Cie 
i J ao data! ny 


Hint: Recall (1.531). 


Taylor & Francis 
Taylor & Francis Group 


http://taylorandfrancis.com 


CHAPTER TWO 
Introduction to Differential Equations 


2.1 INTRODUCTION 


Most of the textbooks on the history of mathematics either did not cover or only 
briefly mention the discovery of calculus, let alone the historical development of 
differential equations. In terms of the coverage of the development of differential 
equations, the three-volume series by Kline (1972) Mathematical Thought from 
Ancient to Modern Times are the best. In addition to mathematical thought, a lot of 
mathematical details were covered. The application background of differential 
equations was also covered in detail. 

The origin of differential equations naturally arises from the study of calculus, 
discovered independently by British physicist and mathematician Newton and 
German mathematician and diplomat Leibniz. The dispute on the priority of the 
discovery of calculus between Newton and Leibniz is briefly covered in the 
biography section. Our present usage of symbols and terminology owes mainly to 
Leibniz. Leibniz also discovered the method of separation of variables and the 
general theory of homogeneous type for first order ordinary differential equations 
(ODEs). Through correspondence between Leibniz and the Bernoulli brothers, 
many problems of differential equations were solved. The brachistochrone problem 
occupies a central place in the development of differential equations, and it was 
solved by Jacob and Johann Bernoulli, Newton, Leibniz, and L’H6pital. Daniel 
Bernoulli, son of Johann Bernoulli, derived partial differential equations for fluid 
mechanics, whereas Leonhard Euler, a student of Johann Bernoulli, derived and 
applied differential equations for mechanics analysis. The method of integrating 
factors for ODEs was also discovered by Euler, and he was responsible for the first 
systematic solution technique for solving differential equations. Lagrange, an 
Italian-born French mathematician, contributed significantly to the development of 
theory for the solutions of differential equations, particular for the particular 
solution for nonhomogeneous ODEs and first order partial differential equations 
(PDEs). Laplace used differential equations to model the motions of celestial 
bodies, and the Laplace equation is one of the most important second order PDEs. 
Considering the vibrations of strings, D’Alembert considered the first solution of 
wave equations, another important second order PDE. Many others made important 
contributions to differential equations, including Riccati, Clairaut, Fourier, Airy, 
Jacobi, Poisson, Bessel, Legendre, Hermite, Hankel, Monge, Charpit, Cauchy, 
Pfaff, Gauss, Goursat, Ampere, Green, Helmholtz, Riemann, Tricomi, Courant, 
Robin, Dirichlet, Fredholm, Fuchs, Liouville, Sturm, Painlevé, and many others. 
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2.2. TOTAL AND PARTIAL DERIVATIVES 


If a function u(x) depends on only one variable, the change of the function with 
respect to the change on the variable is denoted as 


du 
ded 2.1 
z (2.1) 


This notation was proposed by Leibniz and “d” for differentiation or derivative. 
However, when a function depends on more than one variables, say u(x1, x2, ..., Xn), 
as discussed in Chapter 1, the change in function u with respect to the change of 
only one variable is called partial differentiation. Our current adopted notation for 
partial differentiation was proposed by Legendre as the “partial d” or “oO” (Cajori, 
1993) 

OU(X1,Xq5-+45X),) 


Ox; 


L 


(2.2) 
where 7 = 1,2,...,n and x1, ..., X» are called independent variables. Since “0” also 
symbolizes a rounded d, it is sometimes pronounced as “round d” or simply 
“round.” 


2.3 ORDER OF DIFFERENTIAL EQUATIONS 


The unknown function wu is also called the dependent variable. In general, a 
differential equation for an unknown u can be written symbolically as: 


Ou Ou O"u 
F555 o> a Saye )=0 (2.3) 
Ox, ox, Ox," Ox," +++ Ax,” 
where mj, m2, ..., Mn are integers. This is clearly a partial differential equation or 


PDE. The order of the differential equation is determined by the highest derivative 
term. For example, the order of the equation given in (2.3) is 


m=m,+m,+'+m, (2.4) 
For the case of single variable x, (2.3) can be simplified to: 
du d™u 
F (x,u,—,"", =0 22) 
i( dx dx” ) 2») 


Clearly, the order of the differential equation is m. 


2.4 NONLINEAR VERSUS LINEAR 


When a differential equation is linear, all terms of the unknown or its derivatives in 
the differential equations can only appear linearly. For example, the following is a 
linear ODE of u 
n n-l 
BM 5G" 4 ag Ce 7G) (2.6) 
ne” dx" 
This ODE is linear no matter how nonlinear the functions aj,..., a, depend on the 
variable x. Whether the differential equation is linear or nonlinear, it all depends 
on the nonlinearity of uw and has nothing to do with the coefficient functions d)...., 
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dn, Which are not functions of the unknown u. All terms of wu or its derivatives 
appear in (2.6) in their first order or first degree. In general, if a differential 
equation is not linear, it is nonlinear. However, in more advanced books dealing 
with nonlinear differential equations, there are two kinds of nonlinearity: (i) quasi- 
linear differential equations, in which nonlinearity of u appears in the coefficient 
functions in q,..., dy in (2.6); and (ii) nonlinear equations in which nonlinearity 
occurs in the highest order derivative term (Kevorkian and Cole, 1981). Clearly, 
the first kind of nonlinearity is called quasi-linear because its solution technique is 
simpler than those for the second type of nonlinearity. 

For example, the following second order partial differential equation is quasi- 
linear because there is no nonlinearity in its highest derivative (second order): 


ou Cu. Ou ou Gu. Ou 
A ’ 294577 9 +A u,——,>— 
OMe a, age Aa Oral sa ae ae, 
au ‘ (2.7) 
Pie ea 
ax, "Ot, Oe ax,” Ax, 


For first order partial differential ecintionn the following form is quasi-linear 
Aa %,0) E+ Ay (a,,%,,0) 2 = Bx, x,,0) (2.8) 
OX, Ox, 


Most of the established methods of solutions for differential equations are for linear 
ODEs or PDEs. Indeed, most of the differential equations to be discussed in this 
book are linear. Although some methods for solving nonlinear differential 
equations approximately have been developed (such as the perturbation method to 
be discussed in Chapter 12), in general, there are no well-accepted methods to deal 
with nonlinear differential equations. Quite often we have to rely on numerical 
methods to get approximate solutions for nonlinear differential equations. Exact 
solutions for nonlinear ODEs or PDEs exist only for certain special forms of simple 
nonlinear equations. Certain types of nonlinear ODEs can be transformed to linear 
ODEs and are solvable in known functions. The most notable examples are the first 
order Riccati equation and the Bernoulli equation. In considering a problem posed 
by Picard for second order nonlinear ODEs, Painlevé in 1900 identified six 
equation types that can be transformed to linear ODEs and can be solved 
analytically. Their solutions are called Painlevé transcendents and these equations 
contain only movable singularities as poles (or the so-called Painlevé property) (see 
Section 4.14). It was discovered that Painlevé equations arise as reductions of 
soliton equations, which are solvable by the inverse scattering technique (IST) 
(Ablowitz and Clarkson, 1991). See more details in Chapter 4. Painlevé was a 
student of Flex Klein, and served as French Prime Minister twice (see biography 
section). Therefore, our ability of classifying whether a differential equation is 
linear or nonlinear is crucial before we even try to look for a solution. 

One major problem in solving nonlinear PDEs or ODEs is that nonlinear 
differential equations may have more than one solution at some points of the 
independent variables. That is, there is no guarantee of having a unique solution at 
a certain domain of the problem. Typical examples include buckling of a long bar 
under compression (mathematically it was called Euler’s buckling because of his 
contribution to this problem), necking of a long steel bar under tension, and 
barrelling of a short soil specimen under confined compression. All these physical 


86 Theory of Differential Equations in Engineering and Mechanics 


phenomena of a uniquely and homogeneously deforming material yielding 
suddenly to a non-uniformly deforming pattern can be modeled mathematically as 
bifurcation problems of nonlinear differential equations. If the specimen continues 
to deform uniformly, the solution is called the equilibrium state. If the specimen 
deforms into a non-uniform state, the solution is called a bifurcated state or a 
bifurcation (since a unique solution bifurcates into two or more solutions at that 
point). When such non-unique solutions exist, the point in the domain is called a 
bifurcation point. If there is more than one solution in the solution domain, there is 
a related issue of which of these bifurcated solutions or the equilibrium state is 
stable (or it would appear in nature). The issue of dynamic stability needs to be 
considered mathematically (see the chapter on nonlinear buckling in volume two of 
this book series). This problem of bifurcation can occur repeatedly, that is, the 
bifurcated state can further bifurcate into a secondary bifurcation, and then to a 
tertiary bifurcation, and so on. When repeated bifurcation occurs, it is called a 
cascading bifurcation. It is believed that the onset of such complicated cascading 
bifurcations will lead to a “nearly” unpredictable solution or chaotic behavior of 
the solutions (or simply called chaos). In 1975, a Los Alamos scientist, Mitchell 
Feigenbaum, when solving a differential equation using the difference equation, 
discovered that period doubling of a nonlinear iterative solution has a universal 
pattern for a large class of nonlinear equations when chaotic solutions occur (the 
so-called Feigenbaum number of 4.6692 for predicting the occurrence of period 
doubling). This discovery leads to the new hope and belief that the so-called chaos 
is actually predictable, as opposed to the original definition of chaos. This view 
was popularized by James Gleick (1988) by his book Chaos: Making of a New 
Science. Unfortunately, not much progress has been made so far. 

In the present book, we mainly deal with linear differential equations. But 
some simple nonlinear differential equations will be covered whenever exact 
solutions for them can be found. 


2.5 PDE VERSUS ODE 


The main difference between ordinary differential equations (ODEs) and partial 
differential equations (PDEs) lies on the number of independent variables. If an 
unknown u depends on only one variable, the resulting differential equation is an 
ODE, otherwise, it is a PDE. The method of solution for ODEs is more well 
established (which will be covered in the next few chapters) whereas PDEs are 
often converted to ODEs using the method of separation of variables. For example, 
we can use separation of variables to convert the following PDE called the Laplace 
equation into an ODE: 


2 2. 
C4 {4 26 (2.9) 
ox” oy 
In particular, we assume 
u(x, y) = X(x)¥(y) (2.10) 


Substitution of (2.10) into (2.9) yields 
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a (XY) n O (XY) _ d’X(x) 
Ox? 6y? ae 
Note that the round 0 is replaced by d since X is only a os of x, whereas Y is 
only a function of y. Rearranging (2.11) gives 
1 dd’ X(x) 1 d°Y(y) _ 
X(x) dx? YQ) @& 
where X and Y are clearly nonzero (otherwise, there is no solution for u). Since the 
first term is only a function x whereas the second term is only a function of y, both 
of these terms must be constant. Thus, 2 must also be a constant. Whether / is 
positive or negative, it depends on the boundary condition. Later discussions in 
Chapters 7 and 9 on separation of variables will further tackle this issue. 
Nevertheless, (2.12) actually gives both the governing equations (ODEs) for 
functions X and Y: 


¥(y)+X(x)2 fa =0 (2.11) 


(2.12) 


2 
oe (2.13) 
a (2.14) 


From our discussion in Sections 1.11 and 1.12, one should immediately realize that 
the general solutions for (2.13) and (2.14) are respectively 


X =C,sinh Jax+C, cosh Vax (2.15) 

Y =C,sinVAy+C, cosVAy (2.16) 

where the unknown constants need to be determined by boundary conditions. For 

certain given boundary conditions, / need to prescribe as negative (or A = —|A1), 
and thus the general solutions for X and Y will be reversed: 

Y =C,sinh Aly +C, cosh ffaly (2.17) 

X =C,sin,/|A|x+ C, cos ,/|A|x (2.18) 


For this particular example, we see that solving one PDE (i.e., (2.9)) is equivalent 
to solving two ODEs (1e., (2.15) and (2.16)) if separation of variables is 
applicable. Unfortunately, as we will demonstrate in a later chapter, separation of 
variables does not work for all PDEs. 


2.6 NONHOMOGENEOUS VERSUS HOMOGENEOUS 


Recall from equation (2.6) that there is a term f(x) on the right hand side which is 
independent of the unknown function u. Whenever such term exists, the differential 
equation is called nonhomogeneous and f(x) is called the nonhomogeneous term. 
For example, we can rewrite (2.6) as 
d"u d”u 
he + a, (x) Ae 
Clearly, this is the same as (2.6) if we realize that f = -g. Thus, the 
nonhomogeneous term can be on the left or on the right; it will not change the fact 


t++s+a,(x)u+ g(x) =0 (2.19) 
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that it is nonhomogeneous. As will be seen in the next chapter, the general solution 
of nonhomogeneous differential equations must be the sum of the solution of the 
corresponding homogeneous equation (i.e., by setting fin (2.6) or g in (2.19) to 
zero) and the particular solution satisfying the nonhomogeneous equation. 
Mathematically, we have 

u(x) =u,(x)+u, (x) (2.20) 
In other words, if the differential equation is nonhomogeneous, we have to solve 
the problem twice. Therefore, it is important to classify the type of differential 
equation before trying to solve for its solution. 


Example 2.1 Classify the following differential equations 

wy 2x+1 (2.21) 

dx 
Solution: This is a linear first order nonhomogeneous ODE. The term 2x+1 makes 
it nonhomogeneous. The highest derivative term is first order. There is only one 
variable, so it is an ordinary differential equation. Note that this first order ODE can 
be integrated directly as 

y= [QxtIde+C=x? +x4+C (2.22) 


This is called a separable ODE and will be further discussed in Section 3.2.1. 


Example 2.2 Classify the following differential equation 
(y—x’)dy —S5ydx =0 (2.23) 
Solution: First of all, we can divide the whole equation by dx to get 


Co ee (2.24) 
dx 


The coefficient for dy/dx is a function of y, thus it is a nonlinear differential 
equations, or more precisely, quasi-linear. It is a homogeneous first order ordinary 
differential equation because there is no term in the equation that is independent of 
y. 


Example 2.3 Classify the following differential equation 


dy _, (dy) 
We +t (2) +y=0 (2.25) 
Solution: There are both first and second derivative terms, but the highest order 
derivative term always controls. Thus, it is of second order. The differential 
equation is nonlinear because of the nonlinearly in the second term, but is 
homogeneous as all terms are functions of the unknown y. There is only one 
variable and “d” is used instead of “Q” so it is an ODE. 
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Example 2.4 Classify the following differential equation 
4 2 

Oe RO peels (2.26) 
dt* dt’ 
Solution: Again the highest derivative term controls, and thus it is a fourth order 
ODE, not second order. In contrast to the last example, here x is the unknown, not 
the independent variable (¢ is the independent variable). The term sinf¢ on the right is 
independent of the unknown x and thus is the nonhomogeneous term. Thus, it is a 
nonhomogeneous ODE. All x and its derivative terms appear linearly, and thus it is 
a linear ODE. 


Example 2.5 Classify the following differential equation 
2 2 

ae ey uz =0 (2.27) 

ox” Oy” 
Solution: The highest derivative is of second order. This is clearly a PDE as wu is a 
function of x, y, and z. All terms involving wu and its derivatives are linear, and thus 
it is a linear PDE. The term x+y is independent of the unknown wu and thus it is a 
nonhomogeneous term. Therefore, (2.27) is a linear nonhomogeneous PDE of 
second order. 


2.7 SOME SIGNIFCIANT NONLINEAR PDES 


In reality, nearly all real systems are nonlinear in nature. It just happens that most 
of these physically real systems are well behaved and can be modeled adequately 
by a linear model most of the time, except at certain given parameters that satisfy 
the bifurcation condition. Many significant physical phenomena cannot be 
explained by a linear model. In this section, we will list some of them. 

The pendulum equation is one of the first nonlinear differential equations that 
attracted the attention of applied mathematicians. The horizontal force equilibrium 


for the mass m shown in Figure 2.1 is 
2 


m< + F, sind =0 (2.28) 
2 
where @ is the angular rotation and is measured in radians. For small-amplitude 
oscillations, the vertical acceleration of the mass can be neglected. For such case, 
the tension in the spring can be related to the weight of the pendulum 
F, cos@ =mg (2.29) 

where g is the gravitational constant (9.81 m/s”). For a small value of @ we can 
further assume the following approximations: 

x=Lsn@xL0, cosO~1, F, »mg (2.30) 
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where L is the length of the pendulum shown in Figure 2.1. Substitution of (2.29) 
and (2.30) into (2.28) yields the following differential equation for the angular 
rotation of the pendulum 
do 
dt? 
If the amplitude of pendulum motion is small, we can approximate sin@ ~ @. As 
expected, the linearized equation becomes 
do 
dt” 
where @ = g/L. This is the governing equation for sine and cosine as we discussed 
in Section 1.11, and thus we have 
0=C, sinat+C, cos at (2.33) 


Physically, @ is the natural circular frequency of the swinging motion. It is also 
clear that there is no damping built into the model given in (2.28). In reality, there 
is always air resistance or internal friction in the pendulum against the swinging 
motion. Nevertheless, neglecting air resistance and assuming small amplitude of 
pendulum motion, we have the natural period of oscillations as 


ran [2 (2.34) 
z 


There are at least two major significances associated with this simple result of 
pendulum motion. In the eighteenth century, Newton used the observed changes in 
the period of oscillation of a pendulum at various places on the earth’s surface and 
deduced that the gravitational constant is not a constant. He further deduced that 
the earth bulges at the equator. This period of oscillation is an intrinsic property of 
the earth. For example, the period of oscillations of the same pendulum on Mars 
would be different because of the difference in the gravitation field on Mars. 

On the other hand, the strength of gravity on any planet clearly relates to the 
total mass of the planet which, in turn, depends on the size of the planet. Because 
of this, it has been proposed that the “standard length” on earth should be related to 
the size or gravity of the earth. It was proposed that the standard length meter can 
be defined as the length ZL of a pendulum that gives a fundamental period of 2 
seconds. Thus, rearranging, 


+Fsind=0 (2.31) 


+@°0=0 (2.32) 


dx 
Per eae 
dt’ 


mg 


Figure 2.1 Pendulum problem 
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L= Pee, = 9.821) = 0.995075344m (2.35) 
2a 2a 


However, the currently adopted definition of a meter is taken as one ten-millionth 
of the distance between the equator and the pole. 


(4. Pe __* __ 20 ppeeenouin 36) 
4 "10000000 4 10000000 


where R is the radius of the Earth. 


Example 2.6 Solve the nonlinear pendulum equation given in (2.28) with the 
boundary condition that the maximum angular rotation equals @ when dO@/dt = 0. 


Solution: The first and also the most crucial step is to recognize the first integral 
(or integrate the equation once) to get 


1 dé, g 
0=C 2.37 
2a a al 


where C is an arbitrary constant. If we differentiate (2.37) once, we have 
d0.d°0 ¢. dO. dOdO0 ¢. 
+2sin@ = + 2sin 0) =0 2.38 
Ce L ae di L ) oe 
Since d@dt cannot be zero (otherwise there will be no oscillation), we must have 


2 
T+ Esind =0 (2.39) 


This is precisely (2.28). Next, we can impose the boundary condition to (2.37) to 
find C as 
C= ~F cos @ (2.40) 


Substitution of (2.40) into (2.37) and rearranging the resulting equation gives 
do 2g {cosd cosa) 
— =,/— J(cos0—cos@ 2.41 
HNL ( ) (2.41) 


Clearly, all functions of @can be put on one side to yield (this is so-called separable 
and will be discussed in the next chapter) 


tie oe (2.42) 
2g ,/(cos @—cos @) 
The following change of variables is then assumed 
cos@ =1-2k? sin? ¢, k= sin 5 (2.43) 
Recall the double angle formula for cosine; we have 
1-cos@ =2sin? ) (2.44) 


Using this formula, we have 
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cos @— cos @ = 2k’ cos’ ¢, 
sin 0 = 2k sin g/1—k° sin’ ¢, (2.45) 


sin 0d0 = 4k’ sind cosd¢d¢ 
Substitution of (2.45) into (2.42) leads to 


: fF dg 
& Jl-k’ sin’ ¢ 
For @= 0, (2.43) shows that cos @=1 and thus ¢= 0. If the angle rotation increases 
to &, po can be evaluated from 


(2.46) 


1-cos@, _ sin’(0, /2) 


sin’ ¢, = ap 2 (2.47) 
Therefore, we have 
in(O, / 2 
¢, = sin (2.48) 
Integrating (2.46) from 0 to ¢p, we have 
L ph do L 
= = |—F(q,k) (2.49) 
fz Ji-esin'g Ve” 


where F(¢,k) is the elliptic integral of the first kind with k < 1 (Abramowitz and 
Stegun, 1964). This integral cannot be evaluated in terms of any known functions, 
and thus, a numerical table of this integral has been evaluated. The elliptic integral 
appears naturally in many problems in engineering and science, including the 
elliptic motions of celestial bodies. This function was studied extensively by A. 
Legendre. To find the period of pendulum oscillations, we note that when 0 = a, 
the angular velocity is d@/dt = 0 and thus ¢ = 7/2. Consequently, the period T 
becomes 


raale [ ” “Flaw k)=4 a =K(k) (2.50) 
yi-K sin? 6 = sin? & 
where K(k) is the complete elliptic eee of the first kind (Abramowitz and 
Stegun, 1964). 
For the general solution of rotation at any time, we have 


Leo dy 
= (2.51) 
(Es yl—k? sin’ y 


Finally, the angular rotation can be evaluated from (2.51) by adopting Jacobi’s 
elliptic functions, which was proposed independently by C.G.J. Jacobi and N.H. 
Abel in 1827. In particular, we can write 


oS I’ dy 
° Jl-? sin’ y 
We can solve for ¢ from (2.52) and denote its solution symbolically as 
sn(u,k) =sing, cn(u,k)=cos@ (2.53) 
where sn is called the Jacobi elliptic sine and cn is called the Jacobi elliptic cosine 
(Abramowitz and Stegun, 1964). In a sense, you can consider the Jacobi elliptic 


(2.52) 
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functions are the solution of the integral (2.52) with given u and k. Elliptic integral 
and Jacobi’s elliptic functions are among the very first functions specially defined 
to solve nonlinear differential equations. They are considered as “special functions” 
(versus the standard functions of circular and hyperbolic functions) and the essential 
results for studying nonlinear differential equations. Applying (2.52) and (2.53) to 
(2.51), we obtain 


: g,, 1.0 
sing so] rae (2.54) 


The last of (2.54) is a consequence of using the first equation of (2.43). Finally, we 
have the exact solution for the rotation 0 as 


O(t) =2sin™ c suf | k= sin(5) (2.55) 


This is one of the very first nonlinear differential equation for which an exact 
solution can be found. But, in general, this is not possible for most nonlinear 
differential equations. 

More mathematical properties of the Jacobi elliptic function can be found in 
Appendix D. 


Another nonlinear differential equation used to model the oscillations of current in 
an electric circuit connected to a tunnel diode with nonlinear damping is called the 
van der Pol equation, which was named after Dutch electrical engineer Balthasar 
van der Pol (1889-1959). Mathematically, the van der Pol equation reads as 


2 
u(l ye +y=0 (2.56) 


dy 
dt? 
Although the nonlinear damping term is only of second order and looks innocent, 
but it leads to chaotic behavior of the oscillations. 
In gas dynamics and traffic flow, a one-dimensional model called Burgers’ 
equation is found useful 


2) 
—+u— =v— 2.57 
Ot Ox Ox? ca, 
This equation was studied by Burgers in 1948. We will see later that this is just a 
special case of the Navier-Stokes equation for fluid flows (see Section 2.8.2). 


Example 2.7 Show that Burgers equation can be transformed into a linear diffusion 
equation of a function ¢ by the Cole-Hopf transformation defined by 


y 
=y =-2v= 2.58 
u=Y, v F (2.58) 


Solution: The first and the most crucial step is to rewrite (2.57) into a form of 
conservation law: 


u, Gu -vu,), =0 (2.59) 


Independently, Hopf in 1950 and Cole in 1951 introduced the following system 
u=Y, (2.60) 


94 Theory of Differential Equations in Engineering and Mechanics 


y, = vu, 50 (2.61) 


It is straightforward to show that (2.60) and (2.61) is equivalent to (2.59). Next, 
they introduced the following function o 
y=-2vlng (2.62) 


Differentiation of (2.62) with respect to both x and ¢ gives respectively 


u=Yy, =-2v db. » W,=-2v d, (2.63) 
ob op 
Differentiating u once with respect to x gives 
le =e = DV {* é } (2.64) 
op ¢ 


Substitution of the first equation of (2.63) and (2.64) into (2.61) gives 


_ 1 an 2 px p..2 1 2/%.2 
ae | & &y | ae 


(2.65) 
=-2y’ Bex =-—2v ¢, 
op 


Recognizing the second of (2.63), we get the last of (2.65). Thus, the last two of 
(2.65) can be reduced to 


Vo.. = 9, (2.66) 


This is the linear diffusion equation, which will be discussed later in Chapter 9. This 
leads to a much deeper question: What kind of nonlinear PDE can be transformed into 
a linear PDE? As discussed in Section 2.4, Painlevé equations provide a partial 
answer for the case of the second order ODEs (see Section 4.14). 


Another important nonlinear PDE is the KdV equation: 
Gu Ou. eu 
ss ye 
ot ack Ox 

Korteweg and his PhD student de Vries studied the solitary wave observed in 

Scotland by civil engineer, John Scott Russell (1808-1882), using this nonlinear 

equation. The two terms on the right hand side of (2.67) are competing terms; the 

dispersion term tries to diffuse out the wave amplitude while the nonlinear 
convective term tries to build amplitude nonlinearly. When there is a delicate 
balance between these two terms, the amplitude of such a wave will not decay with 
propagation and it is called a soliton. Scientists believe that the “giant red” eye on 

Jupiter is a soliton, rogue or freak wave observed in oceans is a soliton, and a cloud 

system called “morning glory” in Australia is also a soliton. A soliton is not a 

phenomenon restricted to waves in fluids, and it has been observed in many 

different disciplines of science and engineering. For example, Alan Hodgkin won 
the Nobel Prize in medicine in 1963 when he found that the nerve pulse in squid 
behaves as a soliton. This equation can be solved by a new analytic technique for 

nonlinear differential equations called the inverse scattering technique (IST). In a 

sense, the IST is similar to the Fourier transform technique for linear differential 


(2.67) 
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equations, and is a major advance in solving soliton problems (Ablowitz and 
Clarkson, 1991). It is also related to Painlevé equations. 
Another differential equation closely related to KdV is the Boussinesq 
equation: 
Ou Ou Ou ,d(u’) 
— -— - +3 
or ae pe Ox? 
It was named after French physicist and mathematician J. Boussinesq (1842-1929). 
The first two terms form a “typical” 1-D wave equation, the third term is the fourth 
order dispersive term, and the last term is the nonlinear term. In a sense, this 
closely resembles the KdV equation, with the third and fourth terms competing one 
against another. Again, soliton type solutions can be obtained for the Boussinesq 
equation. However, it should be noted that there are many different versions of the 
so-called Boussinesq equation in the literature, although somewhat similar to 
(2.68). 
The Duffing equation is a nonlinear differential equation that models 
nonlinear oscillations of a pendulum subjected to friction and driven by a period 
force: 


=0 (2.68) 


2 
SP ytey =0 (2.69) 
George Duffing (1861-1944) published a small book on this equation in 1918 and 
it has been investigated by many others since then. The period of oscillations not 
only depends on ¢, but also on the amplitude of the motion. 

The sine-Gordon equation was studied by Walter Gordon (1893-1939) and is 
another equation closely related to the soliton: 

Ou 
ot? 
This equation can again be solved by using the IST. 

A nonlinear differential equation called the Thomas-Fermi equation, named 
after British physicist L.H. Thomas and 1938 Nobel Prize recipient E. Fermi, was 
formulated in modeling the distribution of electrons in a heavy atom. This theory 
was proposed independently by Thomas and Fermi in 1927. The Thomas-Fermi 
reads (Davis, 1962) 


Ou, . 
+ oy tsinu =0 (2.70) 


a oe y (2.71) 
This equation is of fundamental importance in quantum mechanics, and has been 
studied and improved by many famous physicists. The unknown y is proportional to 
the potential energy of an atom (the sum of electrostatic and chemical potentials) or 
equivalently proportional to 2/3 of the power of the electron density, and the variable 
x is a normalized distance from the center of an atom. This equation is for a neutral 
atom with spherical symmetry in electron density. 


Example 2.7 Give an approximate solution for the Thomas-Fermi equation with the 
following boundary conditions: 
y(0)=1, y(x)> 0, as x0 (2.72) 
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Solution: The following approximate solution was proposed by A. Sommerfeld in 
1932 (see Davis, 1962). We will mention Sommerfeld again in later chapters and 
his legendary story of being nominated 81 times for the Nobel Prize (see Appendix 
B). First, the following change of variables for (2.71) is applied 


1 w 
x = ’ = 2.73 
ae (2.73) 
Thus, we have 
dx = -sat (2.74) 


dy _ldw wdt _ ,aw 


dx tdx ¢ dk dt 

The last of (2.75) is a result of the substitution of (2.74). Differentiating (2.75) one 
more time with respect to x, we get 

d’y dt dw dew, dw 

dx? dx dx dxdt dx 


+w (2.75) 


(2.76) 
2 dw 3 d’w 2 dw 3 d’w 
= —+P — =P — 
dt dt dt dt" 
The right hand side of (2.71) becomes 
3/2 
pry? - At _ 1 an (2:77) 
t t 
Equating (2.76) and (2.77), we obtain a governing equation for w 
d°w 
: =w? 2.78 
dt? oa 
The boundary conditions given by (2.72) become 
w~t, ast—>o, w(0)=0 (2.79) 


The degree of nonlinearity of (2.78) is actually similar to that of (2.71). First of all, 
Thomas in 1927 recognized that the following is a particular solution of (2.78) 


w, = 14424 (2.80) 
Differentiating (2.80) twice leads to 


d’w 
= 17287 (2.81) 


Ee 


On the other hand, substitution of (2.80) into the right hand side of (2.78) results in 

wr? = (144y? AO 21277? = 17280 (2.82) 
Evidently, (2.78) is satisfied by (2.80). In addition, the second boundary condition 
is satisfied. However, the first boundary condition remains to be satisfied. We must 
modify the solution given in (2.80). We assume the following form 


w=w(l+at’) (2.83) 
Substitution of (2.83) into the left hand side of (2.78) yields 
d’w d>w dw, 
*—— =f —+(l+at*)+2— an” AA 10?” 
Ce eg Oe (2.84) 


=17281°(1+ at*)+1152aAt*** +144aA(A—1)t**® 
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This can be further simplified to 
2 
«dW _ 1708 {1 tat? + : at + . ana ov’ 


t = 
1? 


=17289° {1 +at’ [1+ oA + SAA -»)| (2.85) 


= 1728¢° {1 + 526 +A)(4+ a} 


Substitution of (2.83) into the right hand side of (2.78) leads to 
we? = wr? (l ‘oy = (144)*” peo (ear y* 


2.86 
= 17287° {ieSar +e ait --} er) 


The last expansion of (2.86) is a result of Binomial series expansion (see Section 
1.2 of Chapter 1). Clearly, (2.85) cannot be set equal to (2.86). However, when t > 
0, higher order terms of ¢ on the right hand side of (2.86) can be neglected. Thus, 
we can get an approximation as 


1 3 
—(3+A)(44+A) == 2.87 
ae (3+A)(4+4) 5 (2.87) 
Or equivalently, we have 
A? +7A-6=0 (2.88) 
The two roots of this quadratic equation are 
4 SERIE =0:772, 24,= see 7979 (2.89) 


We obtain an approximation of (2.78) if these values of A are chosen. But yet we 
have not satisfied the first boundary condition given in (2.79). Note also that we 
have not arrived at any value for a. We further extend the approximation to a more 
general form (compare (2.83)) 
w=w(l+ Bt") (2.90) 

We seek to choose B and n properly such that the first boundary condition can be 
satisfied. Substitution of (2.80) into (2.90) into gives 

w=w(1+ Bt*)” =[1440 (1+ Bt7)" It (2.91) 
For t > ©, we need to satisfy the first equation of (2.79) so that it results in 


n n n 1 n 
14427 (1+ Bt*)"= 1448 B"t* re ~1 (2.92) 


For large ¢, we can set 
34+nA=0, 14468" =1 (2.93) 
The first equation of (2.93) will ensure that (2.92) is proportional to 7° (ie., 
independent of f), and the second of (2.93) will ensure the magnitude of (2.92) is 
one as t > 00. Recall from (2.88) that 
J? + 74-6 =(A-A) A= Ay) =A? (Ay $y) + Ady (2.94) 
Thus, we must have 
AA, =-6 (2.95) 
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We can set 


3 A 1 
A = EJ = = — J = 
A A 2 B 144" 


where A; and Az have been obtained from (2.89). Substitution of (2.80) and (2.96) 
into (2.90) results in 


(2.96) 


1 


w= 1440714 mae pe (2.97) 

We can now back substitute (2.97) into (2.73) 

144 1 lia van 144 144.4 3,2,/2 
= 14 ya = 14 ag ter 2.98 
yoy a? = Sa) (2.98) 
This can be written as 
Ay /3y Ag /2 144 

Y=VOUMYCOr FY, WiC) (2.99) 


As a double check, we see that as x > 0 we have y; > « and thus 
V=YQDAF Ly C7}? = yCOLY CO = y, COLA. =1 (2.100) 
Therefore, the first boundary condition in (2.72) is satisfied. 


The solution given in Example 2.7 is found in reasonably good agreement 
with numerical integration of (2.71). We can see from this example that one needs 
extraordinary mathematical skill and insight to find an approximation to the 
nonlinear differential equation. Unlike the pendulum problem, no exact solution is 
found. In addition, such ingenious solution is only an approximation, and the 
solution procedure is not straightforward and cannot be generalized easily to other 
nonlinear differential equations. Because of this difficulty, the method of solution 
for nonlinear equations is less developed. 


2.8 SYSTEMS OF DIFFERENTIAL EQUATIONS 


In this section, we will report a number of systems of differential equations that we 
commonly encounter in engineering and mechanics. They include Maxwell’s 
equations, Navier-Stokes equation, and equations of motion of waves in solids. 


2.8.1 Maxwell Equations 


For electrodynamics, the magnetic field vector B and electric field vector E are 
coupled through the set of Maxwell equations: 
vV-E=£, V-B=0, 
. ae oe (2.101) 
Vx BF =-—, VxB=u(Jt+e—) 
Ot Ot 


where ¢ is the permittivity of the material, 1 is the permeability of the material, p is 
the charge density, and J is the electric current. The first, second, third, and fourth 
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equations of (2.101) are the Gauss law for electricity (relation between electric 
field and electric charge), the Gauss law for magnetism (non-existence of magnetic 
charge), the Faraday law (electric field induced by changing magnetic flux) and the 
Ampere law (magnetic field induced from electric current). This set of Maxwell 
equations unified all magnetic and electric effect in terms of electrodynamics. Note 
also that the Maxwell equations can be expressed in integral forms. However, we 
will not pursue such possibility in the present book, as our focus here is on the 
study of differential equations. 

For the special case of a vacuum with no electric charge (9 = 0) and no 
electric current (J = 0), electromagnetic waves can be specialized as: 

V-£=0, V-B=0, 


aB 1 0B (2.102) 


VeE=——.. Vane 
Ot 2 Ot 


where 


ae (2.103) 


Hoéo 
Here c is the speed of light in a vacuum and jo and & are the permeability and 
permittivity in a vacuum. 
Taking the curl of the third of (2.102), we have 
Vx(VxB)=V(V-E)-V’E=-V’-E 
a(v x B) (2.104) 
Ot 
where we have used the Gauss law of electricity that the divergence of E is zero. 
Take the time derivative of the fourth of (2.102) yields 


é 1 CE 
—(V¥xB)=—— 2.105 
a ) carve (2.105) 
Combining (2.104) and (2.105) gives 
1 0B 
Ys — = (2.106) 
ce or 


This is a vector wave equation for the electric field F. Similarly, taking the curl of 
the fourth equation of (2.102) gives 


Vx(VxB)=V(V-B)-V’B=-V’B 


_ 1 AVxE) (2.107) 
ce ot 
On the other hand, the time derivative of the third of (2.102) gives 
a aB 
(Vee) ==—.- (2.108) 
Ot Ot 
Combining (2.107) and (2.108) results in another vector wave equation for B 
2 
VB oS (2.109) 
c Ot 


In conclusion, for the case of a vacuum, both the electric field and magnetic field 
propagate as waves, called electromagnetic waves, and their speed of propagation 
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is the speed of light in vacuum (c ~ 2.99792458x10%m/s). In addition, if the electric 
and magnetic fields are independent of time, we have 
V°B=0, V’E=0 (2.110) 

That is, electrostatics is governed by the Laplace equation. This also suggests the 
existence of an electric potential and a magnetic potential. Since the electric and 
magnetic fields do not change with time, physically the Laplace equation models 
the equilibrium problem of an electric field. 

Without going into the details, we report that for the case of a non-vacuum 
with charge and current, we can express B and E in terms of a vector potential and 
a scalar potential as 


1A 
B=-v9-* , B=VxA (2.111) 
t 
The resulting governing equations for A and @ are 
2 
A 
Vda je pd (2.112) 
Ot 
2 
Vp— pe 2 =f, (2.113) 
Ot E 
provided that the following condition, called the Lorenz gauge, is assumed 
Viana? (2.114) 
ot 


This Lorenz gauge is only one choice of the gauge freedoms and it is not unique. 
The Lorenz gauge is named after Danish mathematician L.V. Lorenz (1829-1891). 
Note that (2.112) and (2.113) are nonhomogeneous vector and scalar wave 
equations. Therefore, nonhomogeneous wave equations occupy a central place in 
the solution of electrodynamics or Maxwell equations. However, the 
nonhomogeneous wave equation is not covered in most textbooks on differential 
equations. We will discuss it in Section 9.3 and the solution is given in (9.207). 
Another choice of gauge is the so-called Coulomb gauge (Zangwill, 2013) 

which is 

V:-A=0 (2.115) 
It is named after French physicist Charles-Augustin de Coulomb (1736-1806), who 
made contributions on electrostatic force and friction. The resulting governing 
equations for A and @ become 

2 


VA ye = pd + we(L) (2.116) 
Vv o=ee (2.117) 
é 


This gauge choice is popular because we can solve the Poisson equation for @ 
given in (2.117) before solving (2.116). 

In a later chapter in the second book of this series, we will discuss these 
gauge theories and the mathematical theory of Maxwell equations in more detail. 
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2.8.2 Navier-Stokes Equations 


In deriving the Navier-Stokes equations, we need to use the concept of a control 
volume versus a point mass considered for rigid body dynamics. The idea of 
control volume is depicted in Figure 2.2. In classical mechanics, the dynamics of a 
rigid body is referred as the Lagrangian formulation in which the displacement 
vector is referred to as a fixed point in a rigid body. We say that the displacement 
vector is formulated in Lagrangian coordinates. For fluid mechanics problems, if 
control volume is used, the displacement vector is referred to as a fixed point in 
space (i.e., does not follow a fluid particle). We say that the displacement vector is 
formulated in Eulerian coordinates because the control volume formulation was 
first proposed by Euler. 

First of all, we assume that there exists a flow field described by the flow 
velocity field uw, which is a function of a three-dimensional position (x, y, z) and 
time ¢. Let us consider the total differential of u with respect to its dependent 
variables, 
jae ee ee Se (2.118) 

Ot ox oy Oz 


Dividing the whole expression by dt, we have the acceleration of the flow 


Du ou oOudx oOudy oOudz ou Ou Ou Ou 

= = +u—+Vv—+w (2.119) 
Dt ot Oxdt Ovdt cozdt oa ox oy OZ 
where the physical components of the velocity u are denoted as u, v, and w. Note 
that instead of d we have employed the notation D. The differential operator D/Dt 
is called the material time derivative or the advective time derivative in contrast to 
the ordinary time derivative d/dt. In terms of tensor notation, (2119) reads as: 

D 
Pe (2.120) 
Dt ot 


Flow streamlines 


Fluid particle 


Control volume 


Figure 2.2 Particle and control volume descriptions of a fluid flow 


where the first term is a result of unsteady flow and the second term is the acceleration 
induced by fluid flowing in and out of the control volume. Physically, the acceleration 
calculated in (2.119) is experienced by somebody who moves locally with the fluid in 
the control volume. Since the second term is resulting from fluid flowing in and out, it 
is normally called the convective term. The main nonlinearity in Navier-Stokes 
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equations actually comes from this term. Alternatively, the material time derivative 
can also be expressed as 
2 
_Du_ ou : Ou \u | 


a= — =— +u-Vu=—+V(—)-uxVxu (2.121) 
Dt ot Ot 2 


This result is illustrated in the following example. 


Example 2.8 Prove the following vector identity: 

1 2 

ave =uxVxut(u-V)u (2.122) 
Solution: To simplify the notation, let us rewrite the magnitude of u as 


u=|u|=Ju-u =(uu,)"” (2.123) 


In terms of the Cartesian tensor, the left hand side of (2.122) is 


1 2 1 @uw 1 Ou 
V|ul =-e, — =-e,2u 
2 2 Ox, 2 Ox, 
(2.124) 
1 1 Ou, Ou, Ou, 
=eu TE [u, + u, | =U, e, =Uu,U, je; 
2 (u,u,) Ox, OX, Ox, 


On the other hand, we have (recall tensor analysis in Section 1.19 of Chapter 1) 
UXV XU = UX (CU, C;) = Cm: PixU mM 


nmi ~ ijk n 


= (6,,O nk = On On U,, U; je, (2. 125) 
— (UU, , a Uy Uk Je, 
and 
é é 
(u-V)u =(ue,-e; ——)uje, = Oj, ——U,e, = Uj, €, =U,U, .e, (2.126) 
Ox; Ox; 


Thus, adding (2.125) and (2.126) we have 

UXVXUt(U-V)U = (UUy 4 TU Uy HUM, JO, =U Uy n€n (2.127) 
Comparison of (2.124) and (2.127) yields the required result given in (2.122). This 
completes the proof. 


With the proved identity given in (2.122), the last convective term given in (2.121) 
is obtained. 

We now consider the continuity of fluid within any control volume. The mass 
flux through the closed surface of the control volume adding to the change of mass 
with time must be zero: 


§ pu-dA+ I pav =0 (2.128) 
ov ot V 
By applying the Gauss theorem to the first integral we have 
O (eye) 
V-(pu)dV +—| pdV = |[V -(pu) + —]dV =0 2.129 
J (pu) al? Jt (pu) += (2.129) 
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Since the choice of control volume is arbitrary, we must require the integrand itself 
to vanish or 


V-(pu)+ 2-0 (2.130) 


This is called the continuity condition of fluid flow. Note that this continuity 
equation is true for all fluids, whether viscous or not. If the density of the fluid does 
not change with time, we have 
V-u=0 (2.131) 

This is called the incompressibility of fluid. When this assumption is valid, the fluid 
flow is called incompressible flow. We will see that incompressible flow also leads 
to the Laplace equation, indicating the existence of a potential (see Section 9.7 for 
details). Therefore, incompressible flow is also called potential flow. 

We will now look at force equilibrium in fluid flow. The force acting on a 
control volume of fluid can be considered as the divergence of a stress tensor: 


p(t +u-Yu)=V-o (2.132) 


Physically, the change of stress along spatial coordinates (the right hand side of 
(2.132)) leads to the unbalancing of force on the control volume and this force will 
lead to acceleration of the control volume (the left hand side of (1.132)). For fluids, 
the stress tensor can be subdivided into two contributions, a pressure term and a 
viscous stress term, as 

o=-pl+oa' (2.133) 
where J is the unit second order tensor, p is the fluid pressure at a fixed point, and 
o’ is called the viscous stress tensor. In terms of dyadic notation, the unit tensor is 

T=6,ee, (2.134) 


you's 

The viscous stress models the irreversible “viscous” transfer of momentum whereas 
the pressure term is for reversible transfer of momentum. Viscosity also leads to 
dissipation of energy through a frictional type of heat loss, and this is why it is 
irreversible. 

Naturally, the stress within the fluid depends on how fast the fluid is flowing. 
A good first order of approximation is that the viscous stress tensor is proportional 
to the gradient of the velocity vector (Landau and Lifshitz, 1987): 


o =n[(Vu)’ Yu SV -w)]+ S10 -u) (2.135) 


where 77 and ¢ are the coefficients of viscosity and second viscosity. At first look, 
the viscosity term looks odd and there is a common factor overlapping with the 
second viscosity term. The choice of the first term actually reflects that the viscosity 
is only related to the deviatoric terms (or non-axial terms) and physically viscous 
irreversible transfer of momentum only relates to the rate of shear deformation and 
is independent of the rate of volumetric change. To see this, we can take the trace of 
(2.135) such that 


; 2 
tr(o’) = 7[2u, , mg igh Oa =3¢u (2.136) 


ii 
This clearly reflects that the viscosity 77 relates to the rate of shearing deformation 
only. We can rearrange (2.135) rewriting the viscous stress as 
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o' =7{(Vu)’ +Vul+Al(V -u) (2.137) 
where 


GaA+ton (2.138) 


Physically, both n and € must be positive or 

¢>0, n>0 (2.139) 
The first inequality of (2.139) reflects the fact that dissipation leads to a decrease in 
mechanical energy (instead of increase); whereas the second term of (2.139) reflects 
the fact that irreversible processes of internal friction lead to increase in entropy, a 
term in thermodynamics associated with the change from less probable state to a 
more probable state. The value of € reflects how fast or slow the relaxation time is 
to restore equilibrium. Normally, the order of magnitude of viscosity and second 
viscosity are comparable (i.e., ¢ ~ 1). Under certain circumstances, we may have € 
>> y and in such case there will be a change of volume and in turn a change in 
density. Although we normally assume € is a constant, in reality € reflects the rate 
of compression versus the relaxation time. For example, for sound waves in fluids, 
€ should depend on frequency (how fast the compression and tension are applied). 
When this happens, the waves in fluid are called dispersive. 

Substitution of (2.134) and (2.133) into (1.132) leads to 


p(s +u-Vu)=—-Vp+V- jc +Vu- 51 wi} VIS(V-u)] (2.140) 


This is the most general Navier-Stokes equation for viscous fluids. This can be 
rewritten in a more compact form by noting the following identity: 


V-{7(Vu)' | =Vxt(V xu)]+V -[7(Vu)] (2.141) 


This is proved in the following example. 


Example 2.9 Prove the following vector identity: 
V-{7(Vu)' | =Vxt(V xu)]+V -[7(Vu)] (2.142) 
Solution: Let us consider the first term on right hand side of (2.142) 
Vx[7(V xu)]=Vx (e.My, /&:) = Cnmi ijk (Uy jm C1 = Sinm ik (Uy jm e, 
= (Oj Fn — One Ong Mg, mn = Mann m En — M4 mm En (2.143) 
=V-{n(Vu)’ }-V-[7(Vu)] 
Rearranging (2.143) gives 
V-{7(Vu)' | =Vxt(V xu)]+V -[7(Vu)] (2.144) 
This completes the proof. 


With the identity given in (2.141) in hand, the last two terms of (2.140) can be 
rewritten as 
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V- {rcv +Vu- =v wa} +V[C(V-u)] 
=Vx[7(V x u)]+V -[7(Vu)]+ V -[7(Va)]— SVinv ‘W)I+V[G(V-u)] (2.145) 


= VV x u)}+2V [Va] + VAS —F (V0) 


Finally, combining (2.145) and (2.140) we have another form of Navier-Stokes 
equations 


p(s +u-Vu)=—-Vp+Vx[n(V xu)]+2V -[7(Va)]+ VCE -SV-w)} (2.146) 


Note, however, that this equation differs from (3.48) of Hughes and Brighton 
(1967) and from (1.33) of Hughes and Gaylord (1964). There are clear typos in 
both of them. To the best of our knowledge, this correct form has not been reported 
in the literature. 

In general, both ¢ and 7 can be functions of pressure and temperature. If both 
¢ and n are independent of pressure and temperature, we have 


1 
pH +u-Yu)=—Vp+ Vu (+= VT -u) (2.147) 
For incompressible flow, by virtue of (2.131) we have 
pt +u-Yu)=-Vp +n (2.148) 


Let us reiterate that all nonlinearity of the Navier-Stokes equation comes from the 
convective term. In fact, the KdV equation, Burgers equation, and Boussinesq 
equation are special cases of Navier-Stokes equations. For example, for 1-D flow 
with no pressure gradient applied, we can replace u by u and 


du, Qu_noOu_ Ou 


+u Vv 2.149 
ot ak pax ( ) 
where 
pot (2.150) 
p 


is the kinematic viscosity (n is also known as dynamic viscosity). Evidently, (2.150) 
is the Burgers equation given in (2.57). At 20°C, the values of v for water and air 
are 0.010 and 0.150 cm?/sec. Thus, water is 15 times more viscous than air, and this 
is the reason why it is so hard to swim fast in water. 

In closing, we should mention that the Clay Mathematics Institute offered US 
$1 million for anyone making “substantial process toward a mathematical theory 
which will unlock the secrets hidden in the Navier-Stokes equations.” Indeed, this 
nonlinear differential equation leads to all phenomena that we observe in fluids 
every day, including waves behind our boats, and turbulent currents following a 
modern flight jet. For example, there is no satisfactory theory for turbulent flow yet 
and much remains to be done mathematically on Navier-Stokes equations. 


106 Theory of Differential Equations in Engineering and Mechanics 
2.8.3 Elastodynamics 


Following the discussion by Chau (2013), for dynamics problem for solids, we 
consider the force equilibrium of a solid of volume V as 


[,FdV +) ,n-odS = <j,pSav (2.151) 


where F, o, u, n and pare the body force vector, stress tensor, displacement vector, 
normal vector pointing outside the volume, and density of the solid (see Figure 2.3). 
Applying the Gauss theorem to the second term on the left of (2.151), we have 


| -[ pow 
|, (WeotF)av =| p = dV (2.152) 
Since the body is taken arbitrarily, we must have 
Vo+F = pot (2.153) 
t 


For isotropic elastic solids, Hooke’s law only involves two independent moduli 4 
and pL, which are called the Lamé constants. In tensor notation, it is 

o =Atr(e)I+2ue (2.154) 
For small deformation and small strain, the strain-displacement relation is defined 
(Chau, 2013) 


€=4[Vut(Vu)’ ] (2.155) 
Substitution of (2.155) into (2.154) gives 
o =Al(V-u)+ WVut(Vu) | (2.156) 


Then, substitution of (2.156) into (2.153) yields 
au 


AV(V -u)+u[V -VutV (Vu) ]+F = Pa (2.157) 
t 


Note that 


Figure 2.3 Force equilibrium of a solid of volume V and surface S 
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a) 6 
V(V -u)= Ve, re Uje,) =e, 0,U,,) =e, os (u, =U, @; (2.158) 
J k 


a 
To ji Idi 
Ox, 


O 6) 6) 
V (Vu) =V-(e, €.) =e -(u.ee.) =e, —-(u..ee. 
le ( ' x, uje;) er ax, 4,,2;) “ ax, (46,6) (2.159) 


= OU; 2, =U; ge; =U; i; 


Therefore, we can group these terms in (2.157) as 
2 
(A+p)V(V -u)+uV -VatF = a (2.160) 
t 
This is the equation of motion for an elastic isotropic solid. By virtue of vector 
identity (1.362) of Chapter 1, we can rewrite (2.160) as 


2 
(A+ 2,VV uu x(Vxu)sF = poo (2.161) 
Expressed in terms of compressional and shear wave speeds, (2.161) becomes 
2 
2VV uo Vx(Vxu)t iF =o (2.162) 
d Ss p ar’ 


where 


— — an fe (2.163) 
p p 


Let us consider the divergence of (2.162) as 
a(V-u) 
at? 

This is a nonhomogeneous wave equation for dilatation V-w. In obtaining (2.164), 
we have used the vector identity given in (1.347) setting A = Vxu as 

V-(VxA)=0 (2.165) 
As discussed in Chapter 1, all vectors can be decomposed into an irrotational term 
and a solenoid term according to Helmholtz theorem (see Section 1.16 of Chapter 
1). Let us decompose the displacement vector as: 

u=Vo+Vxy (2.166) 


2v7(V-u)tLy-F= (2.164) 
fe) 


Taking the divergence of u gives 
V-u=V ¢6+V-(Vxp=V¢ (2.167) 
Substitution of (2.167) into (2.164) yields 


Vfc2v7g 2b lye (2.168) 
Ot 72) 


This is a nonhomogeneous differential equation for the potential o. Physically, 
(2.167) shows that V7 is the dilatation. Therefore, (2.168) is also called the 
dilatation wave equation, and it governs the motion of the so-called P-waves. 

On the other hand, if we take the curl of (2.162), we arrive at 


ovxu 
ot? 


OV XV(V-u)—2V x[Vx(Vxu) LV <F = (2.169) 
p 
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The first term vanishes by virtue of vector identity (1.347) while the second term 
can be expanded by using (1.362) as 


Vx[Vx(Vxu)]=V[V*(V xu)]-V? (V xu) =-V?(V xu) (2.170) 
The last result follows from (1.347) or (2.165). Substitution of (2.170) into (2.169) 
gives 
2 
eV? (WV xu) -——_=-— Vx F (2.171) 
p 


This is a nonhomogeneous vector wave equation for the rotation tensor Vxu/2. 
Taking the curl of (2.166) we obtain 


Vxu=VxVO+VxVxy =V(VIw)-V y 


(2.172) 
— -V’y 
In terms of vector potential, (2.172) can be written as 
Viewty Yl yx F (2.173) 
ot =p 


This is a nonhomogeneous vector differential equation for the vector potential y. 
If there is no body force, we have 


2 
c V'= oe h (2.174) 


Vy = aa +H (2.175) 


where fA and A are arbitrary harmonic functions. However, they are normally 
neglected. That is, 


ar? at 
In summary, in terms of dilatation and rotation tensors the elastodynamics 
problems are governed by the following nonhomogeneous wave equations: 


pA 
Vga os Vy = ow (2.176) 


ev7y-4)- 20 --ly.r (2.177) 
Ot 72) 
2 

2v2(yxu)- 2" =f yx (2.178) 
Ot Pp 


Physically, the wave equations given in (2.177) and (2.178) closely resemble the 
scalar and vector wave equations given in (2.112) and (2.113) as Maxwell 
equations. Therefore, elastodynamics and electrodynamics are mathematically the 
same problems. The mathematical technique for one problem is applicable to the 
other. To the best of our knowledge, this resemblance has not been mentioned in 
any textbook on differential equations. The solution of the nonhomogeneous wave 
equation is discussed in Section 9.3 and is given in (9.207). 


2.8.4 Three-Dimensional Elasticity 


Let us look at the special case that the displacement field is independent of time. 
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(A+W)V(V -u)+uV*> u+F =0 (2.179) 
If the Helmholtz theorem is used, we can specialize (2.168) and (2.173) 
VV’ b= V* g=- Vv (2.180) 
A+2U 
VV y =Vy sl xF (2.181) 
L 


These are nonhomogeneous scalar and vector biharmonic equations for @ and yw. 
It is also possible to introduce a single vector potential called the Galerkin 
vector G 


u=2(1-v)V’G-VV+G (2.182) 
Taking the Laplacian of (2.182) gives 
V-u =2(1-v)V’V°G-V’ (VV «G) (2.183) 


Similarly, we consider the gradient of the divergence of u as 
VV -u =2(1-v)VV «(V'G)-VV «(VV «G) 
= 2(1-v)V?(VV «G)-V’* (VV -G) (2.184) 
=(1-2v)V*(VV«G) 
Substitution of (2.183) and (2.184) into (2.179) yields 
(At+w—2v)V (VV + G)4+2u(1—-v)V°V’°G— LV? (VV «G)+F =0 (2.185) 
Simplification of (2.185) results in 
[A-2v(A+ WIV’ (VV +G)+2u1-V)V°V’G+ F =0 (2.186) 
However, from the result of elasticity, we know that Poisson’s ratio v can be 


related to Lamé constants as (Chau, 2013) 
A 


v= (2.187) 
A+L 
Substitution (2.187) into (2.186) gives 
VVG=V'G=-— _F (2.188) 
2u(1—v) 


This is a nonhomogeneous biharmonic equation for vector potential G. For the 
special case of zero body force, we have biharmonic equations 

V'G=0 (2.189) 
Thus, we see that the biharmonic equation is of fundamental importance for three- 
dimensional elasticity. Yet, most textbooks on differential equations for engineers 
and scientists do not cover the biharmonic equation. Further discussions of the 
biharmonic equation will be given in Sections 7.9, 8.13.4 and 14.12. 


2.9 SUMMARY AND FURTHER READING 


Although the main focus of this book is to discuss the mathematical techniques in 
solving differential equations and their applications in engineering and mechanics, 
it is essential that we should know what kind of information is available even 
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before we attempt to solve them. We should be familiar with handbooks on 
differential equations. A good start is to refer to the handbook by Zwillinger 
(1997), which summarizes different procedures for different types of differential 
equations with examples. There are also a lot of useful and handy comments on 
different techniques. For the most comprehensive and up-to-date handbooks on 
differential equations, we recommend the series of handbooks by Polyanin (2001), 
Polyanin and Zaitsev (2002, 2003), and Polyanin et al. (2002). For integral 
equations, which are closely related to differential equations and the focus of 
Chapter 12, one should consult the handbook of Polyanin and Manzhirov (2008). 
However, there is not much discussion on the method of analyses, but instead only 
the final solution is given if they are available. It is a good idea to check the 
handbooks of Polyanin (2001), Polyanin and Zaitsev (2002, 2003), and Polyanin et 
al. (2002) for your differential equation at hand before even thinking about solving 
it by yourself. 

Regarding textbooks on differential equations, there are hundreds of available 
textbooks on differential equations but most of them were written by 
mathematicians, not much by engineers. A number of very good books are 
recommended here, but it should be warned that some of them are not easy to read. 
The series of books by Forsyth (1890, 1893, 1900, 1902, 1906, 1918, 1956) remain 
classic although there are some obsolete topics. Some topics in these references 
are not readily comprehensible. Other good books include Airy (1873), Craig 
(1889), Bateman (1918, 1944), Whittaker and Watson (1927), Sommerfeld (1949), 
Erdelyi (1953), Ayres (1952), Ince (1956), Myint-U (1987), Myint-U and Debnath 
(1987), Zill (1993), Zachmanoglou and Thoe (1986), and Zill and Cullen (2005). 

For the history of differential equations, the three volume book series by 
Kline (1972) is a must read. Struik (1987) also provides some useful information 
about differential equations. 

A number of handbooks of mathematical functions should be available if you 
want to learn differential equation seriously. The number one handbook is 
Abramowitz and Stegun (1964), and Olver et al. (2010) is considered an updated 
version of Abramowitz and Stegun (1964). In terms of handbooks on integration, 
we recommend Gradshteyn and Ryzhik (1980). A handy handbook by Spiegel 
(1964) provides a quick reference. 


2.10 PROBLEMS 


Problem 2.1 Consider a fluid with uniform rotation with angular velocity @ the 
velocity field v is defined as 

v=Qxr (2.190) 
Show that such a uniform rotational field will not produce any viscous stress o”. 
That is, there will no energy loss for such uniform rotation. This result is somehow 
consistent with Kelvin’s theorem on the conservation of circulation. Actually, 
conservation of circulation in fluids was first established by Helmholtz in 1858 for 
incompressible flows, but was extended to compressible flows by Kelvin in 1869. 
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Problem 2.2 Classify the following system of differential equations for plate 
buckling 
2, 2 
VF = EH Sey 
Oxdy = Ox” Oy 
- Oo wor Ow , OF Ow 
Y pe Oe OF OxOy OxOy 
where w is the deflection of . ore and F is the Airy stress function for the stress 
in the plate. This is called the von Karman-Foppl equation. It models the buckling 
of plates and was proposed by von Karman and Foppl independently in 1910 and 
1907 respectively. 


(2.191) 
DV*w= p+ & 


Problem 2.3 Classify the following system of differential equations for shallow 
shell buckling 


Co 2 4 

o¥ ony Sw gag POEM 2G (2.192) 
ox a ” Oy R° ox 

where the initial stress state in the shells is denoted by a superscript “0.” This is the 

linearized Donnell equation for shallow shells proposed by Donnell in 1934. 


Viw-V4(N®. 


Problem 2.4 Classify the following differential equation for cylindrical shell with 
nonuniform wall thickness shallow shell buckling 


ld ..«l Ww 12 are 12=v) i %) 
w x 


x + 

wi” ax? ) aa E@ 

Where the thickness is h = ax and a is the radius of the shell. The depth of the 

water is defined as d—xo and d is the height of the cylindrical tank. This theory was 
derived by Reissner in 1908. 


(2.193) 


Problem 2.5 Classify the following differential equation for a circular ice plate 
supported on water (Nevel, 1959): 


VV-wtw= (2.194) 


Its solution was given by Wyman (1950) in terms of Kelvin’s functions, which will 
be discussed in Section 4.9. This problem relates to landing of aircraft on ice sheets 
(Assur, 1959). 


Problem 2.6 Classify the following differential equation for vibrations of circular 
plate: 


lo, 0,10, a 1 @u 
r r =0 2.195 
r ar * or C ar‘ or M+ b* at? ( ) 
where b is a constant. The axisymmetric boundary value problem was considered 
comprehensively by Reid (1962) for simply supported, fixed support and free 


support. 


Problem 2.7 Classify the following differential equation for the bending theory of 
cylindrical shells (Timoshenko and Woinowsky-Krieger, 1959): 


112 Theory of Differential Equations in Engineering and Mechanics 


(l-v’) OF | 0 
ce aé4 
where F' is Vlasov’s stress function, € is a normalized distance measured from the 
center, vis Poisson’s ratio, and c 1s defined as: 
eo 
12a’ 
The thickness of the shell is / and the radius of the cylindrical shell is a. 


VVVWF + 


(2.196) 


(2.197) 


Problem 2.8 Classify the following Monge-Ampere equation, which is a 
differential equation encountered in differential geometry, gas dynamics, and 
meteorology: 


Wye) —Vaexx =f) (2.198) 


Problem 2.9 Classify the following differential equation for shells with surface of 
revolution under axisymmetric bending (the unknown U is related to shear force 
and the variable is @): 


2 
nse =o fst a (2.199) 
D i, 
*C. ‘ t? 
FA fe each Pe 2 cog @ 2" 25 8300) 
iy dp rdpr fr, dy rf; 


where E = Young’s modulus, v= Poisson’s ratio, h = thickness, D [= Eh3/(1-v’)], 
and r; and r2 are parameters controlling the shape of the shells. 


Problem 2.10 Classify the following differential equation for a mode III dynamic 
crack branching problem: 
aw ow aw |i 


coths +——+-W=0 (2.201) 
Os? os oO 4 


CHAPTER THREE 


Ordinary Differential Equations 


3.1 INTRODUCTION 


The techniques for solving ordinary differential equations (ODEs) are of 
fundamental importance in the theory of differential equations. Most of the 
techniques to be discussed in this chapter were developed by founders of 
differential equations, including Leibniz, Euler, and Bernoulli. The mastery of 
these techniques is essential for our later chapters in solving partial differential 
equations (PDEs). In later chapters, we will see that a PDE is normally converted 
to a number of ODEs by assuming a technique called “separation of variables.” For 
the two-dimensional case, separation of variables leads to two ODEs; and for the 
three-dimensional case, separation of variables leads to three ODEs. Thus, solving 
PDE becomes solving ODEs. Therefore, the solution of ODEs is of fundamental 
importance in solving PDEs. 

For first order ODEs, the topics to be covered in the present chapter include 
separable equations, homogeneous type equations, exact ODEs, integrable 
condition, integrating factors, the Stokes method for homogeneous type, the Jacobi 
method, the Euler method, standard linear form, the Bernoulli equation, the Riccati 
equation, integration by differentiation, the Clairaut equation, singular solution, the 
Lagrange equation, factorization of nonlinear form, and Taylor series expansion. 
For second order ODEs with constant coefficients, the solution methods for a 
differential equation (DE) with non-zero nonhomogeneous terms include 
undetermined coefficients, variations of parameters, and operator factors. 

For second order ODE with non-constant coefficients, we discuss the Euler 
equation, Laplace type equation, Liouville problem, Mainardi approach for 
Liouville problem, and finally the Liouville transformation. Both homogeneous and 
nonhomogeneous forms are considered. Another general transformation (differing 
from Liouville transformation) and its associated concept of invariants of ODEs are 
discussed. 

For higher order ODEs, we consider the Euler equation of order n, adjoint 
differential equation of an n-th order ODE, Sarrus method, rule of transformation, 
and homogeneous equation. For nonhomogeneous equations, we have extended the 
method of undetermined coefficients and variation of parameters to n-th order 
ODEs. The technique of reduction of an n-th order ODE to a lower order ODE is 
discussed. The exact condition for an n-th order ODE is presented. The idea of 
factorization of ODEs is presented. A symbolic method for solving 
nonhomogeneous ODEs is discussed. The special technique of removal of the 
second highest derivative is also covered in the context of the n-th order ODE. The 
general idea of reduction of order for autonomous n-th order ODEs is summarized. 
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3.2. FIRST ORDER ODE 


The most general form of first order ordinary differential equation (ODE) is 
d 
F(=,y,x) =0 (3.1) 
dx 


Assuming that dy/dx can be solved from (3.1), we may express the first order ODE 
as 


® _ x,y) G.2) 
dx 


Note however that, in general, (3.2) may not be obtainable by solving (3.1). 


3.2.1 Separable ODE 


If the function on the right hand side of (3.2) can be separated as a product of two 
functions, one is a function of x only and the other is a function of y only, we get 


“ = fO60) G.3) 


Thus, we can put all functions of y on the left hand side with dy and all other 
functions on the right hand side with dx as: 


dy 


—— = f(x)dx (3.4) 
oy) 
Integration can be carried out independently on both sides of (3.4): 
dy 
“=| f(x)dx+c (3.5) 
J oy) J 


Therefore, the solution of the differential equation is obtained by integration only. 
Alternatively, we can rewrite first order ODE as 

M(x, y)dx + N(x, y)dy =0 (3.6) 
The condition of being separable implies M being a function of x only and N being 
a function of y only. That is, we have 

M (x)dx =-—N(y)dy (3.7) 

Thus, the solution of the first order ODE can be obtained by integrating on both 
sides separately as: 


} M(x)dx =-[ NO)dy+C (3.8) 


In other words, the first order ODE can be solved by integration alone if it is 
separable. Thus, a separable first order ODE can be considered as the simplest type 
of first order ODE. 


Example 3.1 Find the general solution of the following first order ODE 
Yo (y? +1) (3.9) 


dx 
Solution: The given ODE is clearly separable and can be rewritten as 


Ordinary Differential Equations 115 


oa ss (3.10) 
yor 


The integration on the left hand side can be conducted by using the change of 
variables as 


y=tan0, dy=sec? 6d0 (3.11) 
Recalling the trigonometric identity that 
tan? 9+1=sec* 0 (3.12) 


we find 


j dy a aL re 


y +1 tan? 0+1 (3.13) 
= tan! y 
Consequently, the solution can be expressed as 
tan! y=a +C (3.14) 
Alternatively, y can be written as: 
y= tan( x? +C) (3.15) 
Example 3.2 Find the general solution of the following first order ODE 
2 
dy__ x : (3.16) 
ag 
Solution: This ODE is clearly separable and can be arranged as: 
(I-y? Jay =a (3.17) 
Integrating on both sides, we have the solution as 
2 3 
x y 
+ =C 3.18 
3 C 3 (3.18) 


Note that the solution of the first order ODE has only one unknown constant, 
and it must be used to fit the boundary value prescribed for the ODE. Let us 
assume that 


dH, (x) dH,(y) 
————S = —— 3.19 
M(x), a NYY) (3.19) 
Substitution of (3.19) into (3.8) gives: 
Hi, (x)+ H,(y)=C (3.20) 


Now suppose that we know the value of y = yo for x = x0 as: 
V(X) = Yo (3.21) 
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Application of this boundary condition into (3.20) gives the following value of C as 


FL (X9) + Hy (¥) =C (3.22) 
Back substitution of this value of constant into (3.20) gives 
F(x) + Hy (y)— Hy (x9) — Hy (%) = 0 (3.23) 


Note that now the solution contains no unknown constant. This solution can also be 
expressed in terms of the original given function M and N by noting that 


Hy()-Hy%) = J M (dé (3.24) 
Ho(y)~Ha(9)= | NEaE (3.25) 
Therefore, the final solution can also be written as 
[ m@de+[" mede=0 (3.26) 
x0 0 


Note again that the boundary condition given in (3.21) has been used in prescribing 
the lower limit of the integration. 


Example 3.3 Find the general solution of the following first order ODE 
dy _ 3x7 +4x4+2 
dx 2A y-1) 
Solution: This equation is separable and can be integrated with respect to y on the 
left hand side and with respect to x on the right hand side as: 


, y(0)=-1 (3.27) 


2(y —]dy = (3x? + 4x4 2)dx (3.28) 
Integration on both sides gives 
y? —2y =x? +2x7 +2x4+C (3.29) 
Applying the boundary condition at x = 0 we have 
Cl? —2(-) =0° +207 42004C (3.30) 
This gives the unknown constant C 
C=3 (3.31) 
Substitution of (3.31) into (3.29) results in the final solution 
y? —2y =x? +2x? 42x43 (3.32) 


This is a quadratic equation in y and its solution is 


_ -B+ VB? -44C 
4 2A 
where A, B and C are coefficients of the quadratic equation. However, we have to be 


very careful in picking the final solution. It can be shown that the solution is (instead 
of taking the positive sign in front of the square root): 


(3.33) 


ya=l-Vx2 42x? 42x44 (3.34) 
To check this, we have to back-substitute (3.34) into the boundary condition: 
y(0) =1-2 =-1 (3.35) 


If we take the positive sign, we can see that the boundary condition is not satisfied: 
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y(0) =1+2=34-1 (3.36) 
The process of taking the square root creates an extra solution that does not satisfy 
the boundary condition. Therefore, we have to be extra careful when we deal with 
the square root of the quadratic equations. 


3.2.2 Homogeneous Equation 


We now consider a special form of first order ODE that can be readily transformed 
into a separable form and thus can be solved easily in principle (provided that we 
know how to conduct the subsequent integration). This type of ODE is called the 
homogeneous type. However, we must emphasize that this homogeneous type first 
order ODE should be not confused with the so-called homogeneous ODE in the 
classification discussed in Chapter 2, in which all terms in the differential equation 
involve the unknown of the equation. 
The functional form on the right hand side of (3.2) is only a function of yx. 
That is, whenever x and y appear, their appearance must be in the form of y/x. In other 
words, we have 
® = 9) (3.37) 
dx x 
where g can be any arbitrary function of y/x. To solve (3.37), we apply the following 


change of variables: 


i=. or y=ux (3.38) 
x 


Differentiation of the second of (3.38) gives 


dy du 
eae (3.39) 
The last two terms of (3.39) can be rearranged as: 
ae ee (3.40) 
dx x 


This is clearly separable and can be integrated as: 


leas =Inx+C G.41) 


This provides the implicit solution of the differential equation. If integration can be 
carried out explicitly, we can obtain the solution in closed form by substituting this 
result into (3.38). This depends on the given function g(u). 


Example 3.4 Find the general solution of the following first order ODE 
xO ay =y (x >0) (3.42) 
x 


Solution: This is a linear first order ODE of homogeneous type 
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a 2 ae (3.43) 
dx x Xx 
Adopting the following change of variables 
u=> > (3.44) 
x 
we obtain 
Oa xu = tu (3.45) 
dx 
Comparison of the last equation of (3.45) gives 
ea (3.46) 


Dale 


Integrating both sides independently, we find 


Ju =Inx+e (3.47) 
Thus, wu becomes 
u=[Inx+c] (3.48) 
Finally, back-substitution of (3.48) into (3.44) gives the solution as 
y=x[Inx+cfP (3.49) 


For the case that / and N are given as 


Max + Ndy =0 (3.50) 
M=x"¢(2), N=x"v() (3.51) 
x i 
we have 
o(~)dx+y(~)dy =0 (3.52) 
XxX = 


This is obviously a homogeneous type and we use the following change of 
variables 


y=vx, dy=vdx+xdv (3.53) 
The first order ODE becomes 
o(v)dx +w(v)(vdx + xdv) =0 (3.54) 
Grouping the differential terms gives 
{(v) + vy (v) }dx + (v)xdv = 0 (3.55) 
Be VO (3.56) 
x v)+w(v) 
x =Cexp{ | pAR i (3.57) 
pv) + vv) 


Back-substitution of the definition of v after integration gives the final solution. 
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3.2.3 Rational Polynomials 


Another form of the first order ODE, which is closely related to the homogeneous 
type first order ODE, is given as: 


dy _ axtbhyt+ec (3.58) 


dx a,X+byy+c, 


Both the denominator and numerator are given as a linear function of x and y. 
There are three possible scenarios associated with (3.58). 


Case I: The simplest scenario is that the constant terms in both numerator and 
denominator are zero: 

Cy =C, =0 (3.59) 
Dividing all terms in the denominator and numerator by x, we obtain a homogenous 
type of ODE as: 


y 
dy _axtby _ a, +6, x 


y 
(3.60 
dx a,x+b,y a, +b,2 8) ) 
x 


As discussed in the last section, we make the standard change of variables given in 
(3.38) to make this ODE separable as: 


dy du 
oa (3.61) 
du_ gtu)-u_ at —a,)u-b,u’ (3.62) 
dx x (a, +b,u)x 
Therefore, we can integrate it as 
J nee i} Mei (3.63) 
a,+(b,-a,Ju-bu* * x 


where C is an integration constant. 

There are three possible forms of the integration of (3.63), depending on the 
values of ai, bi, a2, and b2. To integrate it, let us first note that the following 
integration formulas will be useful to integrate (3.63) (Formulas 14.66, 14.265, and 


14.266 of Spiegel, 1968): 
2 wn | aan? b’ <4ac 


V4ac—b? V4ac—b? 


f dx 2 


5 = = b* = 4ac (3.64) 
ax” +bx+e 2ax+b 


1 2ax+b—b*? —4ac 7 
In b° > 4ac 
Vb° —4ac 2ax+b+Vb? —4ac 


_- = ings Hake) = 7 a 
ax +bx+c 2a 2a* ax +bx+e 


Applying these integration formulas, for (a2 — b )?+4 a, bo > 0, we have 


(3.65) 
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nee == pe b) <a 
2 x . x 


.66 
a, +b, In 2b, (y/x)+(a, —b,)- (a, -5,)° +.4a,b, (268) 
2 (a, -by + 4a, 2b, (y/x)+(a, —6,) + (a, -by + 4a,b, 


For (a2 — b; )°+4 a; by = 0, we have 


a, +b, 
2b,(y/x)+a,—b, 


For (a2 — b; )°+4 ay bo < 0, we have 


Inx+C= inf ee 4 (3.67) 
x 


2b, 


neeCe hy He,<8) 2 05 
2 lis x 


(3.68) 
a, +b, | elon} 


¥7(a, -by — 4a,b, ¥-(4, —by’ —4a,b, 


Case 2: The second scenario corresponds to the case that the following determinant 
formed by the coefficients of x and y in the numerator and denominator equals zero. 
That is, 


eg (3.69) 
bh by 
Alternatively, this can be expressed as: 
a,b, —a,b, =0 (3.70) 
Thus, a; and 5; can be expressed in term of a2 and bp as: 
Go Py ay (3.71) 
a, b, 


where k is a constant. Substitution of (3.71) into (3.58) gives 
d axtbhyt+caq  kax+by)+c 
Y  aethy ta Meet hye pars by) (3.72) 


dx = a)x+byy+c, A,X + byy +Cy 
Note the fact that x and y only appear as a functional form of u defined as 


U=a,x+byy (3.73) 
Naturally, we can adopt uw as the change of variables. Differentiation of (3.73) gives 
BE ty sb? RIGS (3.74) 
dx dx 
This renders (3.74) separable and it can be integrated as: 
(—# west (3.75) 
dy +b, f(u) 
This can be integrated easily to get: 
ke 
l se Dg aah =x+C (3.76) 
dy + byk dy +byk ay +byk 
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Case 3: The third and last scenario is for the case that the following determinant 
formed by the coefficients ai, bi, a2, and b2 is nonzero. That is, 


a a 
bby 
To make this ODE homogeneous, we can use a change of variables to remove the 
constants c; and c2, such that the mathematical form of (3.60) can be recovered. To 


find the change of variables, we first formulate a system of two equations of straight 
lines by setting the numerator and denominator of (3.58) to zero: 


axt+bhyt+e =0 
a,X+byy+c, =0 


#0 c, 40 and c, 40 (3.77) 


(3.78) 


Let the solution of this system be x = a and y = £. A set of new variables can be 


defined as 
X=x-@ 
(3.79) 
ray=p 
Alternatively, this can be rewritten as: 
x=X+a 
(3.80) 
yor+p 


Substitution of (3.80) into the numerator and denominator of (3.58) yields 
axt+bhyte =a(X+a)+h(¥+ f)+e, 


=aX+bhY¥+(qa+bhf+e) (3.81) 
=aX+byY 

AnX +byy+Cy =a,(X +.a)+b, (V+ B)+c, 
=a,X ++by¥ +(a,a+b) B+ cy) (3.82) 
=d,X ++b)Y 


Therefore, (3.58) can be reduced to a homogeneous form in terms of the new 
variables X and Y as: 
dY _aX+bY 
dX a,X+b,Y 
Obviously, the solution given in (3.66) to (3.68) remains valid for (3.83) if the 
following substitutions are made: 
x€-x-a, yey-P. (3.84) 


(3.83) 


Example 3.5 Find the general solution of the following first order ODE 
dy _x+y-l 
dx x- yt+3 

Solution: The determinant of the factor of (3.77) is clearly non-zero. This 


corresponds to Case 3 discussed above. 
By setting the numerator and denominator to zeros, we have 


(3.85) 
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: +y-1=0 (3.86) 
x-yt+3=0 
Addition and subtraction between these equations give the following solutions 
x=-l y=2, (3.87) 
That is, we can set the shift of coordinates using 
a=-l f=2 (3.88) 
The new variables X and Y can be defined as 
X=x+l, Y=y-2 (3.89) 
Substitution of (3.89) into (3.85) gives 
Y 
dY dy xt+y-l1 X+Y a 
=== = = (3.90) 
dX dx x-y+3 X-Y Pa 


1- 
xX 

This is clearly a homogeneous form and we can assume the standard change of 

variable as 


Y 
u=— 3.91 
z (3.91) 
With this change of variable, (3.91) is converted to the following separable form: 
2 
yt _l+u (3.92) 
ae l-u 


Putting all terms containing wu on the left and grouping all terms containing X to the 
right, we have 


(l-u)du _ dX 
1+u? x 
Integrating both sides, we obtain 


(3.93) 


tan“! In(l +u2)=In|X|+e (3.94) 
Back substitution of (3.91) into (3.94) gives 
tan”! u = tan() =In[J(ltu?)X]+c=In[y(X7?+Y7)]+e (3.95) 


Finally, substitution of (3.89) into (3.95) yields the solution in the original unknown 
and variable y and x: 


tan” come, —) = Inf(x+1)? +(y-2)? +e (3.96) 


This is the final solution in ae form and we would not attempt to solve for y 
explicitly. 


3.2.4 Integrable Condition 


For non-separable first order ODEs, it is very tempting to integrate with respect to x 
while treating y as constant and similarly for integration with respect to y. Whether we 
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can do that is the subject of this section. Euler showed that only certain forms of first 
order ODEs could be integrated directly on both sides even though the equation is not 


separable. 
Consider the total differential of a function wu and by definition we have 
EA te ale Ne | (3.97) 
Ox ov 
Suppose that uw equals a constant: 
u(x,V)=c (3.98) 
Thus, we arrive at the following differential form of an ODE: 
Hey pO (3.99) 
Ox ov 
Recasting this into a standard form of ODE using M and N gives 
ae Y) te moe ip MtebaNe ab (3.100) 
x y 


This equation reveals that if the differential form given in the second of (3.100) is 
integrable, there must exist a function u such that 


%—ucy, “=nay) (3.101) 
Ox oy 
Differentiation of M with respect to y gives 
2 
ae (3.102) 
oy  oyox 
On the other hand, differentiation of NV with respect to x gives 
2 
ce (3.103) 
Ox  Oxdy 
Recalling the fact that the order of differentiation can be reversed, we must require 
2 2 
Oe ee (3.104) 
Ovox Oxdy 


Thus, the integrable condition is obtained: 
OM(x,y) _ ON(x, y) 
ayti‘(i‘i 
If and only if (3.105) is satisfied by functions M and N of a given ODE, we can 


integrate the equation using the following procedure. We can start with the 
definition for M that (alternatively we can start with that of N): 


(3.105) 


OF 995) (3.106) 
ox 


Integrating with respect to x by treating the variable y in M as constant, we get 
u(x, y) = [u (x, y)dx + @(y) (3.107) 


Note that we have to add an arbitrary function of y in the process (instead of a 
constant) because of the definition of partial differentiation. Next, differentiation of 
u just obtained in (3.107) with respect to y leads to 
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=< fc») —_ a; (3.108) 


The last of (3.108) is ie ] virtue of the Pe of N. Rearranging (3.108) 
gives an equation for finding the arbitrary function 9: 


AO) -y—2 [Mcx,y)d0 (3.109) 
dy oy 
Integrating both sides with respect to y, we get 
0 
Py) = j|x- 2 Fasesys fo (3.110) 


Note that we can treat x in the bracket of (3.110) as constant. Finally, back 
substitution of (3.110) into (3.107) gives 


u(x,y) = [cw aes fv [mes y)dx]dy =e G.111) 


We should emphasize that the whole procedure works because we start with u(x,y) 
= c Therefore, it is of utmost importance to set uw to a constant as the last of 
(3.111). Otherwise, we still do not obtain any solution, but just define a function w. 
Most students will make this careless mistake when they first learn this method. 


Example 3.6 Find the general solution of the following first order ODE 


(e* + y)dx+(x—2sin y)dy =0 (3.112) 
Solution: Identifying VM and N, we have 
M(x, y) =e" +y, N(x, y) =x—-2siny (3.113) 
It is straightforward to see that the integrable condition in (3.105) is satisfied: 
AM(x,y) _,_ NG) eal 
ov ox 
We can integrate directly by assuming the existence of a function u as 
OF es (3.115) 
Ox 
OY ac Daa (3.116) 
oy 
Integration of (3.115) with respect to x gives 
u(x,y) = [(e' + y)de+ Wy) =e" +yx+ 0) G.117) 
Note that we have treated y as a constant during the integration as: 
[ue, y)dx =e* ty (3.118) 
Differentiating with respect to x and equating it to N, we find 
d 
M2 h(x, pdx + PO = y= x-2sin y (3.119) 
oy oy dy 


Taking partial differentiation of (3.118) gives 
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2 [Mc yde =x (3.120) 
oy 
Finally, we obtain a differential equation for ¢ from (3.119) 
pO ce as) (3.121) 
dy 
Canceling the function x on both sides, we obtain 
ONY 5s (3.122) 
dy 


We want to emphasize that if the function of x in (3.121) does not cancel out on 
both sides, we must have made a careless mistake. Finally, integration of (3.122) 
gives 


(vy) =2cos y, (3.123) 
Putting ¢ into (3.117), we obtain the required function u 
u(x, y) =e +yx+G(y) =e" +x4+2c08y (3.124) 


As remarked earlier, the most important step is to set u to a constant to get the final 
solution: 


ee’ +yx+2cosy=c (3.125) 


3.2.5 Integrating Factor 


If the integrable condition is not satisfied, we can make it integrable by multiplying 
an integrating factor. This was first discovered by Euler. To illustrate the idea, we 
can recast the left hand side of the ODE as 


& Wy 3126) 
y 


Max + Nady = 5M Py es — )+ (Mx — Ny) 
x yp x 


The validity of this identity can be demonstrated by expanding the right hand side. 
Note also that we have an exact integral for both brackets on the right hand side as 


BO aintay. Be ogi) (3.127) 
x oy x oy y 


Both of these are exact integrals, thus, providing a simple way to determine the 
integrating factor. However, this method is not covered in most of the textbooks on 
differential equations. 

We now look at a few special cases of (3.126). 


3.2.5.1 Case 1: Mx+Ny=0 


For this case, we can rearrange the differential equation (3.126) as 
Mdx+Ndy _ 1 
(Mx-—Ny) 2 


Since the right hand side is an exact integral, the integrating factor for such case is thus 


dn(~) (3.128) 
y 
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1 


——_——_ (3.129) 
Mx— Ny 
This is also an integrating factor for the following special forms of M and N: 
M=F(y)y, N=Fy(xy)x (3.130) 
To prove this, we can _ this equation as 
Mdx + Ndy _ 7 fee Sy dIn(x9) +d In )} 
(Mx-—Ny) 2 Mx-—Ny 
(3.131) 


_1 Foy) + hoy) 

2 AQy)- Gy) 

We see the first term on the right of (3.131) is a function of xy only, and thus the 
right hand side is again an exact integral. This completes the proof. 


dn(xy) +d In(~)} 
y 


3.2.5.2 Case 2: Mx—Ny=0 


For this case, to find the integrating factor we can rearrange (3.126) as 


MEN _ Dine (3.132) 
Mx+Ny 2 
Since the right hand side is an exact integral, the integrating factor for this case is 
—— (3.133) 
Mx + Ny 
3.2.5.3 Case 3: Mx+NyX0 & Mx-NyO 
When both of these groups are not zeros, the integrating factor is 
— (3.134) 
Mx+ Ny 
If Mdx+Ndy = 0 is homogeneous, the integrating factor is 
: (3.135) 


Mx — Ny 
Also recall that if the differential equation can be expressed as F\(xy)ydx + 
F'x(xy)xdy = 0, we have shown that the integrating factor is given by (3.135). 


3.2.5.4 Stokes Method for Homogeneous Equation 
If M and N are homogeneous functions of x and y of the degree n, we have 
M=x"¢(v), N=x"y(v), v=y/x (3.136) 
Subsequently, the differential equation can be written as 
Max + Ndy = x"b(v)dx + x"y(v)dy 


3.137 
= x" {h(v) + vy(v)}dx + x""y(v)dv ee 
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Case 1: ¢(v)+ vw(v) =0 (or Mx+Ny = 0) 


For this case, we have 


Max + Nay = x" w(v)dv (3.138) 
a = w(v)dv (3.139) 
x 


The right hand side is an exact integral, and thus the left hand side is also exact. 
Consequently, the integrating factor for case | is 
1 


n+1 
x 


(3.140) 


Case 2: o(v)+ vw(v) #0 (or Mx+Ny # 0) 


Using (3.137), we have 
Madx + Ndy _ dx ‘ y(v)dv 
xr) +wwv)} x 6) + vy) 
Both terms on the right are exact differentials and, in turn, the right hand side is 
also exact 


(3.141) 


Mdx+Ndy _ dx : yw(v)dv 


(3.142) 
Mx + Ny x Pv)+vy(v) 
Thus, the integrating factor is 
a (3.143) 
Mx + Ny 


This result obtained by Stokes is more general than the solution discussed in the 
previous section. 

Note, however, that in Section 3.2.2 we have shown that (3.136) is 
homogeneous. Therefore, we can actually solve it without finding the integrating 
factor. 


3.2.5.5 Differential Equation for Integrating factor 


We now consider the general case of finding an integrating factor u. Ifa first order 
ODE is not exact, we can make it exact by multiplying a function called the 
integrating factor: 
MM (x, y)dx + UN(x, y)dy = 0 (3.144) 
This is actually the definition of an integrating factor. Since (3.144) becomes exact 
after multiplying by , we must have the exact condition being satisfied: 
Ou(x, VM (x,y) _ OuCG Y)N(%Y) (3.145) 
oy ox 
Differentiation and rearrangement of (3.145) gives 
noe Wale a ON 
Ox oy Oy Ox 


du (3.146) 
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This is a first order PDE and is very difficult to solve for arbitrary functions VM and 
N. The commonly adopted approach is to make certain assumptions regarding the 
functional form of the integrating factor, and check whether the assumed functional 
form is valid. In a sense, it is a trial and error approach. In particular, we assume 
the integrating factor can be expressed in terms of €, which is a function of x and y: 
Mx, y)= MS), $ =SY) (3.147) 
Substituting (3.417) into (3.146) gives 
dt 06 poe a” ON (3.148) 
dg ox de oy Oy Ox 


Rearranging (3.148), we obtain 


(OM _aN, 
dil Oy Ox 

= d 3.149 

Tee ae (3.149) 
Ox oy 


We now observe that if the function on the right of (3.149) is a function of ¢ only, 
we can immediately integrate both sides and result in the integrating factor. If this 
is the case, we can first rewrite it as 


ee 
d 0 0 
Fd = ude (3.150) 
H oN OG M cs 
Ox oy 
Integration leads to the following result for 
OM ON 
fx@ag ay ae. 
=e » #“6)= aC ac (3.151) 
os yo 
Ox Ov 


Note that we did not add an integration constant in (3.151) because we can always 
multiply an integrating factor by a constant to get another integrating factor. In fact, 
there are infinite integrating factors for the same ODE. However, if y is not a 
function of ¢, we cannot solve it this way. If € = x, we have to check whether the 
following function is expressible in x only 


(OM _ an 
Oy Ox 

= —+__—— 3.152 

x 7 (3.152) 

If ¢ =y, we have to check whether the following function is expressible in y only 
(OM _ oN) 

Oy Ox 

= 3.153 

x zy; (3.153) 


There are, however, infinite possibilities for the structural form of ¢. Therefore, this 
method should be our last resort when all other methods fail to apply. The 
following example illustrates another choice of C. 
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Example 3.7 Find the integrating factor for the following first order ODE 


2 
Gre dee 232 apa (3.154) 
y y 2 


Solution: We can identify 


PA 
M=G245), we 43%) (3.155) 
y yo ae 


It is straightforward to see that it is not exact. 
gs (3.156) 
Oy Ox y? y x 
To find the integrating factor, we assume 
M(x, y) = HOY) (3.157) 
That is, we have € = xy. We have 


2 2 
6. 
ve ME 435 at “ee es 
’ ee a ¥ o E  (3458) 
x 
= xy(-—-22 434) 
y yx 
Differentiating (3.160) with respect to y gives 
(OM _ oN 
0 ox 1 1 
WO=—ae— ap =F (3.159) 
yay 6 
Ox Ov 
Substitution of (3.158) into (3.151) gives 
Io 
Mae? =e" =6=xy (3.160) 
The original ODE becomes exact by using (3.157) as 
UMadx + uNdx = (3x7 y + 6x)dx + (x° +3y*)dy =0 (3.161) 
Applying the technique discussed in Section 3.2.4, we have 
OF ih aa ies (3.162) 
ox 
Integrating with respect to x, we find 
u =x? y+3x" +(y) (3.163) 
Differentiating (3.163) with respect to y gives 
A a UN = 439" =x +¢'(y) (3.164) 
This leads to a first order ODE for the unknown function 6: 
aH) _ 3,2 (3.165) 
dy 


Integrating and substituting the result back into (3.163), we obtain the final solution 
as 
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u=xyt3x7+y=C (3.166) 


3.2.5.6 Integrating Factors by Inspection 


Certain first order ODEs allow one to guess the corresponding integrating factor. 
Here are some examples. A particular example is that a first order ODE contains 
the following group: 

xdy — ydx (3.167) 
The following Table 3.1 gives six possible integrating factors for (3.167) that we 
can use. 


Table 3.1 System of integrating factors by inspection 


Integrating | Total differential Applicable ODE form 
factors 
1 xdy — ydx ae xdy — ydx + f(x)dx =0 
ee ee 
x x 
1 xdy — ydx de xdy — ydx + f (y)dy =0 
“2 7 =a) 
y y 
- dy — ydx + dy + ydx) =0 
a. xdy — ydx = d{in(2)] xdy — ydx + f(xy)(xdy + ydx) 
xy xX x 
1 xdy — ydx _ d{tan7!(~)] xdy — ydx +f (x? + y")\(xdy + ydx) =0 
iy oe x 
1 xdy—ydx _ 1 d{In(~ +) xdy — ydx + f (x* — y*)(xdy — ydx) = 0 
:=— i ae: x—y 


The validity of the second column of Table 3.1 can be established easily, and this 
will be left for readers to show (see Problem 3.36). The following examples 
illustrate this method of inspection. 


Example 3.8 Find the integrating factor for the following first order ODE 
(y* —xy)dx +. x’°dy =0 (3.168) 


Solution: First, we rearrange the ODE as 

y dx —x(ydx —xdy) =0 (3.169) 

The functional form in (3.167) appears. Thus, we find that the integrating factor is 
1/(xy’), and using this we have 

dx _ ydx—xdy -0 


2. 


x y 


(3.170) 
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Using the result in the second row of Table 3.1, we have 


C= (3.171) 
x y 
Thus, the solution of the ODE given in (3.168) is 
a 6 (3.172) 
Be 


where C is an arbitrary constant. 


Example 3.9 Find the integrating factor for the following first order ODE 
(x+ y)dx —(x-y)dy =0 (3.173) 


Solution: First, we rearrange the ODE as 
xdx + ydy + (ydx — xdy) = 0 (3.174) 
The structural group given in (3.167) again appears. By inspection, we go for the 


fourth row of Table 3.1 and the integrating factor is 1/(x*+y), and using this we 
have 


ld +y’) bax = xdy -0 


3.175 
2 x+y x+y? ( ) 
Integration gives the solution of the ODE as 
5 int? +y?)-tan (2) =C (3.176) 
x 


where C is an arbitrary constant. 


3.2.6 Standard Linearized Form 


The most general first order “linear” ODE can be written as 


® = y(x)y+ OG) G.177) 
dx 


If we set O(x) = 0, (3.177) can instantly be integrated since it becomes separable. 
The solution is: 


gage (3.178) 


To solve (3.177), we can set the constant as a function of x in (3.178). This idea 
originated from Euler and is called variation of parameters. Thus, we assume the 
solution form (3.177) as 


p= (3.179) 
Differentiation of (3.179) gives 
dy _ de(x) , fecoar 


Ak hk + p(x)y (3.180) 


eel <2 
dx 
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Substitution of (3.180) into (3.177) leads to a differential equation for c(x) 


de —| P(x jax 
40%) _ Ory]? (3.181) 
dx 
This can be readily integrated to give 
e(x) = | one PP ae 4G (3.182) 
Substitution of (3.182) into (3.179) gives the final solution as 
pag | one} 404.0 (3.183) 


We will see in a later section that many first order ODEs can be converted to the 
standard linearized form given in (3.177). 

Another approach in solving (3.177) is integrating factor approach discussed 
in Section 3.2.6. In particular, we can multiply (3.177) by zz 


d 
M(x) =~ Ma) Pay = MCX) (3.184) 
Next, we note the following identity: 
ad MM (3.185) 


dx dx dx 
Comparison of the left hand side of (3.184) with (3.185) gives the following 
condition, if the equation is exact: 


d 
SE = — u(x) p(x) (3.186) 
dx 
This is clearly separable, and we have the integrating factor as 
pee re (3.187) 


Returning to (3.184) and using the integrating factor obtained in (3.189), it can be 
simplified as 


oY (3.188) 
dx 
Integration of (3.188) with respect to x gives 
uy = | wOde+e (3.189) 
This gives the final solution: 
jg } Hevq=d?™F, } oe Rg (3.190) 
u 


This is, of course, the same as (3.183). Once we can convert any first order ODE to 
the standard form in (3.177), we can solve the ODE. Note again that there is one 
unknown constant in the solution of first order ODEs before we imposed any 
boundary condition. 


3.2.7 Bernoulli Equation 


We now consider a nonlinear first order ODE of the form: 
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d 
= = p(x)y+O(x)y" (3.191) 
xX 


This nonlinear ODE was first considered by Jacob Bernoulli and for this equation, 
the following change of variable can be proposed: 


z=y" (3.192) 
Differentiation of (3.192) with respect to x gives 
dz _n adv 
=(l-n)y" 3.193 
Ai (I-n)y 7 (3.193) 
Substitution of (3.193) into (3.191) gives the following ODE for z 
“ =(l—n)P(x)z+(1—-n)QO(x) (3.194) 
X 


This becomes the standard linear first order ODE that we have discussed in the last 
section. Thus, we know how to solve it. We now illustrate the technique in the next 
example. 


Example 3.10 Find the general solution of the following first order ODE 


2 
[ee a (3.195) 
dx 2x 2y 
Solution: This is the Bernoulli equation with the following identifications: 
n=-l, z=y’ (3.196) 
Using this change of variables suggested in (3.196), we obtain 
dz dy 
—=2y— 3.197 
dx r dx ( ) 
Substitution of (3.196) and (3.197) into (3.195) gives 
ee (3.198) 
dx x 
In the context of the standard linearized ODE, we have 
1 
P@)=—, O@)= a (3.199) 
It can be solved as in Section 3.2.6 to give 
1 1 
—dx —|—dx 
wees ({x’e J dvte)=ort sx" (3.200) 


Back substitution of the second of (3.196) into (3.200) gives the final solution for y 
as: 


y motos (3.201) 


Note that (3.200) is not our final solution since z does not appear in our original 
equation. Thus, we must use the second equation of (3.196) to convert z to y. 
StudentsThis final step that students normally forget. 
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3.2.8 Riccati Equation 


As summarized by Watson (1944), the Riccati equation first appeared in a paper of 
John Bernoulli in 1694, who however never received the credit that he deserved. 
Riccati’s paper on the same differential equation was published in 1724. The 
generalized Riccati equation is given as 

d 

=U +aOy + gs(O»” (3.202) 
This Riccati equation is not easy to solve, in general. Euler in 1763 found that the 
general solution can be determined if one of the particular solution y; is known and 
can be written as 


1 
yVO=yO+—~ (3.203) 
v(t) 
Substitution of (3.203) into the left hand side of (3.202) gives 
dy dy, lady 2 ldv 
LHS =— = =q,+ + ->— 3.204 
dt dt vy dt N+ 92V1 + 93Y » dt ( ) 
On the other hand, substituting the following value of ” 
y = Woe (3.205) 
voy 
into the right hand side of (3.202) gives 
d 1 2 1 
RHS == =9,4+4,(y ++ G07 +214) (3.206) 
dt v voy 
Equating (3.204) and (3.206), we obtain 
dv 
Pi (42 + 243y,)V—- 93 (3.207) 


This is in the form of the most general first order ODE discussed in Section 3.2.6 
and, therefore, can be integrated exactly. 


Example 3.11 Find the general solution of the following first order ODE 


dy DM. dias 
SS 4+ yt 3.208 
dt t i i ( ) 
Solution: We can easily show that a particular solution of this Riccati equation is 
y,=1 (3.209) 
We apply the following change of variables recommended by Euler 
y=l+ : (3.210) 
v 


Differentiation of (3.210) and substitution of the result into (3.208) gives 

dv 3 1 

— = +2 v— =—-(—)v-- 3.211 

ai (92 +243 )V- 43 oy ; ( ) 
This converts the Riccati equation given in (3.208) into the standard linearized 
form discussed in Section 3.2.6. Thus, it can be solved using the standard 
approach. 
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In the years between 1697 and 1704, John Bernoulli found a change of variables to 
transform the Riccati equation into a linear second order ODE. In particular, we 
first rewrite the generalized Riccati equation as 


OY 5 phan? eye (3.212) 
dx 
The following change of variables was communicated to Leibniz by John Bernoulli: 
1 d ' 
y=—_S-7 (3.213) 
yudx wu 
Differentiating (3.213) with respect to x, we see that 
; uu’ —u'(y'u+yu' 
yot vi ve) (3.214) 
you 


Substitution of (3.214) into (3.212) leads to the following linear second order ODE: 
u"+(¢— + yywu =0 (3.215) 
y 


Therefore, the nonlinear first order Riccati equation becomes a linear second order 
ODE. Actually, only certain types of nonlinear ODEs can be converted to linear 
ODEs. The Riccati equation is one of these special types of nonlinear ODEs. As 
shown in Section 4.14 in Chapter 4, a nonlinear ODE that has poles as its only 
movable singularities can be converted to a linear ODE. Although, in general, this 
linear second order ODE is easier to solve than the nonlinear ODE, its solution may 
not be easy to obtain. In the next example, we will consider a special form of the 
Riccati equation that can be solved readily. 


Example 3.12 Find the general solution of the following first order ODE 


dy__y 2 
ic ee ee ( 3.216 
ae ¥ (3.216) 
Solution: First, we can rewrite it as 
dy Vi» 
tot +1=0 3.217 
aoa (3.217) 
Use the change of variables suggested by John Bernoulli 
yo (3.218) 
u 
Substitution of (3.218) into (3.217) gives 
"su! +u=0 (3.219) 


This is a standard linear second order ODE, which is called the Bessel function of 
zero order. The solution is the Bessel function of the first kind and of the second 
kind of zero order (Jo and Yo): 
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u = AJ)(t)+ BY) (t) (3.220) 
More detailed discussions of Bessel functions will be given in Chapter 4. In fact, 


there is a close relation between the Riccati equation and the Bessel equation as 
discussed by Watson (1944). 


We now consider a special form of the Riccati equation that allows an exact 
solution to be found. In particular, we start from the original equation considered 
by Riccati as 

du 


+ bu? = cx" (3.221) 
dx 
Use a change of variables of 
pe (3.222) 
x 
Differentiation of (3.222) gives 
ao (3.223) 
dx xX dx x? 
Substitution of (3.223) into (3.221) results in 
dy 2 m+2 
x—-yt+by =cx 3.224 
eo (3.224) 
This can be generalized to the following form 
Pa —ay+by* =cx" (3.225) 
dx 


This form is, however, a special form of (3.202) or (3.212). 
In the following discussion, we will consider a special case of n = 2a, which 
allows an exact solution to be obtained. In particular, we now consider 


gO pay? =cx"4 (3.226) 
dx 
A change of variables can be proposed as 


y=x'v (3.227) 
With this change of variables, it is straightforward to find 


D pling ga (3.228) 
dx dx 
Substitution of (3.228) into (3.226) arrives at the following separable ODE: 
pre cp ae (3.229) 
dx 


Note that the linear order term is removed by this change of variables, and it 
becomes separable: 


dv _ dk 
B(c/b-v’)  x'4 
If 5 and c are of equal signs, we can use a partial fraction to find 


(3.230) 
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1 1 


elb=ar), Ceb=nenon 


(3.231) 
1 fe t,..1 , 
2Ve Ve/lb-v Ve/lb+v 
Therefore, integration on both sides gives 
Jig e FY Pe (3.232) 
2b oc vc/b-v a 
Rearranging this equation yields 
Vefb+y ee 
———— =Ce 4 (3.233) 
Vc/b-v 
Solving for v, we obtain 
2bex% 
pated se = (3.234) 
b 2 Mbox” 
1+Ce 4 


Finally, substitution of (3.234) into (3.227) gives the final solution of the particular 
form of Riccati equation given in (3.226) as: 


2Vbex = 


E C exp(——— 
a a 
y=x = (3.235) 
1+ Cee 5 
a 


As expected, we only have one unknown constant. 
For the case that 6 and c are of unequal signs, we can integrate (3.230) in 
terms of a tangent function by introducing the following change of variables 


v= -c/b tand (3.236) 
That is, we have 
a 2,2 
dv = (-c/b)"* sec = _ do _ (3.237) 
b(-c/b-v*) —b(-c/b)(1+tan* 0) —b(—c/b) 
Therefore, integration of both sides (3.237) and (3.230) gives 
i == 1x4 (3.238) 
—b(-c/b) a 
Therefore, integration of both sides results in 
1 = 
6 = tan! {———} =-b |-<f—-x7 +0} (3.239) 
v-c/b boa 


Rearranging (3.239) and substituting (3.227) into the result, we obtain the final 
solution as 


(—be)"? x? 


y=V-c/ bx" tanf{C— } (3.240) 
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Another specific form of Riccati equation encountered in viscous incompressible 
flows will be considered in Problem 3.14. 


Example 3.13 A special form of Riccati equation was found equivalent to the 
following Ramanujan differential equation: 
dR 


dP 1 dO 1 
= =—(PQ-R), 
rh O-R) tz 


Pp ; 
Vai xP 2), @ rP 
This system relates to number theory studied by Ramanujan. As shown in Hill et al. 
(2007), this is equivalent to solving the following system after applying an 
appropriate change of variables: 


=S(PR QO’), (3.241) 


(u? ae = Ly —2uv+l) (3.242) 
du 6 


dw_ w(w-l) 2 

du 3 vu?-1) u*-l 

Once v is obtained by solving (3.242), then w can be determined by solving (3.243). 
Our focus is to solve the Riccati equation given in (3.242). 


(3.243) 


Solution: To solve (3.242), we apply the change of variables proposed by Bernoulli 
in (3.213): 


6(u? —1) dX 
=- —— 3.244 
v(u) 7 ue (3.244) 
Differentiation of (3.244) gives 
2 2 2 
dv 12u dX | 6(u 1) es 6(u~ —1) d a (3.245) 
du X du xX du x du 
Substitution of (3.245) into (3.242) gives 
d°X Tu dX xX 
u? —1) +—(u’-1 +—=0 3.246 
( ) duu 3 ( ) du 36 ( ) 
We now apply another round of change of variables 
u 
7 wy pF (3.247) 
Using this change of variables, we obtain 
OP . : P a (3.248) 
du (u“ —l)"~ dw 
dx 3 dX 1 d&’x 
2 = 2 _ 5/2 + 2 3 2 (3.249) 
du (u-ly dw (uw -ly dw 
Substitution of (3.248) and (3.249) into (3.246) arrives at 
d’X 2 dX X 
(w* -1) = (3.250) 


+=w—+t 
dw 3 dw 36 
Finally, we impose the following shift of variables: 
y=(w+l)/2 (3.251) 
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In terms of this new variable, we have 
da = eee 21 dX X _ 
3 3 dy 36 


This equation is a special — of hypergeometric equation, which was first 
discovered by Gauss as: 


(3.252) 


va Na 


2 


=x) 2" + = (abs = aby =0 (3.253) 
dx dx 


The solution of (3.253) can be written as 
y = AF(a,b;c;x) + Bx'°F(a—c+1,b—c+1;2—c¢;x) (3.254) 
where F' is the hypergeometric series defined as: 
y= a: Ps a(a+lb as 
1. a 1-2-c(c +l) 
Detailed discussion of this hypergeometric series will be given in Chapter 4. 
Comparison of (3.252) and (3. — gives the solution as: 


(3.255) 


a= AF(-L, mare —3v)+ By mae ere (3.256) 


Back substitution into the socal unknown and Bee gives the final solution 
as: 


3 u 1 
= ; 3.257 
v (u =i "Ow == a >) ( ) 

where 
a te Sse d. 1S 
Ae spr a 3 +507 a Gast 

yWY= a = (3.258) 

F(=S;= ° ee ee a ae 

Ce 63 Tey oe 23) 


where the superimposed prime means differentiation with respect to y. Therefore, a 
certain form of Riccati equation can be expressed in terms of hypergeometric 
series. Another particular form of Riccati equation that can be solved in terms of 
hypergeometric series is given in Problem 4.26 of Chapter 4. 


3.2.9 Jacobi Method 


Jacobi considered the following first order ODE: 

(A+ A'x+ A"y)(xdy — ydx) —(B+ B'x+ B"y)dy+(C+Cx+C"y)dx =0 = (3.259) 
More specifically, Jacobi showed that (3.259) can be converted to the Riccati 
equation. Allow the following change of variables 


x=E+a, y=n+P (3.260) 
Substitution of these into the differential equation given in (3.259) gives 
[A+ A(E +a) +A" + AE +a)dn—-(9 + B)de] 


{B+ B(E+a)+B"(n+ Blan +[C+CE+a)+cr(q+pye=0 © 792 
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This ODE can be reduced to a form 
(ag + a'n)(Sdn —ndé) — (bg +b'n)dn + (c& +c'n)dgé =0 (3.262) 
provided that 


alA+ A'a+ A" B|-[B+ B'a+ B"B\=0 (3.263) 
—B[A+ A'at A"B]+[C+C'a+C"B]=0 (3.264) 

a=A, a=A" (3.265) 
b=B'-A-2A'a-BA", b'=B"-aA" (3.266) 
c=C'-A'B, c'=C"-A-A'a-2fA" (3.267) 


Equations (3.263) and (3.264) require 

C+Ca+C"f B+B'at+B'p 
p a 

In a sense, A is like a characteristics value and depends on the given differential 

equation. These can be rearranged as 


A-A+A'aA+A'B=0 

B+(B'-A)at+B"B=0 (3.269) 

C+Cla+(C"-A)B=0 
Elimination of both a and B gives 

(A-A)(B'-A)(C"-A)- B"C'(A-A)- A"C(B'- 2) 
— A'B(C"— A) + A'B"C+ A"BC' =0 

This is a third order equation for 4 and a closed form root is in general not 
possible. The value of A needs to be evaluated numerically. Then, a and B can be 


determined accordingly by back substitution of 4 into (3.269). Obviously, (3.262) 
can be simplified by using the following change of variables 


A+Aat+A" B= 


A (3.268) 


(3.270) 


Lae (3.271) 
Pe al —_ , dn=vdé+ dv (3.272) 

Substitution of (3.271) and ees into (3.262) gives 
[(c+c'v)—v(b roy —~(b+b'v)E+(ata'v)é =0 (3.273) 


This is the Riccati equation and its solution technique has been discussed in the 
previous section. Therefore, ODE of the form given in (3.259) can be converted to 
the Riccati equation as demonstrated by Jacobi. 


3.2.10 Integration by Differentiation 


In this section, we will introduce a concept that sounds contradictory at first sight. 
Actually, there are certain types of differential equations that can be solved by 
differentiation. Yes, we carry integration by differentiation. Consider a general 
form of first order ODE 
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d 
y=FP), P= (3.274) 
This type of ODE can be solved by differentiation. Differentiation of (3.274) gives 
dy _ OF (x,p) , OF (x, p) dp (3.275) 
dx Ox Op ax 
Substitution of (3.275) into (3.274) leads to 
d 
p= (x, p,) (3.276) 
dx 
If we can integrate (3.276) once, we can get 
p=f(x)+e (3.277) 
Back substitution of (3.277) into the original ODE, we get symbolically 
G,(x, y,c) =0 (3.278) 


Of course, in reality the workability of this technique depends on whether we can 
integrate (3.276). 

Another form of first order ODE that can be solved by integration by 
differentiation is 


dy 
x=F(y, p), as (3.279) 
Differentiation of (3.279) with respect to y gives 
f= FuOsP)+ FOP (3.280) 
This equation can be rewritten as 
+= fo,n,2 (3.281) 
P dy 
In general, we can integrate (3.281) to obtain 
p=e(y)te (3.282) 
Back substitution of (3.282) into the original ODE, we obtain the solution as 
G(x, y,c) =0 (3.283) 


A special form of the above forms of first order ODE is that both x and y 
appear linearly in the functional form of 


xP(P)+ VW(P)= X(P) (3.284) 
Rewriting (3.284), we have 
pax AP), LP) _ xo (p) +50) (3.285) 
v(P) WP) 
Alternatively, we can also rewrite (3.285) as 
VP), XP) 
= = 3.286 
y Hp) oo) VP_(P)+So(P) (3.286) 


Differentiation of (3.285) gives 


P=alp)t+xol(D) +o} “ 3.287) 
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Rearrangement of (3.287) leads to 
dx ' r 
Pale) Alp) =¢1(p) (3.288) 


Dividing through by the bracket term gives 
de Ap) _ i) 


dp p-P(P) p-P(P) 
This is the most general linear first order differential equation that we have 
considered earlier in Section 3.2.6. Thus, the solution can be solved in terms of 
integration and expressed symbolically as: 


(3.289) 


x=g(p) (3.290) 
Suppose that we can invert this equation to give: 
P= &(%) (3.291) 


Substitution of this into the original differential equation (3.284) gives the final 
solution. 


Example 3.14 Find the general solution of the following first order ODE 


x+ yp =ap* (3.292) 
Solution: Method 1: Differentiating this equation with respect to x gives 
dp dp 
1+ p?+y— =2ap— (3.293 
BE Oe ae ) 
However, the original ODE can be used to find: 
2 — 
yo (3.294) 
P 
Substitution of (3.294) into (3.293) gives 
2 —y 
4 pe ap (3.295) 
dx dx 
Inverting this equation, we obtain a general linear ODE for unknown x: 
is - oP (3.296) 


dp p(ilt+p) l+p 
Integration of this equation gives 


x= _[etainfp+ J+ p?}] (3.297) 


yl+p? 


The original differential equation can also be used to give the value of p: 


+ Jy? +4 
paw (3.298) 
a 


The final solution is obtained by substituting (3.298) into (3.292). 
Method 2: Differentiating this equation with respect to y gives 


aS a yr (3.299) 
dy dy dy 
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This is a general linear ODE for unknown y with variable p. The solution of this is 
y= Frletw 1+ p? —aln{p+ 1+ p°}] (3.300) 
l+p 


Again, the final solution is obtained by substituting (3.298) into (3.300). 


3.2.11 Clairaut Equation 


The Clairaut equation is a special case of (3.284) that can be solved using 
integration by differentiation. In particular, the Clairaut equation is recovered as a 
special case if we set 

V(pP)=1, Op)=—-p, x(P)=f(P) (3.301) 


That is, the Clairaut equation is 


y= ape (3.302) 
Differentiation of (3.302) with respect Ne x ie 
dy _dy | dy, d 
3.303 
a ry vi ae (3.303) 
This can be factorized as 
dy.,d°y _ 
x+ 3.304 
Cae AGe wy (3.304) 
Therefore, this solution can be found by setting either factor in (3.304) to zero: 
d°y dy 
—=0, or [x+f(—)]=0 3.305 
= +O (3.305) 
Taking the first equation in (3.305), we have 
dy 
ce 3.306 
dx ( ) 


However, we should not integrate this again to get y because this will lead to 
another unknown constant. Instead, we should substitute (3.306) into (3.302) 
directly to get 

y(x) = Cx+ f(C) (3.307) 
As expected, we only have one unknown constant C. The solution of the second of 
(3.305) does not lead to a solution of the original differential equation, but instead 
yields a so-called singular solution, which actually gives the envelope of the family 
of parametric solutions given in (3.307): 


x=- ry (3.308) 


More discussion on this singular solution is next. 
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3.2.12 Singular Solution 


Recall from the last section that there are two solutions for the Clairaut equation 
because of differentiation. However, one of them is called a singular solution and is 
not the actual solution of the Clairaut equation, but instead it gives the envelope of 
the actual solutions with a different constant C. The singular solution had been 
studied extensively and theoretically by many mathematicians, including Leibniz in 
1694, Taylor in 1715, Clairaut in 1734, Euler in 1756, Laplace, De Morgan, 
Lagrange, and Cauchy in 1772. It is known that exact differential equation does not 
admit a singular solution. The singular solution only appears in nonlinear first order 
ODEs, just like the Clairaut equation discussed in the last section. We will briefly 
summarize its determination here. 

Not all first order ODEs will have a singular solution. If a first order ODE is 
given as 


o(x, y, p) =9, (3.309) 


there is no singular solution if the equation is linear in p. If the solution of the 
differential equation is 


f(x, y,c) =0, (3.310) 
the partial derivative of this with respect to x is 

F FY _4 (3.311) 

ox Ov Ox 


A technique is called c-discriminant (Forsyth, 1956) if it assumes that c is a 
function of x such that: 


OF OEY Ol Ge (3.312) 

Ox Oyox Ocdx 
To convert back to the (3.311), we need to set 
of de _ 4 


3.313 
Oc dx ( ) 
This requires 
ff =0 (3.314) 
Oc 


This and the solution (3.310) together provide a system to determine the singular 
solution by eliminating c from: 


f(%y,c)=0, —=0 (3.315) 
oc 


However, this topic does not have obvious applications in science and engineering, 
becomes obsolete, and is not covered in most textbooks in engineering 
mathematics. Our coverage of it will end here. 
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3.2.13 Lagrange Equation 


A slight extension of the Clairaut equation is called the Lagrange equation. It can 
be expressed in the following general form (Zwillinger, 1997; Piaggio, 1920; 
Forsyth, 1956) 
d 
y=xf(p)+ dp), p= (3.316) 


Note that if f(y) = p, the Clairaut equation is recovered as a special case. 
Differentiation of the Lagrange equation yields 


p=f()+bf'@)+ eon? (3.317) 


This remains a nonlinear first order ODE. However, if we reverse the roles of x and 
p and we arrive at a linear ODE for x with p being the variable: 


dx. fp) _ 9) (3.318) 


dp f(p)-p p-f(p) 
The solution of this can be found exactly. Let the solution be expressed 
symbolically in the following form: 
F(x, p,c) =0 (3.319) 
We can solve for p and substitute this solution of p into (3.316) to obtain the final 
solution. 


3.2.14 Factorization of Nonlinear Form 
Let us consider a nonlinear first order ODE in the form 
dy n dy n-1 dy n-2 
—)" + P(— +P, (— +..+P =0 3,320 
Ce iC 2) n ( ) 


where P; (k = 1,2,...,2) are, in general, functions of x and y. Suppose that (3.320) 
can be factorized as: 


dy dy dy 
a acl Sy CSL -0 3,321 
where px (k = 1,2,...,2) are functions of x and y. The solution can be considered as 
dy dy dy 
= =0, =0,:*+, | kn =0 3.322 
C. Pi) CC P2) a Prd ( ) 


Let the corresponding solutions of these first order ODEs be: 

u(x, y)—c, =9, Un(x,y)—Cc, =90,*-, u(x, y)—¢, =9 (3.323) 
The solution of (3.320) can be formed as a product of any two or more of these 
solutions, for example 


[uy (x, ¥) — €] [ua (x,y) —€]--[u, (%, y) —c] = 0 (3.324) 
A number of examples are used to illustrate this technique. 


Example 3.15 Find the general solution of the following first order nonlinear ODE 
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(2 ay? =0 (3.325) 
dx 
Solution: Factorization of (3.325) gives 
dy dy 
( ik ay)( ao ay)=0 (3.326) 
Setting both of these brackets to zero results in 2 equations: 
Inv—ax—c,=0, Iny+ax—c, =0 (3.327) 
The solution is then given as 
(In y—ax—c)(In y+ax—c)=0 (3.328) 
or 
(vy -—ce™ \(y—-ce™) =0 (3.329) 


Note again that we have set the constants equal in both equations of (3.327). 
Readers are advised to check the validity of this solution. 


To visualize the power of factorization, let us consider the following nonlinear 
ODE of the first order with non-constant coefficients: 


(Dy? 372 42x? =0 (3.330) 
dx dx 
It is straightforward to show that it is equivalent to 
dy dy 
2 —x)=0 3.331 
Co rF x) (3.331) 


By setting the first factor to zero, we have 
(22x) =0 (3.332) 
dx 
This is a separable first order ODE and can be readily integrated to give 


y=x'+q, (3.333) 


Similarly, setting the second factor to zero results in the following solution 
2 


y= =e Cy (3.334) 
The final solution is 
2 
(y-x? -e)\(y 5 c)=0 (3.335) 


Since the ODE 1s of first order, we only need one unknown constant c. 


Example 3.16 Find the general solution of the following nonlinear first order ODE 
(2 =4x" (3.336) 
dx 


Solution: Factorization gives 


@ 2x)( a +2x)=0 (3.337) 
dx dx 
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The solutions for the first and second factors are respectively 


yHext+q, y=-x +c, (3.338) 

The final solution is then 
(y-x -cXy+x -c) =0 (3.339) 

Or equivalently, we have 
(y—-c) -x* =0 (3.340) 


3.2.15 Solution by Taylor Series Expansion 


If a boundary condition is given, we can find an approximation of the solution by 
Taylor series expansion. First, we can rewrite a first order ODE as 


as 
Boag AAD) (3.341) 


Taking differentiation with respect to x, we find 
fy Hd _ HM 
dx? Ox Oy dx ae 
dy _ td _ . 
ae Be dy dx ex dy"! 
d"y = Fn + FA dy = Sn ote 
dx" Ox Oy dx Ox 
Using Taylor series expansion about a point xo, we a 


y= fy(%¥) (3.342) 


(xy) (3.343) 


7 fr Qy) (3.344) 


2 
xX—Xp) 

Y= Yo Si (os VoNO=A0)+ FC0¥0) + (3.345) 

Suppose that the boundary condition is given as 
V(X =0)=V_ =e (3.346) 

An approximation of the solution is 
2 

yx) =c+ fi,c)x+ F000) (3.347) 


When the number of terms goes to infinity, the solution approaches an exact 
solution. In reality, depending on the explicit form of fi, the process of 
differentiation may lead to very tedious calculation. Nevertheless, this approach can 
give a fast first order approximation of the result. 


3.3. SECOND ORDER ODE 


In this section, we consider second order ODEs. The most general case can be 
expressed as 
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d°y dy 
=a t, are 3.348 
dt ae dt ete 
If we assume the second order ODE is linear, we can express it as 
y"+pOytdOyv=s (3.349) 


Because of the nonhomogeneous term g, we must write the solution as a sum of two 
solutions. That is, the homogeneous solution plus the particular solution: 


V1) =y,.O+Y,) (3.350) 
The proof of this is deferred to Section 3.3.2. The procedure of finding the 
particular solution will be discussed in Sections 3.3.3 and 3.3.4. 
For the case of a homogeneous differential equation, suppose that we can find 
the homogeneous solution as: 
y=Cyy, (x) + Cyy(x) (3.351) 
Whether a boundary value problem can be solved, we have to study the so-called 
Wronskian. In particular, let us assume that the boundary conditions are given as: 
VE) = Yo» Vy) = Y (3.352) 
Substitution of (3.351) into (3.352) gives 
CiV1 (to) + Cr V2 (to) = Yo 
Cy (to) + Cr. ¥3 (to) = Y 
This system provides two equations for two unknowns, and using Cramer’s rule we 
can solve for the unknown constants: 


(3.353) 


Yo ¥2(to) Vio) Vo 
* t t ’ t * 
&= y*  yy(to) 7 yilo) ¥ (3.354) 
W W 
where W is called the Wronskian and is defined as: 
M(t) Y2(to) ; : 
= ' : ° “|= V(t) V2 (to) — VL) ¥2 Uo) (3.355) 
Vit) Vo(to) 


For the system to be solvable, we must require that the Wronskian is non-zero. That 
is, 


W(y15¥2)(to) #0 (3.356) 


3.3.1 ODE with Constant Coefficients 


Let us consider the simplest case of constant coefficients without a 
nonhomogeneous term. That is, p and q in (3.349) are both constants: 


y"+ py'+qv=0 (3.357) 
For constant coefficients, it can be proved that the solution must be in the form of 
an exponential function as: 


y=e™ (3.358) 
Substitution of (3.358) into (3.357) gives the following equation: 
r+ pr+q=0. (3.359) 


Ordinary Differential Equations 149 


Since our assumed solution should not be zero (if it is zero, the solution becomes a 
trivial solution), (3.359) must be zero. This is also the characteristic equation of the 
ODE given in (3.357). There are two roots of this quadratic equation: 


—ptp’-4 
eee a (3.360) 


r 
1,2 
2 


These characteristic roots can be real, repeated, or complex, depending on the 
values of A. The three scenarios are: 


Case 1:4>0 


For this case, we have two distinct real roots: 


| ap typ’-4q = p-v P= 44 G.361) 


oe 2 
The corresponding independent solutions are: 
y=e™, y=e (3.362) 
The general solutions are then linear combinations of these solutions 
Vn = Ce’ +Cye2* (3.363) 


The unknown constants need to satisfy the corresponding boundary condition (but 
this will be done after we obtain the particular solution first). 


Case 2: A=0 


In this case, since A = 0 we must have two roots that are the same. Thus, the 
repeated roots are 

=m = = (3.364) 
The first solution is obvious, while the second independent solution cannot be the 
same as the first one. To find the second one, we use Euler’s approach of variation 
of parameter (i.e., replacing the constant by a function of x). Or equivalently, we 
can use the well-known theorem (Forsyth, 1956) that the second independent 
solution must be in the form of an unknown function multiplying the first known 
solution. In either case, we have two independent solutions as: 


y=e™, yy =u(xye™ (3.365) 
To find the function u, we note that the differentiations of y2 are: 
yy =(u7,tu'je™ (3.366) 
ys =(u n +27,u'+u")e™ (3.367) 
Substitution of these results into (3.357) gives 
u" + (27, + p)u'+(72 + pr, + qu =u" =0 (3.368) 


Both bracket terms are zeros, and we end up with the last term in (3.368). Thus, we 
have 
u=Cx (3.369) 


With the second independent solution, we can now have two independent solutions as: 


150 Theory of Differential Equations in Engineering and Mechanics 


Vy =(C, + Cyx)e™ (3.370) 
Although the present analysis is for second order only, a similar result is also 
obtained for the case of a higher order ODE with roots of higher multiplicity. 


Case 3: A<0 


Finally, for A < 0 the two roots are complex conjugate pairs. Thus, the 
characteristic roots are: 
= pia 
2 


Thus, expressing the real and imaginary parts of it as a and B we get 


, A=p*-4q (3.371) 


42 


4 =atifP,, m=a-if, (3.372) 
The two independent solutions are 
=", ae (3.373) 


However, (3.357) is real but the solutions given in (3.373) are complex. We can 
take the real part and the imaginary part as the solutions: 


1 . I : x 
¥3= 5 +y2)=cos Bxe™, yy = 5) — yy) =sin Bxe® (3.374) 


Finally, the homogeneous solutions are 
y, =e**(C, cos Bx + C, sin Bx) (3.375) 
We will illustrate the solution in the following examples. 


Example 3.17 Find the general solution of the following second order ODE 
y'—2y'-3y =0 (3.376) 


Solution: For an ODE with constant coefficients, we can assume the solution as an 
exponential function as: 


y=e™ (3.377) 
Substitution of (3.377) into (3.376) gives the following characteristic equation 
r’ —2r—3=0 (3.378) 
Two real distinct roots are found, and we have Case 1. The two distinct roots are 
,H=-L m=3 (3.379) 
Finally, the general solution is 
y=Ce*+Ce™ (3.380) 


Example 3.18 Find the general solution of the following second order ODE with 
given initial condition 
d’s_ ds 
— +2—+s=0 (3.381) 
dt dt 


fly = 4.8" (3.382) 
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Solution: Again, an exponential solution is expected: 


s=e" (3.383) 
The corresponding characteristic equation is 
r?+2r+1=0 (3.384) 
It is straightforward to see that the roots are equal: 
4H =r =, (3.385) 
Thus, the corresponding solution is 
s=(C,+C,pe* (3.386) 
Differentiating this with respect to ¢, we obtain 
s'=C,e'-(C,+ Cote" (3.387) 
Applying the two initial conditions, we get two equations for two unknowns: 
s|,9 =4=C, s'|,9 =-2=C,-C, (3.388) 
The unknown constants are solved as 
G=H=4.6 =2 (3.389) 
Finally, the solution is 
s=(44+2t)e" (3.390) 


Example 3.19 Find the general solution of the following second order ODE 
y"-2y'+5y=0 (3.391) 


Solution: Finally, this example illustrates the case of a complex conjugate pair of 
roots. Assuming exponential function 


y=e", (3.392) 
we get the following characteristic equation 
r’ —2r+5=0, (3.393) 
The corresponding pair of complex conjugate roots is 
Ky =142i, (3.394) 
The general solution becomes 
y =e'(C, cos2x+C, sin 2x) (3.395) 


3.3.2 Nonhomogeneous ODE 


It was discovered by Lagrange that the general solution of a nonhomogeneous ODE 
consists of two parts, namely the homogeneous solution (solution of the differential 
equation without the nonhomogeneous term) plus a particular solution of the 
nonhomogeneous ODE. Mathematically speaking, for a linear second ODE, 


y"+Px)y'+ O@)y = f(x) (3.396) 
the general solution is 
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V(X) = V4 (%)+ Vp (X) (3.397) 
where y; and y, are the homogeneous solutions that satisfy (3.396) with f(x) = 0 
and the particular solution satisfies (3.396) with nonzero f(x). To show this, we first 
substitute (3.397) into the left hand side of (3.396) as 


y" + P(x)y'+ Ox) y =(4" +¥p" + POO Oy + ¥p +O), +p) 


=[y," +P(x)y, + OQ) yy +L yy" + PC) Vy + O(X) ¥p] B.398) 
= f(x)+0= f(x) 


The final result is obtained by observing that the term containing y, equals f(x) and 
the term containing y, equals zero. 

Therefore, it is of utmost importance that classification in terms of 
homogeneous or nonhomogeneous is made properly otherwise, the solution is 
automatically incorrect. 

In the next two sections, we will discuss the method of undetermined 
coefficient and the method of variation of parameters in obtaining the particular 
solution. 


3.3.3 Undetermined Coefficient 


For the following discussions, we will restrict to the cases of constant coefficient, 
that is 


y"+ py’ +qy= f(x) (3.399) 
where p and q are constants. Consider the case that the nonhomogeneous term is a 
product of polynomials and an exponential function 


f(x) =e"P, (x) (3.400) 


where P,, is defined as 
m 


T+ +a, (3.401) 


Since the exponential function cannot be altered or killed by differentiation, we 
must assume that the particular solution must also be proportional to the same 
exponential function. Thus, we assume the particular solution as: 


P(X) =x" + a,x 


y=O(x)e™ (3.402) 
Differentiation of this assumed solution form gives 
y' =Q'e* + 1Qe* (3.403) 
y" =Q"e* +210'e™ +A°Qe™ (3.404) 
Substitution of (3.403) and (3.404) into (3.399) leads to 
O"(x)+ (24+ p)O'(x) + (A? + pA +q)O(x) = P,,(x) (3.405) 


If we look closely, we discover that the bracket terms in (3.405) are the 
characteristic equation of the homogeneous ODE. Thus, it becomes very important 
for us to check whether the exponential power A of the nonhomogeneous term is 
the same as the characteristic roots of the homogeneous solution given in (3.359). 


Case 1: 4 is not a characteristic root 


Ordinary Differential Equations 153 


For this case, none of the bracket terms in (3.405) vanishes or: 
V2 +paAt+q#0, 2A+p#0 (3.406) 


Thus, the highest power term of Q on the right hand side of (3.405) must be the 
same as polynomials P77: 


O(x) = 2, (x) (3.407) 
More specifically, we expect the particular solution is 
¥,=0,(0e" (3.408) 
where Or is 
O(a) shy” 4b bs (3.409) 
Substitution of (3.409) into (3.399) gives 
[m(m—1)byx""* +...+ 2b, 9 ]+ (24+ p)[mbyx” | +...+B, 4] Anis 


m-| 


+...+5,,]= ayn + ax +...4+4, 


m-| 


+(A? + pat q)[byx” +x 
Matching coefficients for different powers of x, we find m equations in ascending 
order as: 
(A? + pA +q)by = 4% 


2 _ 
(4 + p)ymby +(A° + pAtqyb, =a (3.411) 


2Bn-2 + (2A + Pon + (A? + pa + Wom = any, 
Once this system of equations is solved, we obtain the particular solution. 
Therefore, Case | is purely a matter of matching terms on both sides. 


Case 2: 4 is a simple characteristic root 


If the exponential power 2 matches the characteristic roots of the homogeneous 
differential equation (3.360), we have 
A? + pAt+q=0, 2A+p#0 (3.412) 
Thus, (3.405) is reduced to 
O"(x) + (24+ p)O'(x) = F(x) (3.413) 
The highest order term of the power series on the left is Q’ (i.e., the first derivative 
of Q) which must have power of order m in order to match the highest order term 
on the right hand side. This suggests that we should choose Q as 
O(x) = xO,,(x) (3.414) 
Consequently, for Case 2 (simple root matching) we have the assumed particular 
solution in the following form: 
Vp =XOn (x)e* (3.415) 


Case 3: d is a double characteristic root 


Finally, for Case 3, power 4 matches with the double or repeated characteristic 
roots of the homogeneous equation, and we have 
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A? +pAt+q=0, 2A+p=0 (3.416) 
Thus, (3.405) is simplified to 
O"(x) = Pa (x) (3.417) 
Similar to the earlier discussion given in Case 2, we have to raise the power of the 
polynomials by two in order to match the highest power on both sides of (3.413): 


OG) =2' 0.6) (3.418) 
Therefore, we expect the particular solution to be expressed as: 

i,=* Oe" (3.419) 
In summary, we can combine all three different scenarios into a simple formula: 

y, =x*e"Q,, (x) (3.420) 


where & equals zero, one, or two for the case of no match of the root (zero 
matching), match a simple root (one matching), or match the double root (two 
matching). Let us illustrate again by example. 


Example 3.20 Find the particular solution for the special case of a polynomial Pi, = 
A 


y" + py’ +qy = Ae™ (3.421) 
Solution: The nonhomogeneous term is simply an exponential constant and thus the 
polynomial is of order zero: 


O"(x) + (24+ p)O"(x)+(A2 + pa+ Ox) =P,(x)=4 3.422) 
Case 1: i is a not characteristic root 


For this simple case, we have a constant term for our polynomials O,, 


O=c, (3.423) 
Obviously, we have 
O"(x) = O'(x) =0 (3.424) 
Balancing terms on both sides, we have 
(A? + pA+q)Q=(A? + pA+q)ey =A (3.425) 
Then, we have the unknown co as: 
a (3.426) 
(A* + pA+q) 
Finally, the particular solution is 
i= rere (3.427) 


Since A is given in (3.421), there is no unknown constant in (3.427). 
Case 2: A is a simple characteristic root 


For this case, we have 
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A? +pAt+q=0, 2A+p#0 (3.428) 
Thus, for k = 1 we have to raise the order by one: 
O=cx (3.429) 
Differentiation gives 
O"(x)=0, O(x)=G (3.430) 
Finally, we can match all terms on both sides to get 
O"(x)+(22+ p)O"(x) =(24+ p)c, =A (3.431) 
This gives the constant as: 
eae (3.432) 
2A + p 
The final particular solution becomes: 
v= = : xe** (3.433) 


Case 3: X is a double characteristic root 


Finally, if A matches the double characteristic root of the homogeneous ODE, we 
have 


V+pa+q=0, 2A+p=0 (3.434) 
For k = 2, we have to raise the order by two: 
O=cpx* (3.435) 
The ODE (3.422) reduces to 
Q"=2c,=A (3.436) 
The unknown constant is 
a=4 (3.437) 


The final particular solution becomes: 


A x 
= sre (3.438) 
In summary, we have the following solution depending on A: 
A : . a 
7. e A is not characteristic root 
Ao + pat+q 
A to fy sos 
yy, =3. <= xe™ Aisa simple characteristic root (3.439) 
p 2A +p 
A : . oe 
- oat i Ais a double characteristic root 


Example 3.21 Find the general solution for the following nonhomogeneous second 
order ODE 
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y"-2y'-3y =3x4+1 (3.440) 
Solution: Note that the nonhomogeneous term is only a first order power series. We 
have to solve for the characteristic root of the corresponding ODE (i.e., setting the 
right hand side zero): 
y"-2y'-3y =0 (3.441) 
Assuming the exponential solution form leads to the following characteristic 
equation: 


r’ —2r—3=0 (3.442) 
Two distinct real roots are obtained: 
R=), 1=3 (3.443) 
This is Case 1, and we can assume the particular solution as: 
Vp = Ox +5, (3.444) 
Substitution of (3.444) into (3.441) yields 
3h) x — 2b —3b, = 3x41 (3.445) 
Comparing terms on both sides, we obtain 
1 
The particular solution is 
1 
Vp hase (3.447) 


The general solution is then the sum of homogeneous solution and the particular 
solution: 


1 
y=Ce* +Cye* —x+ = (3.448) 


Example 3.22 Find the general solution of the following second order 
nonhomogeneous ODE 


y" —3y' +2y = xe™* (3.449) 
Solution: Let us consider the homogeneous equation of (3.449): 
y"-3y'+2y =0 (3.450) 


Since the coefficients of this differential equation are constant, the solution is of 
exponential form. Substitution of a solution form of e” leads to 


r? —3r+2=0 (3.451) 
The roots of this characteristic equation are 
nr=l,n=2 (3.452) 
Thus, the general solution of the homogeneous equation is 
y=Ce*+Cye* (3.453) 


The nonhomogeneous term of (3.449) is a first order polynomial and thus, a 
polynomial of the same order as 
QO, (x) = Ax+B (3.454) 


However, we have A = r2 thus we have to assume 
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y, =x(Ax+ B)e* (3.455) 
Substitution of this into (3.449) gives 

2Ax+B+2A=x (3.456) 
Matching coefficients on both sides gives 

Aas, B=-1 (3.457) 


Finally, the general solution is obtained 


y=Ce* +Cye* 4 XG s=le" (3.458) 


Let us now consider that the nonhomogeneous term is a product of polynomials, an 
exponential function, and a circular function: 


f(x) =e" [P cosax+P, sinox] (3.459) 
7 


One simple way to deal with the cosine and sine terms is to replace them by 

exponential functions using Euler’s formula discussed in Chapter 1: 

i@x —i@x i@x _ ,—i@x 
fx)=e* [A 5 — + 2, —E— (3.460) 
i 
Grouping similar terms we can rewrite (3.460) as 

F E (A+ia)x PB 
x)=(—+—e “4 (4 
f(x) G a ( 5 


= Fin yo(A-iex 
2i (3.461) 
_ P(xje** + P(x)e*io* 

Allowing for complex constants for the polynomials, we get a particular solution 
for the first term on the right of (3.461) 

y" + py’ +qy = P(x)jeAi* | yi =x'Q eto (3.462) 
Similarly, for the second term on the right hand side of (3.461) we have 

x + py’ +qy= P(xje4 i , ce = PO. (3.463) 
Finally, we can combine these particular solutions to get the particular solution for 
(3.459) 


k Ax i@x Fy) ~!@x 
=xe Co =k e 
gs Len me (3.464) 


= x*e [RO (x)cosaxt RO (x)sin ox], 


Example 3.23 Find the general solution for the following nonhomogeneous ODE 
y"+y =4sinx (3.465) 


Solution: Solving for the homogeneous case of (3.465) we obtain the following 
characteristic equation and its corresponding roots: 
A =-l, A=Hi (3.466) 
The homogeneous solution is 
y=C,cosx+C, sinx (3.467) 
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Using the approach just discussed, we can first rewrite (3.465) as the imaginary 
part of the following ODE: 


y"ty=4e" (3.468) 
Thus, the nonhomogeneous term matches the characteristic roots (i.e., +/). We have 
to raise the order of power by one: 


y,= Axe“ (3.469) 
Differentiation of (3.469) gives 
Vp= Ae™ + Axie™ (3.470) 
Vp = 2iAe* — Axe™ (3.471) 
Substitution of these results into (3.468) results in 
2Ai =4 (3.472) 
This can be solved to give 
A=-2i (3.473) 
Finally, our particular solution for (3.468) is 
Vp= —2ixe™ = 2xsin x —(2xcosx)i (3.474) 


Comparing (4.465) and (4.468), we find that taking the imaginary part of the right 
hand side of (3.468) gives the right hand side of (4.465). Thus, to get the particular 
solution of (4.465) we also take the imaginary part of (3.474): 

Vy =—2x COS x (3.475) 


Adding the particular solution to the homogeneous solution gives the final solution 
as 
y=C,cosx+C, sinx-—2xcos x (3.476) 


The undetermined coefficient approach discussed here can be considered as a lucky 
guess method. For any nonhomogeneous terms other than those considered in this 
section, we cannot assume the proper form of the particular solution. In the next 
section, we will discuss a much more powerful technique—the method of variation 
of parameters. 


3.3.4 Variation of Parameters 


For first order ODEs, we have already learned the method of variation of 
parameters by Euler. The same technique was used by Lagrange to derive the 
particular solution of a second order ODE. In particular, consider the following 
second order ODE with non-constant coefficients: 


y"+ pOy'+qOy =a) (3.477) 
Assume that we can solve for the homogeneous equation of (3.477) 
V=EQyO+ QO (3.478) 


Using the idea of variation of parameters, Lagrange assumed that the particular 
solution for (3.478) could be expressed as: 
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V(t) = (CY (1) + Uz (1) 2) (3.479) 
Differentiation of (3.479) gives 
VO=UONO+y OO +u, Oy) tu (Oy0 (3.480) 


Lagrange observed that there are two unknown functions that need to be found in 
(3.479). That means we need two conditions for two unknowns. He then decided to 
choose the first condition before he continued to get the second derivative. In 
particular, we set the following condition: 


uy (t) y(t) +3 (t)y2(t) =0 (3.481) 
In doing so, we have reduced (3.480) to 
VO=4yOvyO+u(OvVO (3.482) 
Differentiation of (3.482) one more time leads to 
y"O=UONO+mONO + Oy 0 + OvO (3.483) 


Substitution of (3.482) and (3.483) into (3.481) gives 
Uy, Huy YF US V, tug yy + DULY +UZV)+qUY +UyV2)=8 (3.484) 
Grouping terms, we get 
mL y+ py, + gyi |+ugl yy + pys +92] + uy +uyV2 
= Uy + UV) = 8 
Note that the square bracket terms in (3.485) are zeros because both y; and y2 are 


the solutions of a homogeneous equation. Thus, we have two equations for two 
unknowns: 


(3.485) 


UY, +UZ¥,=8, Uy +Uby, =0 (3.486) 
Using Cramer’s rule, we find the solutions as: 
t)eg(t t)e(t 
ui(j=- 2080 p(y = O80 (3.487) 


W (v.92)? W (31,92) (0) 
where W is the Wronskian. Integrating both with respect to ¢, we find 
yr(g(t) WAG) 


u,(t) dt, u,(t) = | —-— (3.488) 
Www) 9 9 IW) 
Back substitution of these results into (3.479) gives 
yas) VOMsO 
y,(t)=—-y,(t) = dt +y,(t) | —-— (3.489) 
J IW (y%2)O 7? IW (Hp r2)O 
Finally adding this to the homogeneous solution, we obtain 
V=QyX)+Q,y2(—)+y,O (3.490) 


Note that this formula obtained by Lagrange is much more general than the 
undetermined coefficient method discussed in the last section because it is 
applicable to ODEs with non-constant coefficients. In addition, it is valid for all 
functions g as long as we can find the integration. However, first of all we must 
have the homogeneous solutions of the ODE (i.e., y; and y2). Even for linear ODEs, 
the homogeneous solution may not be easy to find for the case of non-constant 
coefficients. Secondly, even if a homogeneous solution is available, integration 
given in (3.488) is no easy task. Very likely, a closed form solution is not possible, 
and we may have to use numerical technique to conduct the integration. 
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3.3.5 Operator Factors 


The following method of factorization of differential operators was considered by 
Cayley in 1886 and reported in Ince (1956). Let us consider a second order ODE in 
the following form: 


2 
B 2p) ZX +q(ay =0 (3.491) 
dx dx 
Next, we assume that it can be factorized in the following form: 
d d 
— + a,(x)|[—+ @(x)]y =0 3.492 
ae 2( MS i(x)|y (3.492) 
This equation can be rewritten as 
d° d 


{+(@ +a))—+(qan +a) )}y =0 (3.493) 
dx dx 
Note that the expansion of the differential operators in (3.492) to the form (3.493) 
involves differentiation and thus cannot be treated as simple expansion of algebraic 


factors. If (3.493) and (3.491) are the same, we must have the following identities 


2p=Q,t+@, G=a,+a, (3.494) 
However, we can only say that one of the solutions of (3.491) is 
ce + a(x)]y =0 (3.495) 
dx 
Since it is in a separable form, one of the solutions of (3.491) is 
peel. (3.496) 
We consider a special case that (3.491) can also be factorized as 
d d 
[— +a (x)][— + a, (x)]y =0 (3.497) 
dx dx 


In other words, we are looking for the condition that this factorization is 
commutative. Similarly, (3.497) can be expanded to get 
2 


& +(Q, bay) + (QA, +a, ty =0 (3.498) 
Thus, we must have 
Q =O,0y +0, = O40 +A, (3.499) 
Equality of (3.474) and (3.499) can only be realized by setting 
@, =O, (3.500) 
This implies that the factorized function can only differ by a constant: 
A, =A,+A (3.501) 
Thus, we can factorize the second order ODE as 
[Layo +ay(x)+ Aly =(P+APy=0 (3.502) 


where the differential operator P is defined as 
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d d 
Py = taal = +a (x)y (3.503) 


That is, the factorization is commutative 
d d d d 
[H+ a, + a(x) + Aly = +04) + AS + a,(@)|y =0 6.504) 
dx dx dx dx 
For such cases, the final solution is 


sg. Fg Zen (3.505) 


Let us consider a — case that allows the factorization to be commutative 


dy’ ' 
aa +2p() = ~+[p°(x) + p'(x)—a" ly = 0 (3.506) 
It is straightforward to show that . can be factorized as: 
d d 
[— + p(x)-a]l— + p(x) +aly =0 (3.507) 
dx dx 
The corresponding solution becomes 
jaGe VORP pe oa (3.508) 
Example 3.24 Find the solution of the following ODE by factorization 
dy 52 4 
+ 2x t[x° +2x-l]y=0 3.509 
se ly (3.509) 
Solution: Recall (3. sy that 
dy? ' 
oo +2p() ~ +[p?(x) + p'(x)—a’7|y =0 (3.510) 
Comparison of (3.510) and ion gives 
p@=x, ¢=1 (3.511) 


Thus, the ODE can be factorized as 


d 2 d 2 d 2 d 2 
—+x°-l][— +x? +1]y= x7 +1 l]y =0 3.512 
a Ik x +1]y rma iL |y ( ) 


The two independent solutions can be obtained by solving the following ODEs: 


d 2 d 2 
—+x°-lly, =0, [—+x° +]]y, =9, 3.513 
ore 1, Ua |y2 ( ) 
Finally, the solution is 
2 2 
x x 
y, = C, exp[x(1 ; )]+ C, exp[-x(14 )] (3.514) 


3.3.6 Reduction to First Order 


Second order ODEs can be reduced to first order by using various means. If the 
right hand side is only a function of x: 
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d’y 

<t=f(x); 3.515) 
dx 

we can just integrate it once. This is the simplest type of second order ODE. If the 
unknown y does not appear explicitly on the right hand side as: 


d*y dy 
= f(x,—), 3.516 
ae Bi Te (3.516) 
the following change of variable can be applied: 
p(y- (3.517) 
dx 
Then, the resulting ODE becomes first order and p becomes the new unknown: 
d 
<? = f(%p) (3.518) 
dx 


Clearly, it will not work if y appears explicitly. Another widely encountered situation 
is that the variable does not appear in the ODE as: 

d*y dy 

—= f(y, (3.519 

dx? IO dx ) 
For such cases, we can still use the change of variables given in (3.517) but we 
observe that we can convert differentiation with respect to x to become differentiation 
with respect to y as: 


d’y dp dp dy dp 
> 3.520 
ae ae ar f(¥P) ( ) 
The last equation of (3.520) is an ODE with p being the unknown and y being the 
variable. This type of differential equation is called the autonomous type because its 
coefficients do not change with x. Higher order autonomous ODE will be discussed in 
Section 3.5.14. 


Example 3.25 This example considers the escape velocity of a rocket of mass m 
firing from Earth’s surface 
dh  GMm 


h| 


= R, =v 3.522 
=o dt|t=0 ° ee!) 
where / is the elevation of the rocket measured from the center of the Earth, G is 
the universal gravitational constant, and M is the mass of the Earth. 


Solution: Note that (3.521) is autonomous or the time variable does not appear 
explicitly in the ODE. We observe that 
d*h dv dh. dv 


= 3.523 
2 ad dh vee) 


where v is the velocity. Substitution of (3.523) into (3.521) gives 
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p= (3.524) 
dh he 
We have effectively reduced the equation to first order and clearly, it is separable 
vdv=-SS dh (3.525) 
h 
Integration of both sides gives the following relation between v and h: 
Pe i (3.526) 
2 h 
Substitution of the initial condition given in (3.522) gives 
1, GM 
=—Vvy -— 3.527 
5° @ (3.527) 
With this constant, we can rewrite the solution in (3.526) as 
15 14 1 1 
= +GM 3.528 
s 28 (57 R) ron 


If the rocket is going to escape from the gravitational pull of the Earth, we will 
have a nonzero v even at h > oo: 


iin 252 12 éu2 (3.529) 
h>+0 2 2, R 
This is equivalent of requiring the initial velocity to satisfy the following condition 
2GM 
Vo = ; (3.530) 
At the surface of the earth, the gravitational pull equals the weight of the rocket: 
GMm 
rms (g =9.81m/s”) (3.531) 
Thus, GM can be found in terms of R and h as 
GM = R’g (3.532) 


Finally, substitution of (3.532) into (3.530) gives 


vy = V2Rg =V2x63x10° x9.81 ~ 11.2x10° (m/s) (3.533) 


The escape velocity is about 11.2 km/s, which is a very fast initial velocity and is 
difficult to achieve. In reality, the rocket will gradually accelerate from launch. 


3.4 SECOND ORDER ODE WITH NONCONSTANT COEFFICIENTS 


In general, a second order ODE with non-constant coefficients is not easy to solve. 
We will see that when the Bessel equation is considered in Chapter 4. In this 
section, we will consider the condition that an ODE with non-constant coefficients 
can be transformed into one with constant coefficients. One of the most classical 
examples is the so-called Euler equation. 
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3.4.1 Euler Equation 
The Euler equation can be considered as the simplest type of ODE with non- 


constant coefficients. The coefficient is in a power of x and its order is exactly the 
same as the order of differential terms as: 


x’y"+ pxy'+qy = f(x) (3.534) 
Euler discovered that the following change of variables 
x=e' or Inx=t (3.535) 


can be used to convert it to an ODE with constant coefficients. Differentiation of 
the second equation of (3.535) gives 
dx = xdt (3.536) 
In particular, we can apply the chain rule to get 
dy _dydt_1dy 


(3.537) 
dx dtdx x dt 
Similarly, we can find the second derivative as 
d’y dy 1d id dy 1d 
dx -f0S4 a Co ( DEG ® 
“ ‘ (3.538) 
l1.dy 1 dy 
+ 
@ dt x dt? 
Substitution of (3.537) and (3.538) into (3.534) leads to 
2 
d 
rtp) +a= fle) (3.539) 


This is a second order ODE with a constant coefficient, which has been considered 
in Section 3.3.3. Thus, we know how to solve it. 


3.4.2 Transformation to Constant Coefficient 
In this section, we will consider the condition that an ODE with non-constant can 


be transformed into one with constant coefficients. Let us consider the following 
ODE 


a(x) i+ boy kat c(x)u = f(x) (3.540) 
Consider a change of ae as: 
= 9(x) (3.541) 
Differentiation of u with respect to the new variable defined in (3.541) results in 
du du dt du 
= =@' 3.542 
a aa coe (3.542) 
du os es u 
<+=0 (a 7 Hp'(x Me (3.543) 


dx* 
Substitution of (3.542) and (3.543) into tals a 
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2 
a(g’)” on +(ag"+bo') = +cu=f (3.544) 
We now impose the following conditions for o: 
a(g’y’ =Ac, (ag"+bo') = uc (3.545) 
where 4 and sare constants. Thus, all coefficients become constant: 
du du f 


A +u= 3.546 
Pre ae? , (3.546) 
Substitution of the first equation of (3.545) into the second equation of (3.545) 
leads to 
et Ae 4 je = Lc (3.547) 
dx Va a 


This gives the condition that a, b, and c need to satisfy if it can be converted to a 
constant coefficient ODE. This can be further simplified to a more compact form 


dc da 2 
(a —-—c— + 2bc) 2 
di _dx le (3.548) 
ac a 


Note that the right hand side is a constant. If (3.548) is satisfied, the first equation 
of (3.545) gives the required change of variables: 


o'= |, or p= Vif fac (3.549) 
a a 


Example 3.26 Let us consider the classic non-constant ODE of Euler type as an 
example: 
.d'u du 
ra tgx—-+ru= f(x 3.550 
ala aa ae f(x) (3.550) 


Solve the Euler type ODE using the result of this section. 


Solution: Thus, from (3.540) we have 
a= px’, b=qx, c=r (3.551) 
Substitution of these values into (3.549) gives 


ee (H (3.552) 
x\p 


fae (3.553) 
| ia 


It is natural to choose 


The change of variables becomes 
g=\Inx=t,or x=e' (3.554) 


This of course agrees with Euler’s approach of solving this equation. Substitution 
of (3.554) into (3.548) gives 
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Pre ae (3.555) 
, 
Finally, using (3.553) and (3.555) in (3.546) we have the following constant 
coefficient ODE 
d°u du rf 
+ +ru= 3.556 
Pa (q-P) a ; (3.556) 


This can be solved by assuming the standard exponential form. Thus, we can see 
that you do not need to be Euler to figure out the appropriate change of variables 
for Euler’s equation. The systematic approach presented in this section shows you 
how to propose the proper change of variables. 


3.4.3 Laplace Type 


The following ODE is known as the Laplace type and was considered by Weiler in 
1856, Schlomilch in 1879, Pochhammer in 1891, and Bolza in 1893. In particular, 
Laplace’s type equation is 


d°y dy 
a+lx) ——+(b+mx)—+(c+nx)y =0 3.557 
(a+lx) Fe ( ) hk ( yy (3.557) 
For non-zero /, we can let 
E&=a+l (3.558) 
It is straightforward to show that (3.557) can be written as: 
2 
é Yi ep) Bd er (3.559) 
dé S dg ¢ 
For / = 0, (3.557) can be rewritten as: 
d°y dy 
+ 2(h+ fx +(2gx+r)y=0 3.560 
Aa (h+ fx) =a (2gx+r)y (3.560) 
Both of these can be considered as special cases of the following more general ODE: 
d’y h dy _p 24 
+2(-4 tr)y=0 3.561 
a tat (3.561) 
d’y dy oo 
+ 2(h4 + (px +2qx+r)y =0 3.562 
ae (h+ fx) ae (Pp gx+r)y (3.562) 


Note that we have added extra terms in the last bracket terms in (3.561) and (3.562). 
We will focus our discussion for these two ODEs given in (3.561) and (3.562), since 
the Laplace type ODE is only a special case of them (i.e., p = 0). We first examine a 
particular change of variables as: 


x=Ké, paver ®O (3.563) 

We can see that 
dy dyd&é_1dy d*y_1 d’y 
dx dédx Kdé de ede 


(3.564) 
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dy | A OME dco a 
dé vg"e Gye Et ps} (3.565) 
Wh. epee 16 ty? 2A, MAD 
de = vé“e" { >. de ae aia a +[ uo + E + 1} ~=(3.566) 
Substitution of (3.564) to (3.566) into (3. sae _ 
a’ as us apy at Pe 2 APO (3.567) 


dé Ge dé @ 4 
where 
W=At+h, f'=ut+Kf, r=rke +2Kuf +p’, 
p'=AA-l+2hr+ p, gq'=Aut+KmMf +hut+«Kgq 
It can be shown that there are invariants for some combinations of these constants 
such that: 


(3.568) 


pith'-h” =pt+h-h’ =A (3.569) 
q'-h!' f'=K(q-hf)=KB (3.570) 
raf? =e(r—f?)=K°C (3.571) 


These can be proved directly by using the definitions given in (3.568). We can also 
define another invariant as: 

I=C/B (3.572) 
Now, we want to remove the first order derivative term by setting h'=f' = 0 and 
this implies 


A=-h, w=-Kf (3.573) 
Thus, we propose the change of variables (compare (3.563)) 
yar te yz) (3.574) 
From (3.569) to (3.571), we have the special cases of 
pi=A (3.575) 
g'=xB (3.576) 
r=K-C (3.577) 


Therefore, with these special values, we have the following normalized form of 


+K°C)v =0 (3.578) 


dz? . GO "oz 
There are a number of scenarios for the invariants (with « =1/B in (3.563)): 


Case 1: B #0, C#0 


d’y A 2 
+ +—+/)v=0 3.579 
a G 7 ) ( ) 
Case II: B #0, C=0 
dy A 2 
+ + =0 3.580 
rE G me ( ) 


Case II: B= 0, C#0 
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2 
fY (44nv=0 (3.581) 
dz Zz 
Case IV: B=0, C= 0 
2 
£7 A y=0 (3.582) 
dz Zz 
where 
A=pth-h’, B=q-hf, C=r-f?, [=C/B (3.583) 


Note that a special form of (3.578) is the Whittaker equation (Whittaker and Watson, 
1927: Abramowitz and Stegun, 1964): 
dW. 1k 1/4-m 
+—4 W=0 3.584 
” t ra - } (3.584) 
To recover the Laplace type ODE given in (3.557) as a special form given in (3.578), 
we can make the following definitions: 


bl—ma m cl—na n 
2h= z af a p=0, 2q= a faa (3.585) 
To reduce (3.567) to (3.559), we can choose A according to the following equation: 
p'=A(A-1)+ 2h 4+ p=0 (3.586) 


A special form of second order ODE of (3.561) is called the Bessel equation, which is 
one of the most important second order ODEs in physics and engineering, namely 
2 2 
BEY ole yh (3.587) 
dx? xd x? 


which is obtained by setting 


2h=1, f=0, p=-n’, g=0, r=1 (3.588) 
Thus, for the Bessel equation, we can find A by (3.586) as 
AV? -n* =0 (3.589) 
Thus, making use of the following change of variables (see (3.574)) 
yex"z (3.590) 
we have the derivatives as 
OY cyl pg (3.591) 
dx dx 
ay =n(n—1)x"?z+2nx"! ae gt o (3.592) 
dx dx dx 


With these derivatives ready, it is easy to show that the Bessel equation given in 
(3.587) can be written as 
d’z Intl dz 
+z 


+E ) 


dx” x ok 
This provides another mathematical form of the Bessel equation and is also known as 
Weiler’s canonical form of Laplace type ODE. Thus, the Bessel equation is a special 
form of (3.561). In conclusion, (3.561) can be converted to one of the canonical forms 
given in (3.579) to (3.582). 


=0 (3.593) 
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Similarly, we can also convert (3.562) to three canonical forms. In doing so, we 
observe that the following change of variables can be used 


x= KE A (3.594) 
12 
wee ee (3.595) 


Differentiation of y with respect to x can be changed to differentiation with respect to 
& by the chain rule: 


dy dydé_1dy d’y_ 1 d’y 


dw db de x = de ede — 
dy | wey de 
‘ Pee P unoes 7 (3.597) 
2 l 5 
oe  darvey weg ney EE (3.598) 
With these results, (3.562) is transformed to 
HE she fy Ea (pre? 42g'g +r9G =0 (3.599) 
dé* dé 
h'=u+kh+ fra 
f'=v ae ft 
p'= Vv? +2x°v f + px4 (3.600) 


g'=uvtKv(h+ fa)teuftKe(Ap+q) 
r'=K° (pA? +2gA+A)+2eu(h+ fat +v 


Similar to the former case, there exist invariants for these coefficients. In particular, 
we can easily show that 


P'=p'-f”? =K*(p-f?)=K'P (3.601) 
O'=q'-h' f'=K*(AP+0) (3.602) 
R'=r'-h” = f'=K[R+A°P +240] (3.603) 
P'&? 420'E + R'=K°[Px? +20x+ R] (3.604) 
S'= P'R'-Q” =K°(PR-Q’) =k°S (3.605) 
To remove the first derivative term, we set the following special case of (3.599) 
h'=0, f'=0 (3.606) 


For this case, it is observed that 
P'=p', O'=q', R'=r' (3.607) 
Consequently, the mathematical structure of (3.599) is reduced to 


a +(P'E* +20'E+ RNC =0 (3.608) 
Now we can use the following change of variables to simplify (3.608): 
E€=az+P (3.609) 
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The result is 


2 
IZ 


Comparing (3.608) and (3.610), we see that the invariant forms given in (3.601) to 
(3.603) reappear. 


= +[P'a%z! +2a°(P'B+O)z+a°(P'B +20'B+RI KC =0 (3.610) 


Case I: P# 0, 
For this case, we can choose 2 = —O’P’, c= 1/P' such that 
2 
#6 424M =0 (3.611) 
dz 
where 
5° 
T=7, (3.612) 
P" 


This equation can be converted to a confluent hypergeometric equation, but the 
details will be given later in this section. 


Case II: P’= 0, S# 0, 
By choosing 8 = —R'/(20’), 8 = -1/(6Q'), we get 
ae 1 
7 =0 3.613 
ar 3% (3.613) 
This equation is known as the Scherk-Lobatto equation. However, by assuming a 
change of variables of 


za33x (3.614) 
we have the Airy equation as a result: 
ae 
ae (3.615) 


The solution of it is the Airy functions of the first and second kinds (Stegun and 
Abramowitz, 1964): 
€ =CAi(x) + C,Bi(x) (3.616) 


Case III: P'= 0, S’' = 0, 
In this case, we assume c?= 1/R' such that 


de 
+¢=0 (3.617) 
dz” 
This is the well-known harmonic equation with solution 
¢ =C,sinx+C, cosx (3.618) 


Case IV: P'= 0, S'=0, R'=0 
This is the simplest case, and thus we have 
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2 
d 6 -0 (3.619) 
dz 
Thus, the solution is 
€=Az+B (3.620) 


We can see that (3.561) and (3.562) are very general. The special cases of them 
include the Whittaker equation in (3.584), the Bessel equation in (3.587), and the 
Airy equation in (3.615). 


3.4.4 Solution as Confluent Hypergeometric Functions 


In this section, we show that the Laplace type ODE is equivalent to the confluent 
hypergeometric equation, which is a special case of the hypergeometric equation. 
The consideration of the solutions of these functions will be postponed to Chapter 
4 (see Section 4.12) on series solution. 

We now retum to (3.557) and will show that it can always be transformed 
into the following equation, called the confluent hypergeometric or Kummer 
equation (Abramowitz and Stegun, 1964): 

2 


Ss a +(v-s) _ au=0 (3.621) 
Its solutions can be expressed as Kummer’s function and are given as 
u=C,®(a, 735) +s" 'C,P(a—y+1,2-735) (3.622) 
where the Humbert’s symbol ®(o.,y;s) is defined by 
te <7 (1), 8 ‘ 
D(a, y;8) =1+ 2, ae (3.623) 
and the following Pochhammer symbol has been used: 
(a), =a(a+l)-(a+k-1), (a) =1 (3.624) 


It was defined by Pochhammer in 1890. The detailed discussion of such solution will 
be presented in Chapter 4 on series solutions. 


Case 1: 140, m#0 


Let us assume a change of variables: 
y=e"n, (3.625) 
then (3.557) can be written as 


2 
(a4 nyt +[2Aat+b+x(2Al4 my tad? +bA+c+x(IA* +mA+n)]n =0 


(3.626) 
We can use the following equation to determine A 


1A> +mA+n=0 (3.627) 
Thus, (3.626) can be expressed as 
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d’n dyn, 
at+lk +(h+kx + jn =0 3.628 
(a+lx) ro ( ) at! (3.628) 
where 
h=2aa+b, k=2Al+m, j=al?+ba+c (3.629) 
Adopting another change of variables 
ath =vé (3.630) 
we can convert (3.628) to 
dn Ih-ak kv ,.dn jv 
é +( +— &)—++—n=0 (3.631 
dé? r prde Pp 
We can further choose v as 
gf yg 44_an=0 (3.632) 
dé dé , 
where 
Lalas J (3.633) 
kv k 
If there are equal roots for 
1A* +m2+n=0 (3.634) 
we have the root as: 
21A =-m (3.635) 
Equation (3.626) can be simplified as 
d’n dn 
at+k th tkyn =0 3.636 
(ate) pt hel +k (3.636) 
Adopting a change of variables of 
a+lx =vé? (3.637) 
gives 
iia) (3.638) 
dx 2wé dé 
d° P od’n 1d 
ae i AR RD) (3.639) 
dx 4vE° d& Ede 
Substitution of (3.638) and (3.639) into (3.636) gives 
d°n 2h ..dn 4vk 
+ -1 + =0 3.640 
tp ge (3.640) 
To reduce this equation further, we use another change of variables 
1 & 
n=e2u (3.641) 
Equation (3.640) can then be written as 
du 2h duh 1. 4vk 1 
+ 1+ + + + =0 3.642 
oe G oN Te G 5 (na ru ( ) 


We now make the identifications for the parameters of this change of variables: 
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l6vk+P? =0, €=-s 
Hence, (3.642) becomes 


2 
=. eu 1 ae _ w=0 
ds i ds 1 
This can be recast as 
2 
Ss a+ iF au=0 
where 
pos gt 
cc i 2 


This is clearly the confluent hypergeometric equation given in (3.621). 


Case 2: /= 0, m#0 


For this case, we introduce a change of variables 
y=ne m 
then (3.557) can be written as 
d°n dn 
t(h+kx =0 
ae ( ) oe 


where 
2 
po ee. iLL 
a em a aam m 
Then, we assume another change of variables: 
h+kx=.J2ké 


Hence, (3.648) becomes 


d°n 1, dn _ 
oe eS ge 0 


where 
_g_e bn an” 
2k 2m 2m 2m} 
Finally, let € = —s, and we obtain 


d°n 1 dn 
+ Ss 
. ds G ) ds 


(3.643) 


(3.644) 


(3.645) 


(3.646) 


(3.647) 


(3.648) 


(3.649) 


(3.650) 


(3.651) 


(3.652) 


(3.653) 


By identifying that y = 1/2 and w = aq, it is clear that it is a special form of the 


confluent hypergeometric equation given in (3.621). 
Case 3: /=0,m =0 


For this case, the following change of variables can be used: 
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bx 


y=ne 24, (3.654) 
then (3.557) can be written as 
2 
ST (i+ kon =0 (3.655) 
x 
where 
2 
h=* 7 ka" (3.656) 
a 
It can further be converted to a simpler form by 
h+kx =vé"2 (3.657) 
The resulting equation becomes 
dn ld 
E 7 =, 7 +AEn =0 (3.658) 
where 
3 
gat (3.659) 
9k 
We further let 
n=ePru (3.660) 
Consequently, (3.658) can be transformed to 
du 1. du P 1 
— + (2p& +-) — + + A)E+— =0 3.661 
dE (2p6 3 dé (p+ A)6 3 pl ( ) 


To recover the mathematical form of the confluent hypergeometric equation, we can 
select the following parameters in the change of variables: 


2p=-l, p’+A=0 (3.662) 
Consequently, we obtain the following special form of the confluent hypergeometric 
equation: 


2 
cae pe 1 


u 
: dé? “3 “dé 6 
Therefore, we conclude that the Laplace type of second order ODE given in 
(3.557) can always be transformed into a confluent hypergeometric equation, and 
thus can always be solved in terms of Kummer’s functions. 


0 (3.663) 


3.4.5 Liouville Problem 

The following nonlinear second order ODE was considered by Liouville: 
d’y dy dy.» 
—+ f(x) —+F (yy) =0 3.664 
ce F(x) re (v i (3.664) 


To solve this equation, we first drop the nonlinear term to get 
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2 
4 gj 6 (3.665) 
dx? dx 
This can be rewritten as 
i, Saree ene (3.666) 
dx dx 
It is separable and the solution is 
pea (3.667) 
dx 
Now, we follow a similar idea of variation of parameters introduced by Euler: 
- x)dx 
 _ ye lf (3.668) 
dx 


Note, however, that Euler’s variation of parameters assumes C as a function of x 
instead of y. Differentiation of (3.668) gives 


2 —| f(x)dx 
TY EB pea lO* 1G AD opiay 


2 
dx i . ; c dx dy dx (3.669) 
_ 1a a&,2 IV 
cae LY. 
Substitution of (3.669) into (3.664) gives 
l de _dy.y dy dy dy,2 
x + f(x + F =0 3.670 
ay ae LOY 7 TY OG? (3.670) 
The cancellation of the middle terms leads to 
d 1 dc 
(2)? -L+ F(y)} =0 (3.671) 
dx cdy 
Since dy/dx # 0, we have 
a ee (3.672) 
é 


The unknown function can be evaluated as 


-[FO)dy 
c=Ae (3.673) 
Back substitution of (3.673) into (3.668) gives 
ay ge [Pondy [fea 


3.674 
es (3.674) 

This first order ODE is separable and the result is 
} FOO ay 4 } Pe ear te (3.675) 


This is the solution of the Liouville problem. 
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3.4.6 Mainardi Approach for Liouville Problem 


Mainardi provides a different approach to solve the Liouville problem. In 
particular, we can divide through the (3.664) by as 


i @ 
= f+ FOND) = (3.676) 
a d 
dx 
Note that the first and last terms in (3.676) can be rewritten as: 
fe 6.677 
d. dx dy dx? , 
dx 
d d dy dy 
“NF = “= F(y\(— 3.678 
a FO == f= FN) (3.678) 
In view of (3.677) and es sage we v (3.676) becoming 
in 2) 4 £1 PUM} =-Fe) (3.679) 
Integration of (3.679) aa a to x gives 
d 
in) + [FC y)dy = -| f(x)dx+C (3.680) 
dx 
This can be expressed as 
Sg es (3.681) 
dx 
Equation (3.681) is separable and can be integrated as 
O_O x (3.682) 
-[ Fay 
e 
} el FOM gy, - cl pI pie (3.683) 


This solution is exactly the same as that given in Section 3.4.5. 


3.4.7 Liouville Transformation 


Consider a second order ODE with non-constant coefficients 

y"+ p(x)y'+q(x)y =0 (3.684) 
According to Piaggio (1920), the first order derivative term can be removed from 
(3.684) by using the following change of variables 


v(x) = Wdexp{-> J" pleas} (3.685) 


ya) = W'oexp-5 | pE)ds}- = prpexpt-5 | pE)de} 6.686) 
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" " Lip ’ 1 fe 
via) =W"(aexp-5 | pOAS}-W')pexpt-5 | pS} 
0 240 


Ls lee 1 : lee 
+S PW (x)expt-5 | pE)ds}-5()P'Wexpt- | pag} 


(3.687) 
Substitution of these results into (3.684) gives 
”" 1 1 ’ 
W"(x)+O(x)W(x)=0, O(x) =a-47 = 0 (3.688) 
where 
2 
wxy=4 ao) (3.689) 


The term Q(x) is called the invariant of the second order ODE. That is, if the 
invariants of two different ODEs are the same, these ODEs are actually equivalent 
(see Problem 3.25). Further discussion of the invariant will be given in Section 
3.4.8. The ODE given in (3.688) is the intermediate form or so-called normal form 
of these different ODEs. Note that the invariant of the adjoint problem of an ODE is 
the same as the original invariant (see Problem 3.29). The existence of invariants in 
ODEs was discovered by Laguerre. This invariant allows transformation between 
ODEs. In 1873, Lie group transformation was used in studying whether an ODE 
can be integrated. All hodograph transformation, Legendre transformation, and 
Riccati transformation can be derived from Lie Group method. This topic is, 
however, out of the scope of the present study. 

We further note that (3.688) can be simplified by using Liouville 
transformation or the Liouville-Green transformation 


W(x)=Vxw(t), t=t(x) (3.690) 
where 
- dx 
Vx =,J— (3.691) 
dt 
. dw di i ™ 
=—=—(=wW+W' 3.692 
err me Ve (3.692) 
oe =(= a + Wise w' e+’ “(Wi (3.693) 
dt dt” F dt Sx 
It is straightforward to prove that 


dj ly. dip 
HE - (Vx) (3.694) 


Thus, (3.693) can be simplified as 


2 

i ™ 

aw +Ww'lax (3.695) 
x. 


Substitution of (3.690) and (3.688) into (3.695) gives 


w= 
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~ ped?) e 
iv = Vx —(S)w-°O(x)w (3.696 
ae FY POW) ) 
Finally, the differential equation becomes 
w-w(x)w=0 (3.697) 
where 
w(x) = —x?O(x) + a rp) (3.698) 


The second term on the right hand side of hae can be rewritten in terms of the 
Schwarzian derivative. That is 


sete (3.699) 


ae CP 7 4 3/29 3/2 (3.700) 
~-d@ 1. 3X 1% 1 
x —(—=) = —— -—(—) = -— x,t 3.701 
Via ®) rele i } ( ) 
where the Schwarzian derivative is defined as 
{x40} =4-2@) (3.702) 
x 2% 


The term “Schwarzian derivative” was coined by A. Cayley in honor of Schwarz. 
However, this derivative had been studied implicitly by Lagrange and Jacobi, and 
explicitly by Kummer. Whether the resultant differential equation is easier to solve 
depends on the function Q(x). 

Further discussion on Liouville transformation can be found in Temme (1996) 
and Zwillinger (1997). 


3.4.8 Transformation and Invariants of ODE 


To consider the invariants of ODEs, we will consider general transformation. 
Consider again a general second an ODE with non-constant coefficients as: 


Lu) = a(x) roy +c(x)u = f(x) (3.703) 


We assume that the unknown oie u can ni be considered as a product of two 
functions with one of them at our disposal: 


u = 2z(x)w(x) (3.704) 
Differentiation of (3.704) and substitution of its result into (3.703) results in 
2. 2 
ie ad ei eG oh ncaa). 08) 
dx? dx dx dx? dx 


Case 1: 
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The function w is at our disposal, and we select w that 


2. 
ae ep ees (3.706) 
dx” dx 


Note, however, that this is precisely the homogeneous equation of (3.703). Thus, w 
is the homogeneous solution of (3.703). But, in general, the determination of w(x) 


may not be a straightforward task. In a sense, (3.704) can be interpreted as a 
variation of parameters. With (3.706), (3.705) can be written as 


2 
d Ze dw 6 dz f(x) 


(3.707) 
dx? wdx adx aw 
Naturally, we assume a change of variable as 
dz 
=_ 3.708 
7 S (3.708) 
This reduces (3.707) to a first order ODE 
a6 SU yp I (3.709) 
dx wdx a aw 


The homogeneous solution of this ODE (i.e., solution of (3.709) with f = 0) is 
1 — [eax 
6, =C—e “4 (3.710) 
an 
Application of variation of parameters leads to the following solution form: 
1 — [eax 
6=CG@)—~¢"* (3.711) 
w 


Substitution of (3.711) into (3.709) gives a differential equation of C(x) 
aC _ fw Jie 


3.712 
dx a ( ) 
Thus, we have 
b 
—dx 
c= [fd adx+C, (3.713) 
a 
Thus, the solution for C is 
dz 1 fede fw [eae 
a ee [|e a de+C] (3.714) 
dx w a 
Finally, integration of (3.714) gives 
I ACL) 
z= [ g(E)dE+C,|dx+C (3.715) 
J wi(ag(a) *x até) ; 


where g(x) = ef", Again, the determination of w(x) may not be a straightforward 
task. 


Case 2: 


Instead of removing the linear order term of z, but rather the first derivative term in 
(3.705), we have 
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ee (3.716) 
dx 
That is, we have 
Oye (3.717) 
dx 2a 
2 2 
dw | es db wb bwda (3.718) 


de 2-adx 2a a? dx 
Substitution of (3.717) and (3.716) into (3.705) gives 
d’z f 
—+lz=—— (3.719) 
dx” aw 
where 


2 
ral b 1 db b da (3.720) 
a 4q* 2adx 2a? dx 
We find that if we assume wu = zv instead of u = zw, we arrive at exactly the same 
equation as (3.720) except for v instead of w: 
2 
cay eee (3.721) 
dx” av 
where the value of / is the same as that given in (3.720). For different ODE of the 
form: 


: d’z az, 
a +b'—+c'z= g(x) (3.722) 
dx” dx 
If it can be transformed to the form (3.720) by u = zv: 
2 
oF ign. (3.723) 
dx? av 
we must have 
2 
Aa'=av, Aan chy. Hage pag (3.724) 
dx dx dx 


where J is a function of x only. Thus, we have 
cl bY ab, bt da'_7_c ab bb da 


['= = 3.725 
a’ 4a? 2a'dx 2a" dx a 4a® 2adx 2a? dx ( ) 
Therefore, J is called an invariant. 
Case 3: 
Finally, we consider a self-adjoint second order ODE: 
fo Gn Le (3.726) 
dx dx 
Thus, we have 
2 
BH OE 5 pF (3.727) 


dx ) dx dx 
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To put this into the standard form, we can let 


d 
va=p, vb= 7 , we=q, vf =F(x) (3.728) 
X 
The first two equations of (3.728) give a differential equation for v 
vb= ava) (3.729) 
dx 
This can be recast as 
ee ace. (3.730) 
a va 
Integration gives 
b 
—dx 
yale (3.731) 
a 
To remove the middle term in (3.727), we can assume a new variable as 
= (& (3.732) 
P 
Then, 
ae (3.733) 


dx dt dx p at 


d, du. d_ du. 1d*u 
= = 3.734 
ade aed pe ew 
We deduce from (3.734) that (3.726) becomes 
2 
u 
—; + pqu = pF (3.735) 


where pg can be rewritten as 
ce 22 ee 
pq= es (3.736) 
a 
This resulted from (3.728) and (3.731). 


3.5 HIGHER ORDER ODE 


The general form of — ODE of mal n can be given as 


dy 
Li y|= 4 t tp, (t)yv =g2(t 3.737 
[y] ma roe 1 Prat) P,( yy g( ) ( ) 
The corresponding boundary conditions are 
Hil =3. 2 later” laa (3.738) 


Without going into mathematical proof, we simply recall that we have learned that 
there are two independent solutions for second order ODE. Actually, this 


182 Theory of Differential Equations in Engineering and Mechanics 


observation is also true for any order. Thus, the general solution for the 
homogeneous form of (3.737) can be expressed as 

V(t) = GY (0) + Cr V2 (0 +E, Yn (0) (3.739) 
To show the validity of this solution, we can substitute (3.739) into the homogeneous 
form of (3.737) to give 


d"y, d" yi dy, 
c + p(t ++ p,_\(t)—+ p,{(t 
il ai” P(t) a P,-1(t) dt P,Q] 
n-1 
y2 d" y2 dy, 
—— ee 2 ate t ae ee 
+c5[ # + p(t) dt”! + + p,(t) dt + Py (t)y2] (3.740) 
+... 
d"y dy dy 
+C,, "+ pi(t Dent pi (t)—+ p, (Hy, ] =0 
a p(t) re P,-1 (0) praae yn] 
Each square bracket is zero since yj, 2, .... and y, are solutions of the homogeneous 


equation. 

Note again that a second order ODE possesses two unknown constants. 
Analogously, there must be 7 unknown constants for an n-th order ODE, as shown in 
(3.740). Consequently, for an n-th order ODE being well-posed we must need n 
properly prescribed boundary conditions (see (3.738)). Otherwise, the differential 
equation is not solvable. 

The next question is whether the boundary conditions are given properly, or 
whether the unknown constants can be determined uniquely by the prescribed 
boundary conditions. The idea of the Wronskian for second order ODEs can be 
readily adapted to n-th ODEs. Application of (3.739) into (3.738) gives a system of 
equations as: 


CY (to) +4** +6, Vn (to) = Yo 


CY (to) +2*2 +6, Yn (lo) = YO (3.741) 


-1 -1 -1 
CM Uy) te- + Gn lo) = 90"? 
Clearly, for the system to have a unique solution, we must have the Wronskian be 
nonzero: 


V(t) Voto) os Yn (to) 
V(t) Vo(t) Yn to) 
W(Yi¥ae In \(o)=| ~; Se TE #0 (3,742) 
H(t) Po) = vy Co) 
We first consider the homogeneous ODE that 
y) + PyO) 4004 P y'+Py=0 (3.743) 


As before, we seek an exponential function as the solution for the case of constant 
coefficients: 


y=e™ (3.744) 
Substitution of (3.744) into (3.743) gives the following characteristic equation: 
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7" + Pr® 1 +004 P r+ P, =0 (3.745) 
It is a mathematical theorem by Gauss established in 1799 that an n-th order 
polynomial has n roots, and thus (3.745) allows a factorization as 
Z(r)=("-H)T-h) (1 -4,) =9 (3.746) 
Suppose that all the roots are real and that the first k roots are equal. Thus, we have 
the following factorization: 
Zr) =H OMT Mean) OH) = 0 (3.747) 
We also said that the real root 7; is of order k. The k independent solutions 
corresponding to the repeated root of order k can be expressed as: 
y(x) = (Cy Cx te + Cx e™ (3.748) 
The proof for the case of the third order root will be given in the following 
example. For the case with complex conjugate roots (i.e., » = a+ if) of order k, we 
have the & independent solutions as: 


yx) =[(Co + xt + Cp yx") cos Bx 
k 


(3.749) 
4+(Dy + Dix t+++++D,_,x*")sin Bx]e™ 
For the case of a simple root, we have the solution form as 
y(x) = Ce"™ (3.750) 
The general solution can be expressed as (derived by D’Alembert) 
yH=Qy, + Cyy, +--+ Cy, + YO) (3.751) 


Example 3.27 Show that for the case of a triple root, the 3rd order ODE and its 
general solution are 


y" + py"+qy' +sy=0 (3.752) 
y =(C, + Cot + Cyt” )e” (3.753) 

where r is the root of the following characteristic equation 
r+ pr? +qr+s=0 (3.754) 


Solution: For the case of constant coefficients, we seek an exponential solution in 
the form: 
y(t) =e” (3.755) 
Thus, we have 
(7° + pr? +qrts)e" =0 (3.756) 
Since e” cannot be zero (otherwise we have the trivial solution), we arrive at the 


characteristic equation for r. For the case of triple roots, we mean that the 
characteristic equation can be factorized as 


r+ pr +qrts=(r—-ny =0 (3.757) 
On the other hand, we expand the last term in (3.757) as 
r ~3nr? +3 r-4 =0 (3.758) 


Comparison of (3.757) and (3.758) yields the following values of p, qg, and s as: 
2 3 
Pp=-3%, G=3%, S=-KH (3.759) 
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Therefore, the first obvious solution is 

y=e™ (3.760) 
To find the other independent solutions for the case of triple roots, similar to the 
double root case, we seek a solution in the form: 


y=u(tye” (3.761) 
where r satisfies (3.757). Differentiation of (3.761) gives 
yl" =(u"+3ru" +3r7u' +reuje” (3.762) 
y" =(u" +2ru' +r’u)e”™ (3.763) 
y=(u' +ruje™ (3.764) 


Substitution of (3.762) into (3.764) into (3.752) gives 
u" + (3r+ p)u"+ (Gr? +2prt+qwu'+(r> + pr? +qr+s)u=0 (3.765) 
Substitution of (3.757) into (3.765) gives 
u” +3(r—1m)u" +3(r—ny)’u' +(r—n)'u =0 (3.766) 
Therefore, all terms become zero except the first terms, and we have, after 
integrating it three times: 


u(t) = (C, + Ct +C,t*) (3.767) 
Combining (3.767) and (3.761), we obtain the required result 
y =(C,+Cyt+Cyt”)e™ (3.768) 


3.5.1 Euler Equation of Order n 


The definition of the Euler equation for second order given in Section 3.4.1 can 
easily be extended to a higher order. In particular, the Euler equation for the n-th 
order is 
n n-l 
OY ogg e DF hihi ee Te (3.769) 
dx” dx” dx 

Note that the coefficient of each derivative term is a power function x and its 
degree equals the order of differentiation of the same term. Again, we can adopt the 
following change of variables: 


x 


eee, YQHYH (3.770) 
Taking the total differential, we find 
dx =e'dt, as (3.771) 
dx x 


Then, applying chain rule of differentiation, we have the first derivative of y as 
dy dY dt _1dY (3.772) 


dx dt dx x dt 
Similarly, we can apply the chain rule of differentiation one more time to get 
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d*y _d 1dY._1d’¥dt 1 dY 
dx’ dx x dt’ x dt? dx  ?* dt 
ter tay: Vg @ 
x dt? x2 dt x dt dt 
Note that the last of (3.773) rearranges the differentiation in a factorized fashion. 
This step is very important, and this will become obvious when we continue the 
process of differentiation. In particular, applying the chain rule of differentiation 
for the third time gives 
fe Vey Ser Bay Teg wa 
= a - 
ad xd xrdeP xrdt dtd? dt 
_ld_ed 1) CG ay 
xi dt dt dt 


More importantly, we observe that the result can be factorized in an orderly 
fashion. To double check, we consider the fourth order differentiation of y and the 
result is 


(3.773) 


DY 


+2}Y 
(3.774) 


oy 1a @¢Y Ud? 6 ay 
dt xt dt x4 dP xt at’ x4 dt 
1 did d d 
= 1 2 3)Y 
x* dt dt Mit Mit 
Therefore, it becomes obvious that the change of variable for higher differentiation 
can be expressed in a compact form. In fact, one can show that for the k-th 
derivative term, we have 
k 
BY Ne ay ey (3.776) 
dx* x* dt dt dt 
Using the = aaa in - 776), the ee ODE wrtias 


(3.775) 


<(<-). (Fans +a, 2(2-D. (Sans 27 a 


+...+4, 4 +4,¥ = FO 


It is obvious that this is an n-th order ODE with constant coefficients. Thus, the 
solution must be expressed as an exponential function and the corresponding 
characteristics equation is 

r(r—-l)...r7-n+l)+ar(r-))...7—n+2)+..44,_)r+a,=0 (3.778) 
It is obvious that this is an n-th order ODE with constant coefficients. 


Example 3.28 Show that the following generalized Euler equation can also be 
solved by using Euler’s method. This is also known as = problem. 


n n— ty 
(ax +b)" + A (ax-+b)" ae J pet Ay (arb) A, y=0 (G.779) 
dx" 


Solution: Using Euler’s approach, we can assume 
axt+b=e', adx=e'dt (3.780) 
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Then, the first and second order derivative terms can be rewritten as 


i a (3.781) 
dx e dt 
2 2 d da’ 
ws4 a rad — ae) 
eS : (3.782) 
_@ dy dy 
edt’ dt 


Note that all derivatives of y with respect to ¢ appear linearly on the right hand side 
of (3.781) and (3.782). To show that this statement is true for all higher derivatives, 
we use mathematical induction. From (3.781), the statement is true for the case of 
first order, and we assume that it is true for the k-th order: 


dé y at 
dx* = ef k (3.783) 
where P; is a linear function of derivative of y. That is, 
dy dy d*y 
P, = P(—,—..... 3.784 
po are ae ae 


with all derivatives, appears linearly. Differentiation of (3.783) once with respect to 
X gives 
qi! y a d a‘ 


(ep 
dx** e dt a a) 
a _ aP, 
= = {-ke™P. +e* Pe. (3.785) 
qh! dP. qh! 


i elke ( k + - ) = eke k+l 
It is clear that Px+; is a linear function of y's derivatives. This completes the proof. 
Generalizing Euler’s analysis for (3.779) gives 
y=C(axtb)! +C,(ax+b)? +...4+C,(ax+b)” (3.786) 
where 7; (i =1,2,3,...,7) are the characteristic roots of 
riv-)...(r-nt+)+Ar(r-l...7-n4+2)+..4+4,,7+A,=0 (3.787) 
If there is a multiple root of order m, the solution becomes 


y =(ax+b)" {Cy +C, In(ax+b)+...+C,,_,[In(ax +b)]""} 8) 


+C4(ax+b)? +...4C, (ax +b) 


3.5.2 Adjoint ODE 


One general way to solve an n-th order ODE is to consider its adjoint ODE. 
Physically, the adjoint ODE is the governing equation of the integrating factor of 
the origin problem. A particularly important special case is that when the adjoint 
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problem is the same as the original problem (differential equation as well as 
boundary conditions), it is called self-adjoint. 
Consider the following n-th order linear ODE 


d"u d"y du 
L(u) = x bres =0 3.789 
(u) = po(x) z P(x) Pe Pn-1(*) fe Dy(x)u ( ) 


n 
First, we note the product rule of differentiation that 


: {UV} age (3.790) 
dx dx dx 
Rewrite differentiation as a power in bracket form, we have 
yu -<{uv}-uv (3.791) 
Ix 


We are going to see that this formula can be generalized to higher order easily. 
Consider the following identity 


£ (yoy -uv®} =U9y4U%y _yOy _yy® =Uu@y-—uv™ — (3.792) 
X 

Note that all immediate terms canceled successively. This can be rewritten as 
(3.790) 


yu® —4 {ur uv} uv (3.793) 
dx 
Following the same procedure, it is straightforward to see that 
yU® = “fur -U®y® +uv\ -uV®) (3.794) 
In fact, we have the following general form for any order k 
YOM £ (yey -UE MYO EMO — PU E+ yur 
dx 


(3.795) 
Now, we consider the following function: 


d"u d”"y du 
vL(u) = vpo(x) A + yp, (x) r= br -+ VD, 4 co prme2 (x)u (3.796) 
Applying (3.795) to each term on the right of (3.796), we find 
d"u d n- n- ' n- n- 
w(x) = fu pov (Dov)! +. "(por)"? } 
. Ze (3.797) 
+(-1)"u (Pov) 
d. n 
"aa d ts Ne! A— ' Ys ee! A. 
i) = [ul py ul) (pyr)! +t (Du pv} 
i Mes (3.798) 
n—- Pi 
+(-1)"'yw——+ 
ax? 
dud d*(Py-2¥) 
VP, -2(X) — = — ju'p,,_»v —u(p,,_9v) | +u ———— 3.799 
Pn-2 ae RP 2 (p av)’ © ( ) 
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_d d(Pyav 
Pie) = fup,. jw Pa? (3.800) 
vp,,u =Uup,,Vv (3.801) 


Substitution of (3.797) to (3.801) into (3.796) gives 


vL(u) = a u") pov —y"-2) (pov)' 4.4 (-1)"lu(pyv) 
dx 


d fear j=, ' i= i. 
+ fu" ) pv—u™ >) (py +...+(-1)" 7u(p,v) 
dx 


lhe (3.802) 
+ typ, gv—u(D,-2V} 
dx 


+ « {up,1v} + uL(v) 


dx 
where the adjoint operator is defined as 
n a" (Dov 1a” (pv (Py 
Zy= Cy 2 4 ay EY 4. Poets py 3.803) 
dx dx 


Equation (3.802) can fae be simplified to - following form: 
vL(u) —uL(v) = “(PU v)} (3.804) 


where P(u,v) can be identified from ae as a summation of all terms under the 
differentiation sign d/dx. If we want to solve the original ODE such that wu satisfies 
(3.789) and v satisfies the adjoint ODE 


L(v) =0, (3.805) 
then, the left hand side of (3.804) equals zero, and we have 
d 
—}P(u,v); =0 3.806 
v7 {Plu,v)} (3.806) 
Integration of (3.806) yields 
Plu,v)=c (3.807) 


If we can solve the adjoint ODE given in (3.805), we will have n independent 
solutions. Substitution of each of these solutions into (3.807) results in n algebraic 


equations for n unknown u, u’, u", ..., and u"~) as: 
Pu,v,) =¢,, PU,Vv>) =C,..., PCV, ) =C, (3.808) 
If we solve for u, the solution can be expressed in terms of n unknown constants as: 
U =U(Vy, Vo50--2 Vy 5 C2 Cas Opn (3.809) 


An important particular case is that the associated adjoint ODE is the same as the 
original ODE. Most of the important ODEs found in physics, science, and 
mechanics are of the adjoint type. Mathematically, the self-adjoint ODE is defined 
as 

L(u) = L(u) (3.810) 


Note that the adjoint of the adjoint of an ODE is the original ODE. 
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Example 3.29 This example considers a second order ODE with circular functions 
as its coefficients 


2 
nae ine" 4 Gn e=0 (3.811) 
dx? dx 

Solution: This is a linear first order homogeneous ODE. For n = 2, (3.789) and 
(3.805) become 

2 


u lu 
L(u) = po(x) S +P; (4 + po(x)u =0 (3.812) 
Fy 4 (Pw) _ Un) , 
L(y) ie = Pov =0 (3.813) 
The adjoint ODE given in (3.813) can be written explicitly as 
L(v) = pov" +(2p — PV’ + (Po Pi + P2)V =O (3.814) 
Comparison of (3.813) and (3.814) gives 
Po(x) =sinx, p(x) =2cosx, p,(x)=sinx (3.815) 
Substitution of (3.815) into (3.814) gives 
(sin x)v" + (2sin x)v =0 (3.816) 
This can be further simplified as 
v"+2v=0 (3.817) 


The general solution of (3.817) is 
v =C,sinJ2x+C, cos2x (3.818) 
For the second order ODE, the function P can be defined as: 


vL(u) —uL(v) = “1Ptu») =0 (3.819) 
X 


P(u,v) =u'pov—u(pov) tupyv =c (3.820) 
The two independent functions are 
vy, = sin Bx =sinJ2x, Vy = cos Bx = cos 2x (3.821) 
The simultaneous equations given in (3.808) are 
P(u,v,) = Upov —U(Por)' + UP, 


a (3.822) 
=u'sinxsin Bx+u(cos xsin Bx — Psin xcos Bx) =C, 

P(u, V2) = u'pov, —U( Pov)’ + UpyV2 (3.823) 

=u'sinxcos Bx +u(cos xcos Bx + #sinxsin Bx) =C, 

That is, we have two equations for two unknowns, and the solution for wu is 
1 
u = —=——$}-C, cos(V2x) + C sin(J2x) (3.824) 
J2 sin x : : } 


Note that if the adjoint ODE cannot be solved, the present approach does not work. 
Therefore, obviously only very special kinds of ODEs can be solved using this 
approach. 
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3.5.3 Sarrus Method 


The Sarrus method allows one to reduce the order of a second order ODE to 
become a first order ODE. Let us illustrate the method by using a particular 
example: 


a 4 a3 Loy? dy. dy 
yt a. ore (x? +2y ee Ae =0 (3.825) 
Note in (3.825) that the highest order differential (i.e., d’y/dx”) must be of the first 
degree (i.e., linear in power) because it is resulted from differentiation. It is a 
necessary condition that it can be integrated once by exact differential. If the 
highest order in the ODE does not appear linearly, the Sarrus method is 
inapplicable. Let us assume that the following exact differential exists 


2 
dU = year oy oy +(x? +2y” oy Pia (3.826) 
As usual, let us rewrite the first ia as 
ay =p (3.827) 
dx 
dU = {y+3xp +2yp° +(x? +2y?p) Prax 
dx (3.828) 
7 (y + 3xp + 2yp" )dx +(x? +2y* p)dp 
Let us assume that 
U, =(x? +2y"p)dp (3.829) 
Integrating once gives 
Uae pry p=e Saye a (3.830) 
Now take the total differential of U; again, but this time with respect to dx. 
dy dy. ody, d’y 
dU, ={2x—+2 H(x° +2y dx 3.831 
v= a re ea Ye ae (3.831) 
Subtraction of (3.831) from (3.828) gives 
d 
dU — dU, = (vt xP dx (3.832) 
LX 


The first derivative term of dy/dx only appears once on the right hand side, and this 
is a consequence of an exact differential. The left hand side is clearly exact. 
Integrating once more, we have 


U-U, =x (3.833) 
Thus, 
2 dy 
U=U,+xy= x? oe la +xy (3.834) 


Note that dU = 0 is our differential equation and this implies that U = C. Therefore, 
we have the first integral of the second ee as 
d 
a oe (~Y +27=C (3.835) 
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Unfortunately, this equation is clearly not exact and we cannot proceed using the 
Sarrus method. Therefore, the Sarrus method is not a very general method. That is, 
once the resulting equation is not linear in its highest order, we cannot reapply the 
Sarrus method again. 


3.5.4 Rule of Transformation 


We showed earlier that sometimes it is easier to solve a differential equation if the 
roles of variable and dependent function are reversed. This idea will reappear in 
our later discussion of PDEs (e.g., hodograph transformation discussed in Section 
6.15 and in Section 7.7.2). 


Interchange of variable: The interchange of variables is defined as 
dy 1 
dx dx/dy 
With this transformation, we have the second derivative as: 
d°y _d_dy._d_ dy.dy_d_ 1 1 
dx dk de dy Ce dx dy Ele (dx / dy) 


2 1 dx 1 d’x/dy 
(dx / dy) dy” (dx/ dy) (dx/dyy 
Similarly, the third derivative can be evaluated similarly as 
dy _d_d’y._d, d’x/dy’,dy 
dx dx dx?" dy (dx/dyy dx 
d°x/ dy : 3(d*x/dy’y, 1 
(dx/dyy — (dx/ dy) ~~ dx/ dy 
_ -(d?x/ dy’ (de | dy)+ (d?x/ dy y 
(dx /dyy 
Higher derivatives can also be derived accordingly. Let us consider an example. 


(3.836) 


(3.837) 


) (3.838) 


Example 3.30 Solve the following third order nonlinear ODE by interchange of 
variables 


yo 2p dy d°y d*y 
de dx dx dx? dx 
Solution: Application of the formulas given in (3.836) to (3.839) gives 
x d’x/dy’>., 1  ~(d?x/dy*)(dx/ dy) +3(d’x/ dy’) 
(dx/dy — dx/dy (dx / dy)? 
a: 2 
# d ae 1 <=) 
(dx/ dy) (dx/dy) 


yY =0 (3.839) 


(3.840) 
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This is a linear first order nonhomogeneous ODE. 
4d lo 2, ax 1 5 dee d’x 1 


5 F 5 Zt z=0 G.841) 
ey dyy’ dy” (dx/dy) dy” (dx/dy) dy (dx/dy) 
The first and third terms cancel out and the equation is simplified to 
3 2 
= Pee, (3.842) 
dy dy 
To solve (3.842), we can assume a reduction of order by 
2 
z= ues (3.843) 
dy 


This technique of reduction of order will be discussed in more detail in Section 
3.5.8. Using (3.843), (3.842) becomes 


Lad +z=0 (3.844) 
dy 
Integrating this by using separation of variables, we have 
2 
z= a —, = Qe” (3.845) 
dy 
Thus, the final solution is 
x=CQe"+Qy+C, (3.846) 


Change of dependent variable: Consider y a function of x or y(x), and we propose 
to change the unknown from y to z: 


y=9(2) (3.847) 
Applying the chain rule, we find the first, second, and third derivatives as 
dy dydz dz 
7 =¢' 3.848 
dx dz dx ¢ OF dx ( ) 
dy d dy d dz d’z 
= = ' "z + b'(z 3.849 
Pe he dy) qe Oe oe =9"(F oh Oe) 2 (3.849) 
ay _ a (2) 
dx dx “a 


3 
= 8"MEP +26" ee ee ill (3.850) 


=9'"(z ea) +39"(z XS eave on 


Higher derivatives can : found by aad a a procedure. The dependent 
variable is changed from y to z. The most difficult part is the identification of an 
appropriate function «(z). The application of this change of dependent variable is 
considered through the following example. 
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Example 3.31 Solve the following second order nonlinear ODE by changing the 
dependent variable 


d’y dy,2 2, ay 
l+y? 2y-1 +3x(14 =0 3.851 
( yy (2y 1G ( Le ae (3.851) 
Solution: We consider the following change of variable 
y=tanz (3.852) 
Note that 
Doge s@ 1+ tan’ z = sec* z (3.853) 
dx dx 
d*y ) dz 2 2 d°z 
— + =2sec* z tan z(—)* +sec” z—— 3.854 
7 ztan2(F) tsec? 2 (3.854) 
Substitutions of (3.853) and (3.854) into (3.851) gives 
d°z dz.» dz 
+(—)° +3x— =0 (3.855) 
dx’ dx dx 
The use of reduction of order gives 
ae +" +3x¢ =0 (3.856) 
dx 
where 
dz 
== 3.857 
9p = (3.857) 


This is the Bernoulli equation and can be solved analytically as discussed in Section 
Bit. 


Change of independent variable: Consider y as a function of x or y(x), and we 
propose to change the unknown from x to z: 


x =¢(z) (3.858) 
The first, second, and third derivatives can be evaluated as 
dy _dydz_ dy | (3.859) 


dx dzdx dx ¢'(z) 
d’y_d (dyy_d dy 1 1 
dx? dx dx dz dz $(z) dx/dz 

_d’y 1 dy/dz $"(z) 
dz” @(2)P (DP #@) 

_ 1 a@y ’@Qw 
(oy dz? {P'(ay az 


(3.860) 
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d’y_d (42) - d 1 d’y $¢(z) dy, 1 
de dx dx? dz {h(z)P dz? p'(z)P dz dx/dz 
_ 1 dy ,V@ @y_ ¢@#'@-39'OV w 
Wy dee Way" az’ wey dz 
Again, the most difficult part is finding an appropriate function ¢(z). The following 
example illustrates this method. 


861) 


Example 3.32 Consider the second order ODE 
d’y x dy y 
+ 
dx’ 1-x? dx 1-x? 
Solve this differential equation by using the following change of variables 


=0 (3.862) 


xX =cosz (3.863) 
Solution: The derivative can be evaluated using the chain rule as: 
d d. 
Oe = —cosec z dy (3.864) 
dx dz dx dz 
2 
d 1 
7 2 a cosee 2 P) dx/d 
oe aE ° eS * 2 (3.865) 
=—cosec” zcot z ay +cosec” Pos 
dz dz? 
Note also that 
. ae ae = os = cosec zcotz (3.866) 
l1-x° l-cos*z sin’ z 
Substitution of (3.864) to (3.866) into (3.862) yields 
d’y 
x 
This is a harmonic equation and the solution is 
y=Acosz+Bsinz (3.868) 


Thus, the final solution is 


y= Ax+ BY1-2? (3.869) 


3.5.5 Homogeneous Equation 


We have learned that certain kinds of first order ODEs are called homogeneous and 
they can be solved by a standard change of variables to make it becoming 
separable. The idea of homogeneous can be extended to consider higher order 
ODEs. The following discussion follows from Bateman (1918). 

In this approach, we assign a number system to each quantity of a differential 
equation as: 
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Table 3.2 Number system used in the homogeneous system 


Variables | x y y! y" yl" vee | x? y y? 


Weights m n n-m n—2m | n-3m | | 2m | 2n 2(n-m) 


The differential equation is called “homogeneous” if the sum of the numbers for 
each term in the differential system is the same when m and n are chosen properly. 
Sometimes, a differential equation is homogeneous for all values of m and n, but 
more often, there is only one value of the ratio n/m for which the differential 
equation is homogeneous. The ratio n/m is called the grade of the equation, denoted 
by p. 

Consider the following differential equation as an example: 


2 
o(Dy 4 x2yZ2_y =0 (3.870) 
dx dx 


According to the weighting system given in Table 3.2, the sum of the number of the 
first term is 2m+2n—2m = 2n, that for the second term is 2m+n+(n-—2m) = 2n and 
that for the last one is simply 2n. Thus, the sum of the numbers for each term is the 
same regardless of the values of m and n. Consider another example that 


a d 
4 y 3 Ly 2 
x* ——-(x° +6xy)—-5 =0 3.871 


The middle term actually consists of two different sums of the numbers. The sum of 
the number of the first term is 4m+n-2m = 2m+n, that of the second term is 
3m+n—m = 2m-+n, that of the third terms is m+n+n—m = 2n, and that of the fourth 
term is 2n. Thus, for the differential equation to be homogeneous, we require that 
2m+n=2n, or2m=n (3.872) 
Therefore, the grade of the equation is two (i.e., p =2). 
When m +# 0, the following change of variables can be applied for all values 
of m and n: 
y=xPé, @ xP ly (3.873) 
dx 
where € and 7 are the new independent variable and the new unknown. 
Differentiating the first of (3.873) gives 
dy _ p-l p dé 
7 px’ E+x -" (3.874) 
The last of (3.874) can be obtained by putting the second definition of (3.873) into 
the left hand side of (3.874). This results in 


dé 1 


= =U Pe) (3.875) 
dx x 
Note from the chain rule and (3.875) that 
d dnd 1 d 
1 BS 1 gpg (3.876) 


dx d&dx x dé 
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Differentiating the second equation of (3.873) gives 


d’y 2 dn 
——=(p—-l)x? “n+xP > — 3.877 
re (p-Ix" "7 ee (3.877) 
Combining (3.877) and (3.876) gives 
ay p2 dy 
=x? *[(p-Dn+(n- pé)—] (3.878 
Pr (p—-)n+(9- ps) dé ) 
Differentiation of (3.878) with respect to x one more time gives 
d° : d 
Sx (P= 2x” Up—Dn +7 ~ Ps) @ 
dn d dé. d dd om 
4yP-2 1 7] + 7 | + 7] 
x" {(p ee on Pe dé (7 Po) Ge) 
Substitution of (3.876) into (3.879) leads to 
d’ : d 
<= x? (pp - 2) +(p-3)q— pe) 
dx dé 


(3.880) 


dn,» 2d 
+ + 

(7 PENG (7- ps) de 

Similarly, higher order derivatives can be found. In general, the derivatives on the 

right of these equations are one order lower than those on the left before 

transformation. Thus, substituting these results into the original differential equation, 

the order of the original differential equation will be reduced one order after the 
change of variables. 

If the original equation is homogeneous for all values of m and n, it must have 


the form 
2 3 
Pee ae Ye? =0 (3.881) 
dx dx 
Note that (3.870) is another example of this form. If such a differential equation is 
linear, it is clearly of Euler type. If the equation is second order, transformation of 
(3.873) will convert it to first order and it is solvable by the technique of first order 
homogeneous type. If the equation is of third order, it would be converted into 
second order of grade | (i.e., p = n/m = 1). For the case that m = 0, we can use a 
substitution of 


oon (3.882) 
The method will be illustrated in the next example. 
Example 3.33 Consider the second order ODE 
2 
2, dy 2.22 
x y— Hx (—) - 3.883 
ya ( a y (3.883) 


Solution: As shown earlier in (3.870), the equation is homogeneous as the total 
weights of all terms are 2n and thus it is homogeneous for all values of m and n. 
Substitution of (3.878) and (3.873) into (3.883) gives 
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x2(xPE)x? Up — Dn + (9 — py = x22? 29?) —- 22 (3.884) 


dé 
Simplification leads to 
cil Wet eal EEN (3.885) 
dé & 4-PS 


Since (3.885) is homogeneous for all values of m and n, we can set p = 1 for 
simplicity. Integration of (3.885) gives 
n=(Iné+C,)é (3.886) 
Back substitution of the definition given in (3.873) gives 
Y Yim) +e, (3.887) 
axe xe" x 


As expected, it is the homogeneous type of first order ODE. Thus, we can use the 
standard change of variables as 


2 =i (3.888) 
x 
Using this change of variables and integrating, we obtain the final result as: 
= Ci+C2x 
y=xe (3.889) 


3.5.6 Undetermined Coefficient for Nonhomogeneous ODE 


The method of undetermined coefficients discussed for second order ODEs can be 
easily extended to the case of higher order ODEs. All earlier discussions given in 
Section 3.3.3 remain valid here. That is, by examining the mathematical form of the 
nonhomogeneous term, we can propose a particular solution form that can provide 
terms that can match those given in the nonhomogeneous terms, one by one. For 
example, a nonhomogeneous term in polynomials must be matched by assuming 
the solution in polynomials. We will proceed by considering the specific examples: 


Example 3.34 Consider a third order ODE with the nonhomogeneous term given in 
an exponential function as 


y" —3y"+3y'-y =4e' (3.890) 
Solution: Note that this is a linear ODE with constant coefficients. Thus, we should 


try for an exponential solution as for the homogeneous ODE (i.e., ignoring the term 
on the right hand side of (3.890) as 


y(t) =e" (3.891) 
Substitution of (3.891) into (3.890) gives the following characteristic equation: 
r> —3r? +3r-1=0 (3.892) 


Recalling the binomial theorem, we recognize that it can be factorized as 
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(r-1) =0 (3.893) 
Thus, the solution can be expressed as 
y,(t) = qe! +e,te* +¢,t7e" (3.894) 


Since the nonhomogeneous term on the right hand side of (3.890) matches the triple 
root of the homogeneous ODE, we need to add f to the trial particular function as 


y(t) = Ate! (3.895) 
Differentiation of (3.895) gives 
y, =3At'e' + Ate’ (3.896) 
y, = 6Ate' + 6Ate' + At’e! (3.897) 
y,” =6de' +18Ate' +9 Ate’ + Ate! (3.898) 
Substitution of these results into (3.890) gives 
6 Ae’ = 4e' (3.899) 
Thus, the unknown constant is 
a ; (3.900) 
Finally, the particular solution becomes 
2 
i= gre (3.901) 


Adding the homogeneous solution and the particular solution, we obtain 


y(t) = qe! teste! +0,1°e' + ste (3.902) 


Example 3.35 Consider a fourth order ODE with nonhomogeneous terms given in 
terms of a circular function as 


y 42y"+ y =3sint—S5cost (3.903) 
Solution: The homogeneous solution is considered by seeking 
y(t) =e" (3.904) 
The corresponding characteristic equation is 
4 42r? +1=(7? +1)(?? +1)=0 (3.905) 
The roots of r are i and —i and both are double roots. The homogeneous solution is 
y, (t) = ¢, cost +c, sint+c,fcost+c,tsint (3.906) 


As expected, there are four unknown constants. For the particular solution, we 
observe that the nonhomogeneous terms do match the characteristic roots and thus 
we seek a particular solution in the form 


Y(t) =0?(Asint + Bcost) (3.907) 
Differentiation of (3.907) gives 
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Y" =(4At+2B-Bt*) cost +(2A—4Bt— At’) sint (3.908) 


Y"" = (-12B-8At+ Bt?) cost +(—124 + 8Bt — At?) sint (3.909) 


Substitution of (3.908) and (3.909) into (3.903) gives 
—8Asint—8Bcost =3sint—Scost (3.910) 
Matching the coefficients of sine and cosine on both sides gives two equations for A 
and B and their solutions are: 
A=-3/8, B=5/8 (3.911) 


Therefore, the particular solution is 
Yi=-3t sint +2 cost (3.912) 
Summation of the homogeneous solution and particular solution leads to 


V1) =V,O+YO) 


ae eee (3.913) 
i eA a aay ae cost 


3.5.7 Variation of Parameters 


The idea of variation of parameters was introduced by Euler but its application to 
higher order ODEs was done by Lagrange. In particular, we consider the following 
linear n-th order ODE with a nonhomogeneous term: 


V+ QV? ++ PriOy' + POY =8O) (3.914) 
Let us assume that the homogeneous solution is known: 
WO =QYO+ eV Ot +e, y, 0) (3.915) 
Using variation of parameters, we assume the particular solution for (3.914) as 
yp) =m (1) (0) +uy (Dy) ++, Oy, (3.916) 


Differentiation of (3.914) gives 


Vp C= (Uy +Uay2 +2 FWY y )+ (UY + May, thu, VY) — (3.917) 
Following our proof of the second order, Lagrange set the following group to zero: 
Uy, tus y, ++ +ujy, =0 (3.918) 
This is the first equation governing the first derivative of the unknown functions w;. 
Differentiation of (3.917) with condition (3.918) gives 


Vp =(Uiy tuys to +u,y,)+ (uy tunys ttu,y,) (3.919) 
We again set the first bracket term on the right hand side of (3.885) to zero 
Uy, +Usy, +o-+uLy, =0 (3.920) 
This is the second equation governing the first derivative of the unknown functions 
u; We can repeat this differentiation procedure. In summary, up to differentiation 
of n—1 times we have the following relations: 


uy + uy ye) peat, yo =0, k=1,...,n-2 (3.921) 
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yp? =m) +4u,y, k=0,Le052-1 (3.922) 
For the n-th differentiation, we have 
= (uj Sill D feet nae »)+ (u,34” +: +1, (3.923) 


Substitution of (3.922) and (3.923) into (3.914) gives 
(u ee D pasta rye D+ (uy! fob -+u,y0) 


+p (O(uy{"- D4. -+U, yo) (3.924) 


+PyaO(uyt tetany, + PaO(w Hn = BO 
Grouping all terms of u;, we get 


C Dt py )+ us (yf? + piyh 


m (> + py" (n-1) 


— (3.925) 


+..+Uy, (y + pyyO) +094 p, y,)+ ae tee+ul ye \= g(t) 
Since yx (kK = 1,2,...,) are the homogeneous solutions of (3.914), we must have all 


the bracket terms in (3.925) identically zero, except for the last one on the left hand 
side of (3.925). This provides the last equation for the first derivative of u; as: 


uy) teetul yo) = (3.926) 
In summary, we have n equations for 7 unknowns: 
my +o tun Yn 
uy toc t+uiy, =0 
eal nn é 927) 
uly Dan. tui yr Li); xs =¢g 
The system can be expressed in matrix form: 
Noo In |jm} {0 
: : i b=dibe (3.928) 
yer) 7 yo 1) [u,’ 1 
The solution for each unknown can be solved by Cramer’s rule as: 
t)W,(t 
v= SEO where WO =WO mo HNO (3.929) 


Integration of (3.929) gives the unknown functions defined in (3.916) 
* g(s)W,(s) 


u,(t)= ds, k =1,...,n 3.930 
O=] TO (3.930) 
Finally, the particular solution is ae as 
&(s)W,(s) 
t ix ds |y,(t 3.931 
yp (t)= 3 Wis) a) (3.931) 


The homogeneous solution ga can be added to this particular solution to obtain 
the general solution for (3.914). 
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Example 3.36 Consider the following third order ODE with its homogeneous 
solutions given: 


y"-y"—-y'+y=8(0), 
y(t) =e, y(t) =te’, y3() =e" 


Solution: We now apply Lagrange’s variation of parameters technique. For the 
present case n = 3, we have the particular solution as: 


3 

yO = >| , 5 is|y () (3.933) 
Recall the definition of the Wronskian 

V(t) V2) 8 Vato) 
V(t) Voto) + VA Cto) 


(3.932) 


t 


W (ds V2002s¥n )(to) = #0 (3.934) 


Hy) YPC) = wo) 
Substitution of the homogeneous solutions into (3.934) gives 
é te’ e! 
W(t) =|e’ (t + le -e*|=4e' (3.935) 
é (t + 2) e et 
Replacing the first column by (0,0,1) we have 
0 te’ e* 
W,(t) =|0 (t +1)e -e'|=-2r-1 (3.936) 
1 (¢ + 2) e ¢* 
Similarly, we can find W2 and W3 as 
e 0 et 
W,(th=|e’ 0 -e*}=2 (3.937) 
e 1 et 
é te 
W;(t) =|e’ (t+ le’ 
e (t+2)e 1 


=e"! (3.938) 


Substitution of these results into (3.933) gives 
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3.) pi W, 
yp (= a = 7am aly, (t) 


4 = ' ; 28) 
nef OED geste f Sater SO ay 3.939) 
ty 4e* % 4e* de® 


The final solution given in (3.939) is a function of g and integration can be 
conducted once this nonhomogeneous term is given. 


3.5.8 Reduction to Lower Order 


We now consider a special form of n-th order ODE that all derivatives of order 
lower than k do not appear in the ODE. Mathematically, we have 


FER” pa S0 (3.940) 
where x is the unknown function and ¢ is the variable. Naturally, this suggests a 
change of variables that the lowest order derivative term is the new unknown as: 


xO ay (3.941) 
It is clear that now the new differential equation is of order n-k and appears as 
F(t,y,y sy") =0 (3.942) 


Suppose that the solution of this k-th ODE can be solved, and the solution is 
written symbolically as: 


Y= PEC Ce) (3.943) 
Then, back substitution of (3.394) into (3.941) gives a k-th order ODE: 
x) = &(t,C,,°°",C,_4) (3.944) 


Note, however, that now the right hand side is only a function of ¢. Therefore, it can be 
solved by integrating the right hand side times as: 
x=W(t,C,°"7,C,) (3.945) 


The following example will illustrate its usefulness. 


Example 3.37 Solve the following fifth order ODE: 


5 4 
eee) (3.946) 
dt t dt 

Solution: This is a linear first order nonhomogeneous ODE. 

d*x 
—=y (3.947) 
dt* 

aes =0 (3.948) 
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This can be integrated readily to get 


y=ct (3.949) 
Back substitution of the definition of y gives 
4 
uae =ct (3.950) 
dt 
Finally, integration of (3.950) yields the solution 
X=Of +oyt +0307 +e4t +s (3.951) 


Example 3.38 Solve the following nonlinear ODE 


2 
Py aa (3.952) 
dt? dt 
Solution: We can reduce the order by assuming 
dx 
= 3.953 
cia (3.953) 
Substitution of (3.953) into (3.952) gives 
a (3.954) 
dx 
There are two solutions for y and they are 
poo, Bes (3.955) 
dx x 
The solution of the second equation of (3.955) is 
yHex (3.956) 
Back substitution of (3.953) into (3.956) yields 
dx 
—=¢c 3.957 
dt 2 ( ) 
Integration gives the final solution as 
x=Qe" (3.958) 


Another type of reduction of order for an n-th ODE will be considered in Section 
3.5.14 for the case of autonomous differential equations (i.e., variable does not 
appear explicitly in the ODE). 


3.5.9 Exact Condition 


Consider the most general form of linear ODE of order n 


Polxy + yO +...+ PrCOY = £0) (3.959) 
The adjoint of this n-th order ODE is 
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n n-1 
( 1" d (Pov) ( iy d (Pv) a d(p,,1V) 

dx" dx" dx 
This adjoint problem has been discussed in Section 3.5.2. The adjoint ODE actually is 
the governing equation for the integrating factor v. Thus, if an n-th order ODE is exact 
we must have v = 1, and with this information we have the following condition for the 
coefficients to satisfy 


+p,v=0 (3.960) 


qd" n-l 

(nt 20, py 4 Ee Prt. =0 (3.961) 
dx" dx" dx 

This is the condition for an n-th order ODE to be exact. 


3.5.10 Factorization of ODE 


The factorization technique considered here closely relates to the symbolic methods 
reported in some textbooks on differential equations. This symbolic technique was 
developed by Boole 1859 and Lobatto in 1837. Consider an n-th order ODE with 
constant coefficients 


d® qd? qd? 
Ca + Ay cs + A, ne t..+ A, )u= f(x) (3.962) 


For the homogeneous case, assuming an exponential function, we arrive at a 
characteristic equation 


r+ Ar") + Ar"? +..4.4,=0 (3.963) 
We can factorize (3.962) as 
d d d 
—-—a,)(— —a,):++(——a, u= f(x 3.964 
Sr de ) a nu = f(x) ( ) 
where a; = 1,2,..., n are the roots of (3.963). The homogeneous solution is 
u=Ce™ +Ce?* +..4C,e" (3.965) 


To consider the particular solution, we first consider the case with only one 
differential operator: 


or —a)u = f(x) (3.966) 


This is the most general linear form of first order ODE discussed in Section 3.2.6, and 
its solution is 


“= ew [em fax (3.967) 
This can be solved symbolically as the inverse of the differential operator 
u= ase —a)' f =e" |e fax (3.968) 
dx 
For the homogeneous case that f= 0, we can define 


u= (25 a) '0=Ce™ (3.969) 
dx 
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For the cases of two factorized operators, we considered the following second order 
ODE 


du du 

ae (a+b) ae tabu = f (3.970) 
This can be factorized easily as 
d d 

—b)u= 3.971 
Lore yu = f(x) (3.971) 

The solution of u can be given symbolically as 

d 1,d | d d | 

anf pyri (= = {(—-a)(——b 3.972 
ar a ce ae ae ane (3.972) 


To solve this, in principle, we can apply (3.967) successively to (3.972). 
Alternatively, a symbolic approach based on rational function decomposition was 
developed by Gregory. That is, the solution can be written such that the inverse 
process can be applied to each term of (3.964) one-by-one: 


d d . : 

paloma ora aera ae 6.973) 
d - a 7 is | 

-{mch-a) + No(G-— aa) Nt ot Nq(Go= ay 0 


where JN; (i = 1,2,..., 2) can be found by algebraic means of partial fraction. If r roots 
are equal to a and the rest are distinct, we have 


~¢ 4 _gy (4g. (4g 
wt a) . 4,41) os a) f 
| | ae 
={N(G--4) + Noa) ith. (3.974) 


d = d -| d 4 
+N ,.(—-—a) +N.,,(—~—a, +..+N,(—-~a 
ors ) ri are ne *) MF 
For the case of repeated roots, by applying (3.967) r times we have 
(o ar ee e* |. J e™ faxed (3.975) 
xX 


Using (3.971) and (3.972) as an illustration, we assume u = e”* and this leads to the 
following considerations: 
1 N, N, _(m—b)N, +(m—a)N, 


= (3.976) 
(m—a)(m—b) (m—a) (m—b) (m—a)(m—b) 
Equating the constant and m order term, we have 
N,+N,=0, 5bN,+aN,=-1 (3.977) 
The solutions of this set of equations are 
M=—, Ny =-—— (3.978) 


To see the validity of this algebraic approach, we can consider the actual 
differential operator: 
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d d | d 4 d 4 
—-—a)(—~—b =N, +N. b 3.979 
Ce oh ee (3.979) 
Applying the original differential operator on both sides of (3.979), the left hand 


side is fand the right hand side becomes 


oo ar ee ee A ee fig. 
RHS ars crm re a) ae a aX - =o —-=b)  F 


d d d d, 
= NED) f+ Nol <-a)f=N, 2 Npf +N, LD Nyaf (3.980) 


=(N,+ Nyy ~(N,b+Noa)f 
X 


To get back the original nonhomogeneous function, we must set 
N,+N,=0, bN,+aN,=-1 (3.981) 
These are precisely what we got in the partial fraction analysis for (3.977). This is the 
reason why we can replace the inverse differential operator by algebraic analysis. 
Return to our second order derivative problem given in (3.971) 
d 1 d 


u rr ay ay f= (L- ay fay f} 


er: —— {e™ fen fax - ee" fax 


This completes the procedure of symbolic analysis for the factorized form. 


(3.982) 


Example 3.39 Find the particular solution of the following fourth order differential 
equation using the factorization and symbolic technique 


4 
cata FY FY g® -ay= (3.983) 
dx dx? dx dx 
Solution: The characteristic equation of (3.983) is 
m* +4m?> +3m” —4m-4=0 (3.984) 
This can be factorized as 


(m+2)°(m-1)(m+1) =0 (3.985) 
According to (3.964), the rae ie can be factorized as 
Ce +2)” © m1 Dy = f(x) (3.986) 
Applying symbolic analysis, we ma 
-1 
{4 +2) C > 1B | f(x) (3.987) 


The associated algebraic analysis using ar fractions is 
1 _ 4m+ii 1 z 1 


(m+2)°(m+1)(m—-1)  9m+2)? m+) 18(m-]) 


Therefore, the final solution is given as 


(3.988) 
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1 
y= 54 < +1 pe**| e* fdxdx — 7 e*fet fax+ rd e* fdx (3.989) 


For the case of non-constant coefficients, the symbolic analysis can be extended to the 
case of an arbitrary differential operator instead of d/dx. In particular, we can write 


C1" + AMT" +All"? +..44,u= f(x) (3.990) 
where the operator /7 satisfies the following rules: 
IT(au) =alT(u), 
IT(u+v) =IT(u)+/T(v), (3.991) 
TT" TT" (u) = 17""*" (u) 
Then, the solution can be expressed as 
BoC eATT 4 Adsl eA EG (3.992) 
Using symbolic method, the solution can be rewritten as 
u=N,U1-a)' f+N,(T-a) f+..4N,T-a,)'f (3.993) 
This method is demonstrated in the following example. 


Example 3.40 Solve the following second order ODE with non-constant 
coefficients by factorization: 


fH ex rene (x +x—I)u=0 (3.994) 
x 


Solution: Let us define the operator es as: 


Tlu= a (7 -1)u = a a(x)u-u (3.995) 
dx dx 


We now consider that 


I(T -1)\u) (o ay ou-1) 


(3.996) 
OE 6 Ae +a-a')ju 
dx? 


Therefore, we recognize that if ~@ = x, we recover (3.994). Therefore, we have 


u=(17-1)'0-77'0 (3.997) 
The corresponding equations are 
dy dz 
U7 -)y =—-(x+)Dy=0, (1)z=—-xz=0 (3.998) 
dx dx 


The solution is 
u= Cet? 4. Ce" 2 (3.999) 
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3.5.11 Symbolic Method for Nonhomogeneous ODE 


This idea of using the algebraic symbolic method to differential calculus can be 
traced back to the time of Leibniz when he discovered the analogy between his n-th 
differentiation formula for two functions and the binomial theorem (see Eq. (1.27)). 
Subsequently, in 1774 Lagrange also found an algebraic analogy between Taylor 
series expansion and an exponential function of differential operator: 
i) n n ne. 
ux+h=> 22 aaa Ah(u) =(e & -l)u (3.1000) 


! n 
n=0 nt dx 


Cauchy reported the symbolic calculus by Brisson in 1821 and 1823 (both papers 
have been lost). Then, symbolic methods branched into the British school (with G. 
Boole, Gregory, de Morgan, Carmichael, etc.) and the French school (with 
Arbogast, Francais, Cauchy, Laplace, etc.). The application of symbolic algebra to 
differential equations with variable coefficients was made by Boole in 1844, and the 
approach was summarized in a book by Carmichael in 1855. 
In particular, differentiation can be defined as: 
2 3 n 
= - D= call wD" wi (3.1001) 
dx dx dx dx" 
The use of the symbol D gives this method another name, the so-called “D-operator 
method.” Using this notation, we observe that 
De® =ae™, D?e™ =a’°e™,...,. D" =a"e™ (3.1002) 
Let us define a general differential operator as: 
F(D)e™ =(p)D" + p,D"" +...+ p,_;D+ p, en 


=e" F(a) 
Therefore, differentiation of e** with respect to x is done by setting D in F to a. 
Next, we consider the differentiation of another functional form: 


D(e*V) = D(e“ V +e" D(V) (3.1004) 

where V is a function of x. Similarly, higher derivatives can be evaluated as 
D? (e®V) = D?(e* W +2D(e™) DV) +e*D? (V) (3.1005) 

Di (e“V) = D3 (e* W +3D? (e*)D(V) +3D(e™)D? (VV) +e" D3 (V) 
(3.1006) 


(3.1003) 


More generally, we can use the Leibniz formula in (1.27) as: 


l 
D" (e®V) =D" (e* VW +nD" | (e*) DV) + —n(n-1)D" *(e*)D?V 
(e"V) = Die") a Ee arma aaa ey"). 3.1007 


+..t¢e*%D"(V) 
Carrying out the differentiation with respect to the exponential function, we find 


D"(e®V) =a"e"V + na" 'e™ DV) + nin = bee" DV ue" DF) 


1 
=e"(a" 4 na"'D4 - n(n Da"? D? +..4D" WV 


(3.1008) 
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Therefore, we have 
D" (e*V) =e" (D+a)"V (3.1009) 
The exponential function appears to shift the differential operator from D to D+a. 
The result just obtained in (3.1009) can be used in the following differential 
operator: 
F(D)(e"V) =(poD" + p,D" | +...+ p, D+ p, )e7V 
=e" {p)(D+a)" + p,(D+a)""' +...+ p,_\(D+a)+ p, WV (3.1010) 
=e"F(D+aVv 


This analysis can be extended to other functions as (the readers are encouraged to 
verify these themselves): 


F(D*)cos ax = F(-a”) cos ax (3.1011) 
F(D’)sin ax = F(—a”)sin ax (3.1012) 
(D-a)’ {e°V} =e" (D-a+a)yV =e"D°V (3.1013) 


Consider a special form of (3. wie as 


paar aye —* 4-60) a)? 
(D-a)? 6 oe he mae i a) Ka <- 
e” pee (3.1014) 
g(D — ) = . 
ale a 


Note that we have used (3.1010) in getting (3.1014). We now introduce the 
strangest notation of this symbolic approach. In particular, it is obvious that 


Dx =1 (3.1015) 
We introduce the algebraic form of writing its inverse as 
1 
x=—(l 3.1016 
D (I) ( ) 


Recall that D is a differential operator, but not a coefficient. However, using the 
symbolic approach, we just divide through by the operator D as we would do for 
algebra. Similarly, we can extend the idea to the following form 


P ! 
pee te ei (3.1017) 
pip! 
Thus, algebraically we can write: 
xP 1 
ol pO (3.1018) 


Let us now proceed to consider the symbolic method in solving a nonhomogeneous 
ODE as 

F(D)y = f(x) (3.1019) 
We now propose to write the particular solution of (3.1019) as 
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1 
y= Fo)! © (3.1020) 


Note that we have treated the operator D as if it were an ordinary algebraic quantity. 
In this approach, we will follow all plausible algebraic operations and then check 
the final result by differentiation. 


Case (1) For f(x) =e”, (3.1003) suggests that 


a ae (3.1021) 
F(D) F(a) 
We can verify this by differentiating (3.1021) as 
Lary F(A) _ ax 
F(D = = 3.1022 
( Fa. } Fa) e ( ) 


This confirms our usage of symbolic algebra for solving (3.1019). 


Case (ii) For f(x) =e“ and F(D) = (D-a)?@(D), the symbolic method suggests that 


1 ow 1 oo = 1 f eo yas e& 1 
F(D) (D-a)? ¢(D) (D-a)’ $a) g(a) (DY? (3.1023) 
=i ee 
o(a) p! 


In obtaining the last of (3.1023), we have taken 1/D as integration as defined in 
(3.1018). To verify this result, we can differentiate the result by using (3.1017): 

e™ x? 

(D—a)’ $(D){-—_— 

pla) p! 


Thus, the result is verified. We will illustrate the method by the following examples. 


}=e™ (3.1024) 


Example 3.41 Solve the following second order ODE 
(D +3) y =50e** (3.1025) 
Solution: The particular solution is 
1 2x 1 
~ Go 2 
(D +3) (2+3) 

The last of (3.1026) is suggested by the result of (3.1021). It is easy to verify the 
validity of (3.1026). Adding the homogeneous solution gives 

y= (A+ Brje™ +2e* (3.1027) 


500** =2e** (3.1026) 


Yp 


Example 3.42 Solve the following second order ODE 
(D-2)" y = 50e* (3.1028) 
Solution: The particular solution is 
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1 2x 
= ——50e~ (3.1029) 
Yp (D-2° 


For this case, we cannot apply (3.1021) or directly substitute D = 2 because it will 
lead to infinity. Instead, we can apply (3.1013) and (3.1018) to get 
1 
(D-2)° 
It is easy to verify the validity of (3.1030). Adding the homogeneous solution gives 
y =(A+ Bxje* +25x7e* (3.1031) 


For this case, the nonhomogeneous term matches the repeated root of the 
homogeneous ODE. Thus, similar to the method of undetermined coefficients that 
we discussed earlier, special treatment is needed in obtaining our particular 
solution in (3.1030). 


x x 1 x 1 x 
Vy 50e* = 50e”" > = 50¢e (52°) = 25x°e” (3.1030) 


Example 3.43 Find the general solution of the following second order ODE 
(D? +3D+2)y =cos2x (3.1032) 
Solution: Using the symbolic method and employing (3.1011), we have 


n= cos 2x = : cos2x = ! cos2x (3.1033) 

(D* +3D+2) (-4+3D+2) (3D-2) 

Note that we can only substitute the value of D? but not D (see (3.1011)). Next, 

(3.1033) can be simplified as 

3D +2 3D +2 

5 cos2x = 

(9D* —4) (-9x 4-4) 

1 


eT eee) (3.1034) 


cos 2x 


Vp 


= ae (3sin 2x —cos 2x) 
20 


Note that we can employ (3.1011) again in (3.1034). 


Example 3.44 Find the particular solution of the following third order ODE 


(D> +6D* +11D+6)y = 2sin3x (3.1035) 
Solution: Using the symbolic method and employing (3.1012), we have 
2 : 
Vy= ps = i 2sin3x = (9D ~54411D+6) sin3x 
—-9D-54+ + 

ee!) (3.1036) 

1 : 
= —— sin3x 

(D -24) 


Again, note that we can only substitute the value of D? but not D. Thus, we proceed 
like the last example to get 


is) 
NN 
is) 
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1 : D+24 
y= sin 3x = —— sin 
(D-24) (D* —576) 
IE ign Geese dy (3.1037) 
—9—576 585 
= 4 fepade 4 Bein 
195 


If the nonhomogeneous term is given as a power of x, we should expand the 
“inverted” differential operator as: 


F(D)y =x" (3.1038) 
1 


= x” =(a,+a,D+a,D’ +...)x™ 3.1039 
F(D) (a) +4, 2 ) ( ) 


iP 


Let us illustrate this with examples. 


Example 3.45 Solve the following third order ODE by the symbolic method 
(D? +4)y =x? (3.1040) 
Solution: Using the symbolic method and employing (3.1039), we have 
1 2_1 1 > 1 1 1 


= x D* +..)x? 
” (D? +4) 4 Vee 4° 4 16 ) 
(l+—D*) 
4 (3.1041) 
t8 4 
=—(x -— 
7 7 
Therefore, the general solution becomes 
1 1 
Y=VntVp = Acos2x-+ Bsin2x+ 7 (x =) (3.1042) 


We can see that a higher order of the Taylor series expansion is not needed since 
the nonhomogeneous term is only up to second degree in power. For power series 
of higher order, we need to retain more terms in the series expansion of the 
differential operator D. 


Example 3.46 Solve the following third order ODE by the symbolic method 
(D* -4D+3)y=x° (3.1043) 
Solution: Using the symbolic method and employing (3.973), we have 
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! 3 eee 


1 
e—— — 
“a (D?-4D+3) 2 1-D 3-D 
2 3 4 
=1a4p+D?+D?+D*+..) a ee Peal +...)}x3 (3.1044) 
2 3 3 9 27 =8i1 


13. 45 26 80 
=—x+—x°4+—x+ 
3 3 9 27 
Therefore, the general solution becomes 
‘ | 4 26 80 
=y, ty, =Ae*+Be* += += x? += x4 3.1045 
Y=Vh yp 3 3 9 27 ( ) 
As illustrated, more terms are needed in Taylor series expansion for this example. 


The symbolic method introduced here is also known as the “inverse differential 
operator,” which is more efficient for finding a particular solution. If the order of a 
differential equation is higher than third order, the traditional technique of 
undetermined coefficients becomes tedious, whereas the symbolic method becomes 
very effective. Table 3.3 compiles some typical formulas for the symbolic method, 
and the differential operator F(D) in Table 3.3 is defined in (3.1010). 


Table 3.3 Some formulas of symbolic method 


No. ODE g Remark 
1 — oh 
F(D)g =e oe 1 on (3.1021) 
F(a) 
2 =e : 1 3.101 
F(D)g =e" O(x) g=e" O(x) (3.1013) 
F(D+a) 
. Dae =O) g =e fe“ O(x)dx (3.967) 
4 Oy 
F(D*)g = cos ax ee 1 —cosax (3.1011) 
F(-a’) 
5 DR a ect 
F(D*)g =sinax pe —sin ax (3.1012) 
F(-a’) 
6 F(D? +a’)g =cosax _ xsin ax 
2a 
i F(D? +a’)g =sinax _ __ xcosax 


2a 
8 F(D-a)"g=O(x) — g=e® [ | nis J ec" O(x)dx-+-dxdx 
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3.5.12 Removal of the Second Highest Derivative 


A linear n-th order ODE can be written as 


y™ +a,(x)y? 4+...44,,(0y' +4, (~)y = f(x) (3.1046) 
Let y be expressed as a product of two functions 

V(X) = Yo(x)u(x) (3.1047) 

Applying the Leibniz rule of differentiation, we have 
y=Vourt you’ (3.1048) 
y"=yy ut2yyu't+ you" (3.1049) 
y" = yu" +(n-2) ypu” 4+... (3.1050) 
yr) = yu? +(n-Nyyu?™ +... (3.1051) 
y = yu” +nyyu ee zn ypu +... (3.1052) 


Substitution of (3.1047) to (3.1052) into (3.1046) and collection of all terms up to 
ul") gives 


yo +(n—layyo +azyq lu +... = f(x) 


' = -1 
you” +(1V9 + ayyo)u” 7 a 


(3.1053) 
To remove the second term of (3.1053), we must set its coefficient to zero 
LEE (3.1054) 
Yo nt 
This is separable and it can be rewritten as 
cL (3.1055) 
Yo n 
1 
In yy = expl-— | adx+ ¢] (3.1056) 
nN 
1 
Y= Cexpl-— | mar] (3.1057) 
n 


Therefore, the second highest derivative term can be removed by using the 
following transformation 


y=u(x) expl-— fa (x)dx] (3.1058) 


Example 3.47 Solve the following non-constant coefficient ODE by removing the 
second highest derivative term 


2 
Oo sis slv=0 
dx dx (1+ 3x) 


Solution: We see that for this particular ODE 


(3.1059) 
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a, =2 (3.1060) 
Thus, we have the following transformation 
y=ue ~ (3.1061) 
The derivative terms of (3.1061) are 
y'=(-ut+u')e* (3.1062) 
y"=(u-2u'+u")e~ (3.1063) 
Substitution of (3.1061) and (3.1063) into (3.1059) gives a simplified ODE 
u"+ Z 7u=0 (3.1064) 
(1+3x) 


This equation can actually be considered as an extended Euler equation (see 
Example 3.28). The following change of variables can be assumed 
143x=e', or t=In(1+3x) (3.1065) 
Thus, we have 
dt 3 3 
de 1s3n u(x) =U(t) (3.1066) 
yi de dU at _ 3 dU 


(3.1067) 
dx dt dx e dt 
| du d.3d.dt 3. 3d 3a@U 
ee eg ee ee ae 
(3.1068) 


2 
_1 [yeu au 
e' | dt dt 


Finally, the differential equation is converted to an ODE with constant coefficients 


2 
of 2 99 so =0 (3.1069) 
dt dt 
Following the standard procedure of assuming that an exponential solution leads to 
the characteristic equation 


947 -924+2=0 (3.1070) 
G= 4-5-9, (3.1071) 
3 3 
the solution of U is 
U = Ae’? + Be? (3.1072) 


Substitution of this result into the definition of change of variables given in 
(3.1065) and (3.1061) gives the final solution 


u =[A(1+3x)!3 + B0+3x)*7] (3.1073) 
y =[A(+ 3x)? + BO 43x)? Je* (3.1074) 
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3.5.13 Particular Forms 


In this section, we will summarize some particular forms of higher order ODEs that 
can be solved easily. The simplest ODE of order n is 


ee, (3.1075) 
dx" 
This ODE can be solved by applying direct integration n times as: 


y= [[]-[renarnds, 1 +++ dXydx 


(3.1076) 
Seg Hea #2 ae. 
Another simple form of an n-th ODE is 
d” 
7 =f) (3.1077) 
dx 


The integration of this differential equation is less obvious. Direct integration can 
only be applied for the case n=1, 2. For n = 1, we have 


d 
<= fv) (3.1078) 
dx 
This is a special form of separable ODE, and thus the solution can be determined as 
= (226 (3.1079) 
f(y) 
For the case of n = 2, the ODE given (3.1077) becomes 
Pe 
<7 =f) (3.1080) 
dx 
Multiplying both sides by 2(dy/dx), we have 
dy dy dy 
2 =2 3.1081 
erie es (3.1081) 
Note that the left hand side can be recognized as 
dyd’y_ d_dyy dy 
= =2 3.1082 
aot aa ( re fY) i ( ) 


This technique actually closely relates to the evaluation of energy in physical 
problems. It is a very powerful mathematical technique and will be used again in 
later chapters. Thus, we can integrate the last one as 


Sy =2f sendy +6 = 90) +C (3.1083) 
X 


The first derivative becomes 


- =+/o0)+C, (3.1084) 


Integrating one more time, we have 


= dy 
J Joo, 


(3.1085) 
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3.5.14 Autonomous n-th Order ODE 


In this essay, the n-th order ODE is independent of the variable x or more 
specifically x does not appear in the ODE except in the derivative terms. That is, 


FQ Ds )=0 (3.1086) 
dx 


Since none of the coefficients change with x, we call such an ODE autonomous. 
The functional form is independent of x. We can apply a standard change of 


variables as 
dy _ (3.1087) 
dx 


Differentiation of (3.1087) gives 


2 
dy _dp _dpdy __ dp (3.1088) 
dx? dx dy dx dy 


dy _d_ dy. d_, dpdy_ dp. dpy 
— ( = = + p(—) (3.1089) 
de dx dx’ ) ) - dy” dy 


d‘y d@& ee a d 

pa) = 1? $+ oy 

dx dx dx 
: : eo (3.1090) 

d 

E+ (EY + 4p? (LF 


“9 
dy dy dy dy’ 


dy d_d‘ d_3d°p d dp. d? 
= *y= [p= F 4 a +4p aa” 
dx dx dx dy dy’ dy dy 


4 
AED ED iy dog Pay? oa Pry oP 
dy dy dy dy 
(3.1091) 


All higher derivatives can be evaluated using a similar procedure. However, it does 
not appear to have a general form for d"y/dx". However, it is clear that 


dq" d gq? 
7=fp,%,.. (3.1092) 
dx dy dy 
Therefore, the original ODE becomes symbolically 
n-1 
Fy, 7, ©, = 2) =0 (3.1093) 
dy dy"” 


The original unknown y now becomes a variable, and its first derivative p becomes 
the new unknown. Now the variable y appears explicitly in the differential 
equation. Therefore, the procedure used in (3.1087) can no longer be used. In 
addition, even when the original ODE (3.1092) is linear, a change of variable given 
in (3.1087) will make it highly nonlinear. Therefore, this technique is more useful 
if the highest order is second, and in such case, the resulting nonlinear first order 
ODE is more likely to be solved. 
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3.6 SUMMARY AND FURTHER READING 


The methods for solving ODEs have occupied the minds of great mathematicians in 
the last three hundred years. Many of the techniques covered in the chapter were 
developed by Bernoulli, Euler, Fourier, Kirchhoff, Lagrange, D’Alembert, Jacobi, 
Laplace, Poisson, Legendre, Helmholtz, Gauss, Kummer, Clairaut, Riccati, Bessel, 
Boole, Hankel, Fuchs, Cauchy, Riemann, Picard, Frobenius, Stokes, Liouville, 
Monge, Ampere, Darboux, Goursat, Bateman, Airy, Sarrus, Kelvin, Forsyth, and 
many others. Yet, there remain many unsolved ODEs, especially nonlinear ODEs 
or ODEs with non-constant coefficients. Many naive looking ODEs require the 
application of substantial mathematical skills and insights to obtain their solutions. 
In this chapter, we only summarize some of the most notable techniques in 
obtaining the solutions of ODE. These skills are essential before we discuss the 
solution techniques for partial differential equation (PDEs). Very often, when we 
solve a PDE, we convert the PDE into a number of ODEs (such as the separation of 
variables). There are a number of handbooks on differential equations. The most 
comprehensive and newest ones are a series of handbooks by Polyanin and co- 
authors (Polyanin, 2001, Polyanin and Zaitsev, 2002 and 2003). The handbook by 
Zwillinger (1997) covers a number of different techniques and provides insights in 
solving different ODEs as well as PDEs. We suggest that readers identify and 
classify the ODE at hand. It is advisable to look it up in a handbook before trying 
to solve the differential equations by other techniques (such as a series solution 
technique to be covered in Chapter 4). It is quite common that an exotic 
differential equation can easily be transformed into one that has been solved, if 
appropriate change of variables is applied. Indeed, in this chapter we have 
repeatedly demonstrated the power of change of variables. 

Many techniques covered in this chapter are not covered in most modern 
textbooks on differential equations. We have referred extensively to more classic 
textbooks on differential equations, such as Boole (1865), Forysth (1890, 1900, 
1902, 1906, 1956), Goursat (1917), Bateman (1918), Piaggio (1920), Ince (1956), 
Sommerfeld (1949), Erdelyi (1951), and Sneddon (1957). We recommend that 
readers refer to these books for more classical techniques in solving ODEs. 


3.7 PROBLEMS 


Problem 3.1 Find the solution of the following first order ODE 


{yt Vo2+9°)| de—ady=0 (3.1094) 
x =clytyx?+y?} (3.1095) 


Problem 3.2 Find the solution of the following nonlinear first order ODE 


Ans: 


wy =q-y (3.1096) 
dx 


Hint: Use factorization. 
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Ans: 
(x—c)? -(sin! 2)? =0 (3.1097) 
Problem 3.3 Find the solution of the following second order ODE 
2#Y 9D 462 220 (3.1098) 
dx dx 


Hint: Use removal of the second highest derivative. 
Ans: 
y = Axcosx + Bxsinx (3.1099) 


Problem 3.4 Find the solution of the following second order ODE 
d* aa 2 dy 2 


y=0 (3.1100) 
dx’ xdx (1+x) 
Hint: Use removal of the second highest derivative. 
Ans: 
2 
jp hen ep (3.1101) 
x(1+x) x 
Problem 3.5 Find the integrating factor of the following first order ODE 
F (xy) ydx + Fy (xy)xdy = 0 (3.1102) 
Ans: 
: (3.1103) 


&* iF (ay) Fray} 


Problem 3.6 Use the integrating factor to solve the following first order ODE 


(x? y* +l) ydx + (x7 y? -1)xdy =0 (3.1104) 
Ans: 
1 2.9 x 
<x y° +In{(—) =e (3.1105) 
2 y 
Problem 3.7 Use the integrating factor to solve the following first order ODE 
(2x°y* —y)dx +(2xy* —x)dy =0 (3.1106) 
Ans: 
Pag? hose (3.1107) 
xy 


Problem 3.8 Find the integrating factor of the following first order ODE 


Ct s00 dr ~dy=0 (3.1108) 
y 


Ans: 
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U=cos(~) (3.1109) 
x 
Problem 3.9 Find the general solution of the following n-th order ODE with the n- 
th repeated root, 70: 
PrY? ++ Pay" + Pry" + Py’ + Poy =0 (3.1110) 
Ans: 
y=(QtCt+...4+C2" je™ (3.1111) 
where r satisfies the following characteristic equation 
Dal" + Dye” +-.+ Prt Py =(—HY" =0 (3.1112) 


Problem 3.10 By differentiating the solution obtained in Example 3.15, show that 
it is indeed the solution of the given ODE. 


Problem 3.11 In solving a particular form of Navier-Stoke Equation for the case of 
viscous incompressible fluid flow, one arrives at the following form of Riccati 
equation in polar form (Sedov, 1993) 


dp 1 2 
—-—p —gcotd=0 3.1113 
ea ( ) 
Show that by applying the following change of variables 
2 du 
So 5 3.1114 
ny ( ) 
we can convert (3.1113) into the following second order ODE: 
2 
£4 tg =0 (3.1115) 
do do 
Problem 3.12 Show that we can apply the following change of variables 
ae v (3.1116) 
do 
to solve (3.1115) obtained in Problem 3.11 and thus obtain the solution for @. 
Ans: 
= _2sind (3.1117) 
A+cosé 


where A is an unknown constant. 


Problem 3.13 Repeat the solution procedure for solving (3.1115) by using another 
change of variables 


Lu =c08"(5) (3.1118) 


Ans: 
2sin0 


= (3.1119) 
A+cos@ 
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where A is an unknown constant. 


Problem 3.14 Show that the solution for the following particular form of Riccati 
equation encountered in viscous compressible flow 


dp 1 2s 29 , 1 
= *@- Ocos@ =—cos20+1 3.1120 
ae md )sin gsin @ cos hae ( ) 
is 
p= {eord + cath (3.1121) 


Hint: See Chapter 3 of Sedov (1993). 


Problem 3.15 Solve the following nonlinear first order ODE by differentiation: 


d. 
y hey = forty By (3.1122) 


(i) In particular, first consider that 


dy dy. 
+y—~=a, 1+(—)y =b 3.1123 
ae rs yf C? ( ) 


Show that differentiation of both of these with respect to x leads to the following 
ODE: 


2 
142? +y 4% =0 (3.1124) 
dx 
(ii) Next, show that differentiation of (3.1122) also leads to (3.1124). 


(iii) Finally, show that the solution of (3.1122) is 


y+(a-xP ={f@y (3.1125) 
where a is defined in (3.1123). 


Problem 3.16 Find the differential equation for the singular solution of (3.1122). 


Ans: 
dy 


4- srt yB=0 (3.1126) 
142) ™ 


Problem 3.17 Find the solution of the following ODE 


jag (3.1127) 
dx dy 
Ans: 
yeort (3.1128) 
c 


Problem 3.18 Find the solution of the following ODE 
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2, 93 2 
go vey. a a (3.1129) 
dx~ dx dx 
Hint: Assume the following change of variables: 
2 
as = (3.1130) 
dx 


Ans: 


y=|f (~—*)? -ldxdx +. o,x406, (3.1131) 
a 


Problem 3.19 Find the solution of the following ODE 


- ay = ay (3.1132) 
dx dx 
Ans: 
pace tee 405x404 (3.1133) 
Problem 3.20 Find the solution of the following ODE 
ye dx + xe dy =0 (3.1134) 
Ans: 
ev =c (3.1135) 
Problem 3.21 Find the solution of the following ODE 
(x—2xy +e” dx +(y—x? +xe”)dy =0 (3.1136) 
Ans: 
x+y? —2x* y+2xe” =c (3.1137) 


Problem 3.22 Apply the Liouville transformation discussed in Section 3.4.7 to the 
following second order ODE: 


2 
PVN gD a iy (3.1138) 
de 8? de ae 
(i) Use the following transformation 
ler dx 
y=Wexpts |r (3.1139) 
Show that the governing equation for v is 
2 
aya =0 (3.1140) 
x 


(ii) Apply a change of variables x = e’, and show that the answer of (3.1138) is 
y =exp(x"){C,x* 4 (3.1141) 
x 
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Problem 3.23 Apply Liouville transformation discussed in Section 3.4.7 to the 
following second order ODE: 


2 d*z dz 
(l ew +(1 3x) +kz=0 (3.1142) 
Xx 


Find the governing equation for W defined in (3.685). 
Ans: 


da’w (1+x)(1-3x) 
W=0 3.1143 
dx? A(1-x’”)? 


Problem 3.24 Apply Liouville transformation discussed in Section 3.4.7 to the 
following second order ODE: 


ag dg 
1-x’ l+x t(k+1)¢ =0 3.1144) 
( ) ae (1+ x) hk (A+) ( 
Find the governing equation for W defined in (3.685). 
Ans: 
2 — 
@ di C+ Nl 3) 7 =o (3.1145) 
dx 4(1— x") 
Problem 3.25 Consider two different ODEs as: 
d7z dz 
— + P(x) —+O(x)z=0 3.1146 
2 PWT +O) (3.1146) 
ag dg 


(1) By observing the results from Problems 3.23 and 3.24, show that if the invariant 
I is the same for different ODEs, these ODEs are equivalent. More specifically, 
show that the invariants for them are the same if 


1dF 1 1dP 1 
t= 1__ pe = Pr 3.1148 
Oa 4 la A : 
(11) Prove the following relation between z and ¢: 
5 =zexp(> [(P-A)dr (3.1149) 


Problem 3.26 Show that z in Problem 3.23 and ¢ in Problem 3.24 are related by 
z=C(1+x) (3.1150) 
Hint: Use the result of Problem 3.25. 


Problem 3.27 It will be shown in Chapter 4 that the Bessel equation is 


2 2 
Oe Gi VG (3.1151) 
de xax x? 


Its solutions are the Bessel function of the first and second kinds: 
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z=AJ,(x)+ BY, (x) (3.1152) 
Find the solution of the following ODE in terms of Bessel functions 


2 2 
OJ eg” ay 4 ")y=0 (3.1153) 
dx x 


Hint: Use the result of Problem 3.25. 


Ans: 
y=(—)'? (4S, (x) + BY, (2) (3.1154) 
e 


Problem 3.28 It has been shown that 


d’y dy 
—+ P(x) —+O(x)y=0 3.1155 
Fe ( ye O(x)y ( ) 
can be converted to 
d*y 
—+I1(x)v =0 (3.1156) 
dx 


where J is defined in (3.1148). If the solutions of this second order ODE are v; and 
v2, and their ratio is s = v1/v2, show that 


~ =) = {s,x} =2] (3.1157) 
AY 


s 
where {s,x} is the Schwarzian derivative, which is defined in (3.702). 


Problem 3.29 Show that the invariant / of the adjoint ODE of 
2 
fY p(X + O0)y =0 (3.1158) 
dx dx 
is the same as the invariant of the original ODE. 


Problem 3.30 Use the symbolic method to find the particular solution of the 
following linear ODE 


[¢(D*) + Dy(D”)]y = Pcos ax + Osin ax (3.1159) 
where ¢ and y are arbitrary linear differential operators. 


Ans: 
_ ¢(-a” )(P cos ax + Osin ax) + ay (-a’ )(P sin ax —Ocos ax) 


{(-a’)}" +a? {y(-a?)P? 


(3.1160) 


Problem 3.31 Find the integrating factor by inspection and solve the following 
ODE 


(x* + y*)dx —2xydy =0 (3.1161) 
Ans: 
= =e (3.1162) 
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Problem 3.32 Find the integrating factor by inspection and solve the following 
ODE 


(x? — y”)dx + 2xydy =0 (3.1163) 
Ans: 
o 
Inx+-—-=C (3.1164) 
xX 


Problem 3.33 Find the integrating factor by inspection and solve the following 
ODE 


xdy — ydx = (x? + y”)dx (3.1165) 
Ans: 


tan (>) =x+C (3.1166) 
x 


Problem 3.34 Find the integrating factor by inspection and solve the following 
ODE 


ES (3.1167) 
Ee y 
Ans: 
--1/V\ 
sin (—)=y+C (3.1168) 
x 
Problem 3.35 Solve the following ODE 
xdx + ydy + ydx — xdy = 0 (3.1169) 


Hint: Use polar form. 
Ans: 


Inx? +y? -tan (2) =C (3.1170) 
x 


Problem 3.36 Show the validity of column 2 in Table 3.1. 
Problem 3.37 Show that (3.300) is the solution of (3.3299). 


Problem 3.38 Employ the technique discussed in Section 3.5.14 to solve 


d°y dy.» 2 dy 
—=(0 3.1171 
dx? Ce dx ( ) 
Ans: 
Y_ = Ce (3.1172) 
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Problem 3.39 Solve the following ODE 
Mf, (xy)dx + xf (xy)dy = 0 (3.1173) 
Ans: 


PON Siete, yey (3.1174) 
AAM-AW)] 


Problem 3.40 It was reported in Example 3.13 that Ramanujan’s differential 
equations given in (3.241) can be converted to (3.242) and (3.243). This problem 
provides the detail of the derivation. 


(1) Applying the following change of variables 


qz=e’, (3.1175) 
show that (3.241) can be rewritten as 
dP 11 dQ 1 dR 1 2 
= Ps F = PQ-R), = PR 3.1176 
a iS Q) a 3! Q-R) rs at QO) ( ) 
(11) Apply the following stretching transformation 
y=e’'y, B=e*P, O=e"O, R, =e R, (3.1177) 
show that (3.1176) can further be reduced to 
dP 1» dQ, 1 dR, 1 3 
= P ; = PO, -R,), = PR 3.1178 
dy, 12 ( 1 —, ) dy, 3 ( 12; 1 ) dy, 2 ( | Fa | —, ) ( ) 


Comparing (3.1176) and (3.1178), we see that the ODEs are invariants under the 
stretching transformation given in (3.1177). This information allows us to apply a 
scaling transformation to be considered next. 


(111) Apply the following scaling transformation 
R Q!? 


a aan w=y0'”, (3.1179) 
derive the following system of ODEs for u,v, and w: 
du Ww 9 dy W 9 dv w 1 
= u’ —1), = v —2uv +1), =w{l+—(u 
JI dy, 3 ( )» MY dy, Bi » VY dy, t 6 ( Ds 

(3.1180) 

(iv) Finally, prove the following ODEs given in (3.242) and (3.243) 
ti? he ee ot) (3.1181) 

du 6 


dw w (uv-l) 2 
du 3vu?-l) wl 


(3.1182) 


CHAPTER FOUR 


Series Solutions of Second Order ODEs 


4.1 INTRODUCTION 


Series solutions for second order ODEs with non-constant coefficients occupy an 
essential place in the development of the solution techniques for differential 
equations. Physically, many phenomena in sciences and engineering can be 
modeled by second order ODEs. The difficult part of solving many second order 
ODEs relates to the existence of regular singular points and irregular singular 
points in these ODEs. Fuchs, in 1866 and 1868, initiated study of the regular and 
irregular singular points of linear second order differential equations with non- 
constant coefficients. This problem had been considered by a number of 
mathematicians including Hermite, Jordan, Hadamard, Darboux, Poincare, 
Frobenius, Goursat, Thome, and Painlevé. For example, the Bessel equation, which 
was found useful in modeling many physical phenomena, has a regular singular 
point at the origin of the variable. A technique called the Frobenius series has been 
found useful in solving it. Many special functions were defined as the solutions for 
these second order ODEs with non-constant coefficients. The solutions for the 
Bessel equation are called Bessel functions of the first and second kinds. In this 
chapter, we will consider some classical differential equations that fall into this 
category, including the Bessel equation, modified Bessel equation, Legendre 
equation, associated Legendre equation, hypergeometric equation, and generalized 
hypergeometric equation. 

Before we devote ourselves to the series solutions, a thorough introduction to 
the gamma function will be given, which is essential for the later part of the chapter 
in obtaining solutions of these important differential equations. 


4.2 GAMMA FUNCTION 


The birth of the gamma function is related to the factorial function, which is 
defined as 
nl=n(n—l1) 3-2-1 (4.1) 
In addition, it is defined that 
O!=1 (4.2) 
We will see why we have such a strange definition for the zero factorial shortly. 
The gamma function was proposed by Euler. In 1729, at the age of 22, Euler wrote 
a letter to Christian Goldbach (1690-1764), a German living in Moscow, to 
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describe the idea of extending the factorial function to the non-integer case. Euler 
proposed (Davis, 1959) 


(3) 1 Ie 2 Ie 3 | m\(m-+ty""! 
= lim =n! 
1) nt+1}}\2) n+2])(3) n4+3 moo (n+1)(n+2)-**(n+1+m) 


(4.3) 
A proof of (4.3) will be given in Section 4.2.2. This definition leads to an amazing 
formula of x in terms of infinite product (see Problem 4.55). The motivation for 
studying (4.3) is that a” is true not only for integer m but also for the non-integer 
case, and Euler said to the extreme, that the factorial function defined in (4.1) can 
also allow the non-integer case. For example, 2! = 2 and 3! = 6, Euler expected that 
2.5! would be somewhere between 2 and 6. If we take the first four terms of the 
infinite product given in (4.3), we have 2.5! = 2.3828.... As expected, it is between 
2 and 6. Euler later proposed the gamma function in terms of an infinite integral, 
which will be discussed in the next section. The analysis of gamma functions is by 
no means simple but occupies a central place in mathematical analysis, and its 
properties have been investigated by many great mathematicians, including Euler, 
Legendre, Gauss, Weierstrass, Dirichlet, Binet, Hankel, Stirling, Pochhammer, and 
Hadamard. It also forms the basis for studying many functions of series solutions, 
like Bessel functions and hypergeometric functions. 

In the next few sections, we will summarize some essential properties of the 
gamma function. We will see that the gamma function becomes infinity at zero and 
negative integers. To remedy this, French mathematician Hadamard proposed an 
alternative factorial function, which will be discussed in Section 4.3. 


4.2.1 Euler’s Integral Definition 


Instead of using (4.3), Euler discovered that the following integral did give the 
required definition of the factorial for the case of integer z = n, with n being an 
integer: 


P@)= foe tetar (4.4) 


A plot of this gamma function is given in Figure 4.1. First, we rewrite (4.4) in terms 
of argument +1 and apply integration by parts as: 


P(n+l)= [te tde=-1"e" 
0 


Ps nf leat aagigt|| nT (n) (4.5) 
0 0 0 


Note that the upper limit of the first term on the right hand side is in a form of 00/0, 
whereas the lower limit is 0. Then, L’H6pital’s rule can be applied repeatedly to 
reduce the numerator to a constant and thus the upper limit is also zero: 
n n-l 
eu: : _ a 
=-lim—+0= lim =-.=-lim =0 (4.6) 


0 10 @ t—00 e to @ 


n-t|” 
—te 


In short, we have a recursive formula: 
T(n+) =nT(n) (4.7) 
Applying (4.7) repeatedly, we finally get the factorial definition: 
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T(n4+1) =n(n-I)(n—-2)...1-T) =a! (4.8) 
The last part of (4.8) resulted from the fact that [(1) =1, which will be shown in the 
following example. 


Figure 4.1. Plots of gamma function 


Example 4.1 Using Euler’s definition for the gamma function given in (4.4), show 
that 0! = 1. 


Solution: Set 1 = 0 into the following relation between the gamma function and the 
factorial function for the case of integer argument: 


Tan+) =n! (4.9) 
Thus, we have 
T(1) =0! (4.10) 
Using Euler’s integral definition, we obtain 
r= [etd Bc (0-1)=1 (4.11) 
0 
Substitution of (4.11) into (4.10) yields 
Ol=1 (4.12) 


This provides the definition 0! = 1 that we reported earlier. This is a definition 
given to high school students without proof, and now we see why it is so. 


Example 4.2 Find the value of [(1/2) 


Solution: By substituting z =1/2 in (4.4), we have 
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r= I “eta (4.13) 
2 0 
Application of the following change of variables 
t=y°, dt=2ydy (4.14) 
gives 


1 o 2) Or 
T(~) =[ e? —2ydy= 2) e dy 
2 0 y 0 


[Levan 


This is the Laplace or Gauss integral that we encountered and proved in Chapter 1. 


(4.15) 


Example 4.3 The original integral definition of Euler in 1729 is actually given as 
Vas As 
T(z) = I (In—)?'dy (4.16) 
oy 
Show that this definition is equivalent to that given in (4.4). 


Solution: Apply the following change of variables, 


t=In(1/y), or y=e" (4.17) 
Taking the total differential on both sides, we have 
dt = aii or dy=-e ‘dt (4.18) 
y 


Noting from (4.17) that y = 1 for ¢= 0 and y = 0 for t > ©, we can rewrite the integral 
given in (4.16) as 


0 oO 
T(z)=- [tera - I, ett at (4.19) 


This is the same as (4.4) given above. 


4.2.2 Euler’s Factorial Form 


We now show that the gamma function can be expressed in terms of a factorial 
function, which was introduced by Euler in 1729 and has been cited early in (4.3). 
First, we rewrite the integral (4.4) by using the definition of the exponential 
function defined in Example 1.2 in Chapter 1: 
5 n a u n a4 t 
T(z) = tim | teat = lim [ #71 a—-5)"at (4.20) 
nod) nad) n 
We now apply the following change of variables 
t=np (4.21) 
Substitution of (4.21) into (4.20) gives 
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T(z) = lim } ; n? p= (1— BY" ndB = lim n? [. Br -py"dB (4.22) 


Integration by parts reduces the integral _ 


T(z) = bas n 2 “- By" | 


~[6°(1-py""n(-ap)) 


(4.23) 
= lim n*(" # (l-f)"" — 
Comparison of tees ies: (4.23) gives a recursive formula as 
[,od-py'dp =2F pea-py""ap (4.24) 


Repeating the process of integration by parts or applying (4.24) repeatedly, we 
obtain another form of the gamma function 


T(z) = lim n? ea aaa Bd py" dp] 


noo 


sien n(n—1)(n—2)...1 
noo 2(z+1)(z+2)...(z2+n-l) 


f, got (l = Vy ibe dp] (4.25) 
nn! 


= lim 
no Ce +1)(2+2)...(z+ ny 
In obtaining the last result in (4. = we have a the following obvious result: 


The result given in (4.25) as another definition of the gamma function in 
terms of a limit of the factorial function. The result given in (4.25) is the same as 
(4.3) as n > . This result will be used in the next section as an immediate step to 
show Weierstrass’s product definition. 


(4.26) 


4.2.3 Weierstrass’s Product Definition 


Euler also proposed another form of the gamma function in terms of infinite 
products as: 
foo) 


1 ee Zz —z/n 
7a [[{e+26 ! (4.27) 


n=l 


where y= 0.5772157 is the Euler constant. This definition was proposed by Euler 
in 1729 in his original letter to Goldbach, but many of the properties of this product 
definition were considered by Weierstrass. Indeed, Weierstrass preferred this 
definition to Euler’s integral definition given in (4.4). In the literature, it was 
referred as Weierstrass’s canonical form of definition (e.g., Whittaker and Watson, 
1927). 

To show that this definition is the same as the integral definition of Euler 
given in (4.4), we first recall the definition of Euler’s constant: 
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y= tin| Sti n| ~ 0.572157 (4.28) 


A review of the Euler constant is given in Appendix E at the back of this book. Let 
us rewrite the infinite product as a limit 


1 ; - ae ee 1 
= lim ze’”* 1+—)e*"" + =lim 4.29 
T(z) 13% II a noo (z) ( ) 
Rearranging this equation results in 
YZtzinn n wi wk i n Z4 
7, (z)=e™ il Gey e joe "TI ae) (4.30) 
k=l k=l 


The last part of this result is obtained by recognizing the following identity 


. Z Z fh Zz vA Z n 
ex = eX] Pie = eX) ex eX) = elk 
Dy Fe aa ) = exp(—T)exp(—5)--exp(-—) = T] 


Zz 
n k=1 


(4.31) 
In this equation, it is important to recognize that the exponential function of a sum 
equals the product of the individual exponential functions. 
To further simplify (4.30), we want to establish another identity 


Inn =[In2—Inl}+[In3—In2]+...+[Inn—In(n—1)]+ [Inn + 1) —Inn] + In(——) 


+1 
k 


=—Inl+Inn=Inn= ine )+In(—_) 
i n+l 
(4.32) 
Substitution of (4.32) into (4.30) gives 


n 
n 


1 
> zIn(1+—)+zIn(—_) _1 
X,(zyae Fm ‘a + zy} 


k=l 
n .T- 1+ Busy 
1+ -)? 1+— 
ntD IT @ I] rm 


_, n \zp7 th) k 
a lke Ge (4.33) 


=f 


n ., 2737..n? (n4+1)" n! 
n+l 128 ne? (z+1)(z4+2)...(z +n) 


a nn! 
(z4+1)(z+2)...(z+n) 
Substitution of (4.33) into (4.29) gives 


Z| 
M(@)= lim[,(2)]= liml ae = (4.34) 
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This is obviously the same as (4.25) obtained in the last section. Consequently, the 
equivalence of (4.27) and (4.4) is established through the immediate result of the 
factorial form derived in (4.25). 


4.2.4 Reflection Formula 


An important result of the gamma function is called the reflection formula and it is 
given as: 


T-2l(z)= (4.35) 


sin £Z 
This formula is sometimes known as the complement formula. It is a very powerful 
identity but its proof is not simple. 
To prove (4.35), we start the left side as 


T(z)r(-z) = [etal sl) 1eS ds = [, ie e 5-747 -ldsdt (4.36) 


In writing (4.36), we recognize that both ¢ and s are dummy variables of the definite 
integral of the gamma function. Now apply a change of variables of 


Perey ae es (4.37) 
S 


Using the second equation of (4.37), we can rewrite u as 


u=s(l+v), or s= (4.38) 
l+v 
Subsequently, t becomes 
gee = (4.39) 
l+v 


Now s and ¢ are completely in terms of u and v. The physical or geometrical 
interpretation of (4.37) can be depicted in Figure 4.2. Every radial line corresponds 
to a specific value of v, with v = 0 for horizontal (i.e., ¢ = 0) and v — 0 for vertical 
(1.e., s = 0). The new variable wu is the length measured along the radial lines for 
various values of v. Therefore, for v= 0 we have u = s and v > © we have u = ¢. In 
summary, we find 

s=—, t=sy-— (4.40) 

l+v l+v 

For the double integration in (4.36), we need to consider the Jacobian of the 
mapping [see (1.109) of Chapter 1]: 


O(t,8) 
(u,v) 
By using (4.38) and (4.39), we obtain the Jacobian as 


dtds = dudv (4.41) 
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a ot 
A(t.s) _Jau dv] _ at as _ at as 


O(u,v) |Os Os| Gudv vou 


Ou Ov 
Vv uy u vu . 1 (4.42) 
1+v (+y) l+v (l+vy 14+v 
1 —u 
= ———_[-uv-u(1+v)+uv] = —— 
(l+v)° (l+v) 
Increasing v 
vo 
t 
t=u = 
v= const. 
s=u v=0 s 
Figure 4.2 Mapping from s-¢ space to u-v space 
Substitution of (4.42) into (4.41) leads to 
je) i ae (4.43) 
O(u, v) (l+v)° 


In addition, it is straightforward to see that every point in the quarter plane s-t is 
uniquely defined by u and v. In particular, the ranges of these variables are 


s:050, t:050, 
(4.44) 


u=stt:050, v=t/s:0 30 


The ranges of the new variables are 0 <u < © and 0 < v < o, Finally, (4.36) can be 
expressed as 


WU 2 “t l+v 


ae 5 dudy (4.45) 


T@r—z) = [, [,e fae 


Expressing in terms of the new variables, using aa we have (4.45) becoming 
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I(2)rd-z)= [, [e -uyz-1 dudv = [eta ic lng 


l+v l+v (4.46) 
= a 
0 l+v sin zr 


The last result of (4.46) is a result of Titchmarsh’s contour integral derived in 
Section 1.7.7 in Chapter 1. The above proof was outlined in Lebedev (1972) 
without giving detail. A simpler proof by Euler of (4.35) is given in Problem 4.58 
using infinite product form of sine function. This formula is also found useful in 
evaluating certain Mellin transform (see Problem 11.6 in Chapter 11). The 
reflection formula is closely related to the singularity of the gamma function. Note 
from the reflection formula that there are poles at z = 0, -1, —2, ... The poles 
coincide with all the zeros of the periodic sine function (with a period of 2). For 
example, from the recursive formula given in (4.7) we have 


T(z) = nis =) (4.47) 
For z = 0, we have 
1 

T(0) = a 00 (4.48) 

Thus, I'(z) is singular at z= 0. We can extend this idea further for z = —n 

en | 
—n (—n) (-n +1) 

(4.49) 


2A sep a Z 
- a) T(-n+2)= T(0)> 


The last result is a consequence of repeated itea of the recursive formula 
given in (4.7). From this, we can observe that there is a simple pole at z= 0, —1, —2, 
... For complex z, the residue at the pole z = 0 is: 
ResI(z) = lim zI(z) = limI(z+) =T() =1 (4.50) 
20 z>0 z-0 


The residue of the negative integer is 


Res T(z)= Jina (z+n)P(z) = = bast ro TG — n)= lim re —n+l) 
>06-n 


a ¢ 
i Peano (4.51) 
- OW" time 1(g) = 


Alternatively, we can noe the dace of the gamma function as 
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0 1 ca) 
T(z) =[ er a =| et tar+ | e't ‘dt 
0 0 1 
= fears [VO ay (4.52) 
1 om nl , 
° tye S a ae 
= dt + 
Ire oor Z+n 


In obtaining (4.52), we have eed the power series of the exponential 
function inside the integration. The residue is clearly the same as that found in 
(4.51). The first integral is analytic or finite whereas the second summation 
contains all the poles at n = 0, —-1, —2,... This is called the Mittag-Leffler expansion 
of the gamma function and is also known as Prym’s decomposition. Mittag-Leffler 
played a crucial role in the final decision of “not” including mathematics in the 
Nobel Prize. This story is reported in the bibliography section. This singularity of 
the gamma function leads to the search for other factorial functions that contain no 
singularity. One such choice is called the Hadamard factorial function and will be 
examined in more detail in Section 4.3. 


4.2.5 Recurrence Formula 


The recurrence formula is given by 

(z+) =2zT() (4.53) 
This is essentially the same as (4.7) for integer argument. This can be proved easily 
by integration by parts and L’Hopital’s rule. 


nese | oeiaoeerh 


+ | t’ edt 
0 


m 222 I) 
Les e! 
= zI(z) 
We have applied L’H6pital’s rule m times with m > z. When z = n is an integer, we 
have the factorial function [(m+1)=n! as shown in Example 4.1. This is the most 
basic property and explains why the gamma function is an extension of the factorial 
function to the non-integer case. 


LO4e I "Feat (4.54) 
0 


4.2.6 Legendre Duplication Formula 
The following duplication eee was derived by a 
T'(2z) = = 27? F(a) (z+ (4.55 
(2z) = = (2)'( > ) 


To prove this, we consider the following term outa in the right hand side of 
(4.55): 
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p=2"'P(Qr(e +) <2 [ete taf ets? as 
2 2 a (4.56) 
_ aa | | et) p21 52-2 ty 
0 20 


Applying a change of variables as 


a= s, B=vt (4.57) 
and taking the differential on both sides, we obtain 
jo ie ae (4.58) 


2Vs avi 


With these new variables, the double integration becomes 


=P [Pee 52292 4apdad B 
wae (4.59) 
2 Po 2 2 2 
= 22s } e@ +f (ap) adad B 
0 Jo 
On the other hand, we can independently apply another change of variables as 
a=vt, B=vs (4.60) 


Following a similar procedure, it is straightforward to see that 


Ps oa WO ea ae 4aBdad B 
0 Jo 


(4.61) 
= 22s } oP) (ag) Bdad B 
0 Jo 
Addition of these two independent results given in (4.59) and (4.61) leads to 
2p =21[ "|e (apy (a+ B)dadB (4.62) 
0 Jo 


Because the integrand is symmetric with respect to both @ and /, we have the 
integration as symmetric with respect to a = B as shown in Figure 4.3. In particular, 
we can express the double integral for the first quadrant (i.c.,0<a<a«and0<B< 
oo) as twice the shaded area J2 shown in Figure 4.3 (1.e., 0< a<B, and 0 < B <): 


T= I i } “e@ +P) a8) 7"(a+ B)dad B 
ie (4.63) 
28 (a2+p?) 22-1 
=4 | } e (2ap)?"(a+ B)dad B = 41, 
0 Jo 
Note that /, which is symmetric with respect to q@and f, can be expressed in terms 
of J2, which is shown in Figure 4.3. Apply another change of variables 
u=a°+ PB’, v=2aB (4.64) 
Thus, we find that 
u+v=(at+By (4.65) 
Inversely, we can rewrite (4.65) as 


Pa on ee (4.66) 
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The domain integration can be expressed in terms of the Jacobian of the mapping 
function as 


O(u,v) 


dudv = ——— dad B (4.67) 
0(@, B) 
Or, inversely, it can be expressed as 
dad B= dudv 4.68 
Ha.) 
Substitution of (4.64) into (4.67) gives 
au ow 
O(u, 0a oO 2a 2 
(u,v) _ B\_ B = 402 — B) 
0(a,B) |Ov Ov) |2B 2a (4.69) 


da Op 


= A(a + Ba B) = 4Ju+vdu-v 
The domain of integration for 
a:0>£f, £B:0>50, (4.70) 
is converted to: 
u:0>0, v:0>0 (4.71) 


Thus, the integral J becomes 


fe 4 [ve “di yt Fer ieee dudyv 
4Jutvau-v 


=|, [ew ot (4.72) 


—(u-v) 

oi ew [PC d(u-v 

=( el =v) 4, 
0 0 u-v 


Figure 4.3 The domain of symmetry and the physical domain for 1 


Clearly, a change of variables for the inner integral can be used: 
u-v=w’, d(u—v) = 2wdw (4.73) 


Series Solution of Second Order ODEs 239 


The inner integral is thus integrable as 


0 oR) a7 xo 
{ e d(u-v) =| ae 
0 w 


0 u-v 
=2["e" awa 
0 


The last result is obtained by using the Laplace or Gauss integral considered in 
Chapter 1. With the result obtained in (4.74), the remaining integration in (4.72) is 
precisely I'(2z). Finally, we have derived. 


aT (2z) = 277" T(z)P(z+ , (4.75) 


(4.74) 


Dividing through by 2!”, we finally obtain the duplication formula obtained by 
Legendre. A more general multiplication formula was derived by Gauss 
n-1 


T(nz) =(22)0? n™ 7 T(2)T(z4 rise jake ) (4.76) 
n n n 


For n = 2, the Legendre duplication formula is recovered. The proof of this 
formula is more advanced as it involves the Stirling formula to be discussed in 
Section 4.2.8, and thus the proof of the Gauss multiplication formula will be 
postponed until Problem 4.28. 


Example 4.4 Prove the following identity 
1 x x 
T(-->T(-= 
a 


aoe 


Solution: Set z = —x/2 in Legendre’s duplication formula given in (4.55) to get 


r(i-x)= 


(4.77) 


1 —y-] Neco > 26 
—xT(-x) = -—= 2x2 T T —)=T(1 4.78 
xT(-x) i ar aa a (4.78) 
Rearranging (4.78), we obtain 
Kael, 


T(1-x) 


At IG-D (4.79) 


This is the required result. This result will be used later in deriving the recursive 
formula of Hadamard factorial function in Section 4.3. 


4.2.7 Digamma or Psi Function 


Literally, digamma function means differentiation of the gamma function. It is 
defined as the differentiation of logarithm of gamma function, instead of the 
gamma function itself: 

I"(z) 
I'(z) 
A plot of the digamma function is given in Figure 4.4. Because the normal symbol 
for the digamma function is the Greek letter psi or y, it is also sometimes called the 


y(z)= d InT'(z) = (4.80) 
dz 


240 Theory of Differential Equations in Engineering and Mechanics 


psi function. The digamma function is a special function in its own right. There are 
a lot properties associated with the digamma function. It will be shown in Section 
4.6.2 on the Bessel equation that the digamma function appears naturally in the 
definition of the Bessel function of the second kind or Y(z). 

Some essential properties of the digamma function are reported here. First, 
the recurrence formula of the digamma function is 


1 
y(zt+l) =—+w(z) (4.81) 
Zz 
This can be proved easily based on the recurrence formula of the gamma function: 
Td+z)=2zI(z) (4.82) 
Differentiation of (4.82) with respect to z gives 
(+z) =T(z)+20'(z) (4.83) 


where I"(1+z) means the differentiation of [(1+z) with respect to the argument (i.e., 
1+z in this case). Dividing the whole expression by I'(1+z), we get 
Piiez) Tt) . zI'(z) 


(4.84) 
Trd+z) Yd+z) Td+z) 
Recalling that [(1+z) = zI'(z), we can further simplify (4.84) as 
(+z) 1 ©) (4.85) 
Td+z) z T@) 
By virtue of the definition given in (4.80), we immediately obtain 
1 
y(z+l)=—+y(z) (4.86) 
Zz 


This completes the proof. 


Figure 4.4 Plots of digamma function 
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Another important identity is the duplication formula of the digamma 
function (Formula 6.3.8 of Abramowitz and Stegun, 1964) 


y(2z) = swle)+swer5)+In2 (4.87) 
To prove (4.87), we start with the Legendre duplication formula: 
1 j22-1/2 1 
T(2z) = —=2 I(2r(z+-—) (4.88 
V2n 2 ! 


Differentiation of (4.88) with respect to z gives 


21r'(2z) = = {(In 2)2?7 7? T(z) (z+ > 


V2n (4.89) 
42D (2 + , 42727 )P(2+ >} 


Dividing through the whole expression by I(2z) and recalling the duplication 
formula in (4.88) again, we obtain 


"> 2z-1/2 ' ’ 
5P'Qz) _ N2a 2 ji Gig gro, (4.90) 
TQ2) «e277 T(z) T(z+1/2) 
Using the definition of the digamma function, we finally get 
2y(2z) =2In2+w(z)+w(z4+1/2) (4.91) 


This completes the proof of (4.87). 
The reflection formula of the digamma function is (Formula 8.365.8 of 
Gradshteyn and Ryzhik, 1980; (1.3.4) of Lebedev, 1972) 
yw(-z)=w(z)+zcotzz (4.92) 
To prove this reflection formula, we start with the reflection formula of the gamma 
function (see (4.35)) 
1 


T(z)rd-z)=— (4.93) 
sin z7 
Differentiation of both sides with respect to z gives 
t 
# g@rd-2}=Tord-2+P(ord-2) =-7? ZZ (4.94) 
dz sin 27 
The last of (4.94) can be rearranged to get 
' = ' t 
Herd=s7-2 2 FO ee (4.95) 
Td-z) T(z) sin za 
Reapplying the reflection formula in (4.93) to the left hand side, we obtain 
1 cot zz 
—— {-(l-z)+y(z)} =a ——— (4.96) 
sin z7 sin z7 
Canceling the common factor on both sides, we arrive at 
{-y(1-z)+w(z)} =-acot za (4.97) 


This of course agrees with (4.92). 
Some special formulas for the digamma function are given here without 
proof: 
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InPe)= | w@yag (4.98) 
Dirichlet formula 
1 
~dx 4.99 
ack le oa wa 
Gauss formula 
el er e™ 
w(z) |< a (4.100) 
1 wv 
w(z) =-y Be ees (4.101) 
y(2)=-7+(2 Denes (4.102) 
2 
y(nt+ Jar 22+ a (4.103) 
y(l)=-y (4.104) 


Another important formula of the digamma function y will be given in (4.301) and 
will be proved there. 


Example 4.5 Prove the following identity 
1 
Wis t2)=WG-2)+atan az (4.105) 


Solution: We start with the following identity for the gamma function (see proof in 
Problem 4.1): 


1 1 
T(—+2r(—--2)=— (4.106) 
2 2 COS 7Z 
Taking the natural logarithm of (4.106) gives 
In (G+ 2)+InT(5-z)=Ina—Incos a2 (4.107) 
Differentiation with respect to z gives 
W5*2) WG z)=atanzz (4.108) 


Rearranging this gives the identity given in (4.105). 


4.2.8 Stirling Formula 


For a large argument, asymptotic expansion of the gamma function was derived by 
Stirling in 1730, which is given by 
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nl~ J2nn™2e™ (4.109) 
for n + ©. To derive this formula, we recall for x > 0 that 


T(x41)= I “Petdt (4.110) 
0 


We can introduce a new variable as 
t=sx, dt=xds (4.111) 
Substitution of (4.111) into (4.110) gives 


T(x+]l)= I, (sx)"e™"xds aqttl i s%e-"ds 
es 2 (4.112) 
= 7", er(-stlns) 75 = |. exp[x(@(s)]ds 


where 
¢(s) =—-s+Ins (4.113) 


We can examine the asymptotic behavior of the integral for large values of x (1.e., x 
— o) by a method similar to the Laplace method for the case of real gs). This 
method is closely related to the so-called Riemann method of steepest descents (or 
Debye’s Saddle point method) if &(s) is complex, or the Stoke’s and Kelvin’s method 
of stationary phase if &s) is purely imaginary (Erdelyi, 1956). More details on these 
methods can be found in Bleistein and Handelsman (1986) and Chapter 12. 

First, we need to find the maximum or critical point of the function @(s) by 
considering: 


$(s)=-1+—=0 (4.114) 
S 


We now consider a Taylor series expansion of ¢(s) around the critical point s = 1 (the 
solution of s in (4.114)) as 


Hs) = A) +(6- DP + (0-1? 8" + 
; (4.115) 
= -1-5(s-1) +36-)" oe 


Note that 
?(=0, o'(=-1<0 (4.116) 
and this confirms that the critical point is indeed a maximum. Now we apply another 
round of change of variables to (4.112) as: 
u=s-—l, du=ds (4.117) 
a) 2 © 2 
Txt) =x"! } exp[x(-1— = $..)]du wx te } exp(- sd (4.118) 
—1 =1 
Since we are expanding «(s) about s = | or wu — 0, all higher order terms can be 
neglected as a first approximation. In addition, we are interested in the asymptotic 


case that x — 0, and the main contribution to the integration comes from the region 
around the maximum value of function @(s) around s = 1. Let us consider another 


change of variables: 
2 
2 2 
May, du=ady= Py (4.119) 
XU xX 
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Thus, the integral in (4.118) becomes: 
2 
® xu 28 2 
exp(———)du = =| exp(—y" )d 4.120 
[ieee yau = FJ" exp? yay (4.120) 


This integral can readily be integrated if we recall from Abramowitz and Stegun 
(1964) that an error function can be defined as 


= } “exp(-t2)dt =1—erf(z), erf (-z) =—erf (2) (4.121) 
WT 4 


Finally, we have 
Pe 2 
| exp(— du = Za serfs (4.122) 
“1 2 2x 2 


Substitution of (4.122) into (4.118) gives 


Txt) xe “E (+erf(vx/2)} (4.123) 


As x— oo, an error function can be approximated by (Abramowitz and Stegun, 
1964) 


erf (00) = rah — |" exp(—t?)dt = <7 =] (4.124) 
In obtaining (4.124), we have used the Laplace or Gauss integral discussed in 
Chapter 1. Finally, we obtain the following approximation 


T(x +1) = J20x"7e% (4.125) 


When x = n, we arrive at the Stirling formula given in (4.109). In fact, the Stirling 
formula can also be proved using the Laplace method but the proof is quite lengthy 
and will not be presented here. 

If more terms are retained in the Taylor series expansion given in (4.115), 
more terms in the asymptotic expansion can be obtained as : 


T(z+l)* Onze 7 + u + : : We ; oH) r 
12z 288z° 51840z° 2488320z (4.126) 
163879 * 5246819 534703531 


209018880z° 75246796800z° 902961561600z’ 
This eight-term series was obtained by Stirling in 1730. Another completely 
different form of asymptotic expansion of the gamma function was obtained by the 
Indian prodigy Ramanujan 


nix iz)" Bn 4n?+n+— (4.127) 
e 30 


This is completely different the from Stirling formula given in (4.126). However, if 
we retain only the first term inside the square root of (4.127), we obtain precisely 
the first term of (4.126) or (4.109). The biography of Ramanujan is given at the 
back of this book. 
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4.2.9 Important Formulas 


To conclude our introduction to Euler’s gamma function, some important identities 
of the gamma function are reported here without proof: 


12. 22 

Pr “3B (4.128) 

T(z) =(-1)" I(-z+n+l) (4.129) 

T(z-n) T(-z +1) 

Reer) 1" T(z+1) (4.130) 
I(-z) T(z-n+l) 

nm) = 2.6789385347... (4.131) 

T(z) = p? (eee (4.132) 


1 
ae == im 2 [Te3| (4.133) 


Euler’s interpolation formula 


pai ae (4.134) 
n> n'T(n) 
Wendel limit 
fe | eee 4 (4.135) 
no I(x+)) 
Legendre formula 
14271 =| 
w(z) = -7+[ dt (4.136) 
0 t-l 
Binet’s first formula 
1 | 1 le™ 
InP(x) =(x——)Inx—x+In J22 | | dt (4.137 
@=G-5) Fat ppt 4139 
Binet’s second formula 
1 » tan! (¢/ x) 
ad came x+InJ2z +2/ ~ pd 4 (4.138) 
0 eo = 
Mittag-Leffler expansion (this has been proved in (4.52)) 
( 1)" +f —t,z-1 
T et dt 4.139 
@)= a z) di ( ) 


In recent years, some physical problems involve the study of complex arguments of 
the gamma function. However, I am not aware of its application in engineering and 
mechanics yet. Nevertheless, as a general introduction, some cases of complex 
argument are reported here without proof (in fact, their proofs are straightforward). 
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If the argument of the gamma function is complex, we have the following 
identities 


T+iy) =i (iy) (4.140) 
Ty) Piy) = ee (4.141) 
ce wT iy) = aa (4.142) 
rd +iy)F0-iy) = — oe (4.143) 
Re{InT(iy)} = aS (4.144) 


See Problems 4.20, 4.21, and 4.24 for two more formulas for the gamma function 
with a complex argument. There are also special functions which are closely related 
to the gamma function, including the incomplete gamma function (e.g., Lebedev, 
1972), multiple gamma function, Barnes G-function (or double gamma function), 
and beta function. But these are outside the scope of the present book. Thus, the 
studies of the gamma function are by no means simple. 


4.33 HADAMARD FACTORIAL FUNCTION 


One major problem with the definition of the gamma function by Euler is that it 
possesses an infinite number of singularities at n = 0, -1, —2, —3, ... In fact, there is 
more than one way to define the factorial function that coincides with the factorial 
for the limiting case of the integer. An interesting choice was proposed in 1894 by 
French mathematician J. Hadamard (Davis, 1959) and the following definition of 
the Hadamard factorial function was reported in Question 46 of Chapter 12 of 
Whittaker and Watson (1927) (without mentioning the name of Hadamard): 


H(x)=VxV sAud as (4.145) 
dx dx 
where U and V are defined as: 
gxl2 gxl2 
rd a T( > a 


To the best of my knowledge, Hadamard’s factorial function H(x) has not been 
covered in any textbook. Some of the identities to be covered in this section are 
new and have not been published before. 

First, we will show the following recursive formula of the Hadamard factorial 
function 


A(x+1)=xH(x)+ 


ee (4.147) 


Note the definition of the digamma function and the following result 
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d nxvi2 _ (1 x/2 
—2** =(—In2)2 4.148 
- . n2) ( ) 
We obtain 
x/2 
2 ayy (4.149) 
dx T=) 2 
2 
x/2 
Wn eye (4.150) 
dx 1 x 2 2 
re->) 
2 2 
Substitution of (4.149) and (4.150) into (4.145) results in 
m2* x I 
H(x)= at i -D)-¥G- (4.151) 
2Td-—)I'(—-= 
( 7 G a 
Replacing x by x+1, we obtain 
moet x+1 1 x41 
H(x+1)-—_Y"_iyg VG 
2r(1 ( ) 
a @ 2 (4.152) 
Vn2* 1 x x 
ST an ao ee 
T(—--~)r(-= 
C ( y 
On the other hand, 
ne % 1% 
stt(x) = —72""_tya VG 
ard->r(e->) 
i; (4.153) 
Jn 2* 


2 x 1 x 
= eo a gg Se 
r-r(- - = 
( a ( 5 a 
We have used the following identities in arriving at the last equation of (4.153) 
x x 


Xx xX 
rd oS as 5» y(l x Pa i (4.154) 


Next, we consider 
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— Vn2* x 1 x 1 
ee ae rr -¥ * ¥ODtVG 2 * Fa» 

a 2 2 

1 Vn2* x 1 x 1 
9 prt y( eG Taw 
-—F _yS syd) 
ee lee 
= H(x+1) 

(4.155) 


In obtaining the second line of (4.155), we have applied the result of Example 4.4. 
This is the required result. 
Consider the special case that x > n and (4.155) becomes 


BPD Sa (4.156) 
For integer n ( = 1,2,...), the gamma function I'(1—n) is unbounded or 
: >0, A =1,2,3,... (4.157) 
Td-4) 
Therefore, for positive integers we must have 
A(n+l) =nH(n)=n(n-1)---1- A) =F +1) (4.158) 


where H(1) = 1 (see result of Problem 4.29). Clearly, the Hadamard factorial 
function is another choice that extends the factorial function to the case of the non- 
integer argument. 

One major problem of Euler’s gamma function is that the gamma function 
becomes unbounded at zero and all negative integers. We will examine whether the 
Hadamard factorial function will be finite-valued. Note first that the Hadamard 
factorial function given in (4.151) can be rewritten as 


x 1 x 
A(x)= vn2 2 ; amy P (4.159) 
rd- rs = oy ril= Pe = a 


The only problems clearly arises from gamma and digamma functions. We know 
that both gamma and digamma functions are finite except at 0, —1, —2, ... (all 
negative integers). Thus, we only need to investigate the behavior of H(x) at those 
arguments where /’and y becomes unbounded. 


Note that 
1 x 1 x 
i =5) Sw, >o, x=1,3,5,7,... 4.160 
Co oe (4.160) 
Pl-) >=, W- 3) >, x = 2,4,6,8.... (4.161) 


Therefore, we only need to study the behavior of the following terms: 
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1 x 
1 x Ve a 
Mia =) = s Hor = 1,3,5,7 0 (4.162) 
2 2 1 x 
r->) 
2 2 
and 
x 
x ute") 
NM (1-=) =, ;. for x=2,4,6,8.... (4.163) 
2 x 
ra-7) 


Both y; and x2 have the indeterminate form of 00/co as x approaches odd and even 
integers respectively, but we will show that both of them have a finite limit as x 
approaches an integer. Consider the reflection formula for the gamma function 
given in (4.35) with z = x/2+1/2; we have 


ee we: a 
Mg a 


a (4.164) 
T(—+ = sin 7x 
2 2 
Similarly, using the reflection formula for the digamma function given in (4.92) 
with z = x/2+1/2, we obtain 
1 x 1 x 1 x 

ae a y{l (a) og ee (4.165) 

Substitution of (4.164) and (4.165) into (4.162) yields 


1 x 1 ox. 
—-—) T(_+—)sinzx 
3 LG _ C 5 


2 12. lL & ya 
re 
G ry, 


1 1 x 1 x 1 x 
a +—)P(—+—)sin zx + cos zxI(— + — 
aa - G > . >? 


nC [V5 +5)+xcot x3] 


(4.166) 


For odd integers, we have the following limiting value 
1 x 
ae ams 
lim y,(x)= lim {—*+—+—;=(-1)°""T(1+m) (4.167) 
x>2m+1 x—>2m+1 1 x 
r(5-2) 
This is finite for all m = 0,1,2,... Similarly, we find 


x 
Ors) _ ] 


x a 
r=) 


Ml >) = {we +00 afin mxT(~) 


(4.168) 


1 x x x 
= —y(—)sinzxI (—)+ cos zxI'(— 
oe x . oe 
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r w->) 
lim y,(1-—)= lim = (-1)°"T(m) =T(n) (4.169) 
x>2m 2 x>2m x 
ra= 


Thus, the Hadamard factorial function is finite for all values of x. More 
specifically, the Hadamard factorial function of the integer argument becomes 


Vx2?"! T+m) 


A(x)= x=2m+1 
2 1 
I(—-™m) 
2 (4.170) 
Vr27" T(n) 
> x=2n 
2 Td/2-n) 


where m = 0,1,2,... and n = 1,2,... It can be shown that the Hadamard factorial 
function can also be defined in the following form (Davis, 1959): 


l-x 
1 a. [PS 


A(x)= (4.171) 
Td-x) dx rd-3) 
To prove this, consider the differentiation term on the right of (4.171) as 
1-x 
i) S 
Cp A ECL ed om 9 
dx x dx 2 2 
Tl = (4.172) 
1 l-x x 
= 1 
5 \w qe > 
By noting that 
Td.- x) =-2sT'(-x) (4.173) 
we have 
1-x 
i) 
1 d 1 1-x x 
In, —2— = vi—)-wd->)p (4.174) 
T(1-x) dx ra=%) 2xI(-x) 2 2 
2 


Next, we observe that for z = —x/2, the Legendre duplication formula given in 
(4.35) becomes 


T(-x) = eeere Ore = > (4.175) 


Substitution of (4.175) into (4.174) yields the RHS of (4.174) as 


RHS = = - we vil 5) (4.176) 
xI(- aS ~ > 


On the other hand, Legendre’s duplication formula gives 
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x x x 
Td-—) =-—T(-— 4.177 
( >? 5 ( 7 ( ) 
Using this result, 

2 Ix 
Kl. 
2r(1-—)I'(—-= 
( ay . 5 


This proves (4.171), which is given in both Whittaker and Watson (1927) and 
Davis (1959). 

The intermediate result of (4.174) gives a slightly different definition for the 
Hadamard factorial function 


1 1-x x 
HG) = — |W a VM =} (4.179) 


Therefore, H(x) can be defined equivalently by (4.151), (4.159), (4.171), and 
(4.179). 


RHS = 


{va *) WD}. (4.178) 
) 


4.3.1 Recurrence Formula 


The recurrence formula for the Hadamard factorial function can be found in Alzer 
(2009): 


1 
A(x+1) = xH(x) +———_ 4.180 
(r+) =H) + (4.180) 
This recurrence formula has been proved in the last section (see (4.147)). We have 
used this result to see why it leads to the factorial function type for the integer 
argument (see (4.158)). 


4.3.2 Reflection Formula 
Before we consider the reflection formula for the Hadamard factorial function, we 
first report the following Luschny formula for the Hadamard gamma function 
(Luschny, 2006; Alzer, 2009) 
sinzazx, x x 1 
A(x)=T 1+ + 4.181 
(x) oo = WG) WA | ( ) 


The proof of this identity is given in the next example. 


Example 4.6 Prove the Luschny formula for the Hadamard gamma function 


siInax. x x. 1 
A(x) =T 1+ + 4.182 
(x) oo = Iv) LAG | (4.182) 
Solution: First, the reflection formula for the gamma function can be written as 
Sin 7X _ 1 (4.183) 


w T(x)Pd-~x) 
Substitution of (4.183) into the right hand side of (4.182) gives 
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x para | 
RAS =T(x)51+ + 4.184 
oF TET GAA »| (4.184) 
The reflection formula for the digamma function gives 
x x 1 
=) =y(1—-—)—-———_ 4.185 
W) a 2) tan(zx / 2) ¢ ) 
Recall from the result of Problem 4.2 that 
Ws +2)=WG-2)+atan az (4.186) 


Putting z = x/2 into (4.186) gives 
1 x 1 x Xx 
—+—)=y(—-—)+27 tan(— 4.187 
y( a y( F a ( ) ( ) 
Subtracting (4.187) from (1.185) gives 


x x 1 x 1 x 1 1X 
WC) van en? 
an(—) 
2 
x 1 x 1 2, AX 
y(1 >) Ws a es {1+tan } (4.188) 
an(—) 
2 
x 1 x 2a 
“V-)-VG- 9) Snax 
Putting this result into (4.184), we obtain 
1 x 1 x a 
f= 2T(1-x) lve ? WG 7)! T(1—x)sin zx 
i 7 és (4.189) 
~ ray Mi Pees) 


By virtue of the reflection formula of the gamma function given in (4.93), we get 
the final result of (4.182). 


We now return to consider the reflection formula. First, we rewrite the Luschny 
formula as 


A(x) =T(x)P(x) (4.190) 

sin 7x x, x 1 x 
P(x)=1-g(x)—_, sg) =slvGt+7)-vO)! (4.191) 

1 22. 2 2 

Next, we consider the Hadamard function using (4.171) 
1-1+ 
i Pe) i ew.) Oo 
H(1-x) =———_— In an Ps a —lIn ones (4.192) 
Td-1+.x) dx r= 5) T(x) dx r+) 
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Multiplying (4.190) and (4.192), we have 


1a, | TD |a, nS) 


A(x)H(1-x)= 


T(x) -x) dx 1 x,| dx x 
a + - T(1 a (4.193) 

_ sin zx x 1-x x 1 x 

an vd > yv( 5 lw) v5) 


The final expression of (4.193) results from differentiation as well as using the 
reflection formula (4.35) for the gamma function. We note that 
(1-x) l-x 1 1-x (1-x) x 1-x 
1-x)= + = 1 4.194 
g(1-x) q 7 5 > ws aS (4.194) 
Now, it is clear that the right hand side of (4.193) can be expressed in term of g(x) 
defined in (4.191) as 


H(x)H(\-x) =-——"* 9(x) g(1- x) (4.195) 
mx(1—x) 
We call this the reflection formula for the Hadamard factorial function; it is a new 
formula for the Hadamard function that has not been reported before. This formula 
bears similarity with the reflection formula of Euler’s gamma function given in 
(4.35). 


4.4 HARMONIC EQUATION 


With our basic knowledge of the gamma function and factorial function, we return 
to series solution of second order ODEs. First, we consider the following second 
order ODE 


y"+a(x)y' + B(x)y =0 (4.196) 
where a(x) and B(x) are analytic functions of x and thus can be expanded in a 
Taylor series expansion. That is, they are single-valued and possess derivatives of 
all orders. Thus, the solution of y exists in series form around a regular point a 


y= >16,(e-ay (4.197) 
r=0 


The term “regular point” was coined by German mathematician Thomé, who was a 
student of Weierstrass, in 1873. We will start the series solution technique for the 
simplest case, that is, the harmonic equation with the solution being sine and 
cosine. We have introduced the following differential equation (called the 
harmonic equation) in Chapter 1: 


HJ paG (4.198) 


y= Yar (4.199) 
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ioe) 


y= yer ex (4.200) 
r=0 
The harmonic equation (4.198) requires 
Yr =lex + Yiex" =0 (4.201) 
r=0 r=0 


The index of the infinite series is then shifted by r = k—2 and consequently (4.201) 
becomes 
YD +E +Dep yx" + Deg" =0 (4.202) 
k=-2 k=0 
However, the first two terms of the first series are evidently zero, so (4.202) can be 
simplified as 


(k+2N(k+Ne;,5 +e, =0 (4.203) 
Thus, 
Ch 
2 4.204 
M2 (k+2(k+1) ( ) 


Considering the even terms (i.e., A+2 = 2m), the recurrence formula becomes 
Cy, =e (4.205) 
(2m—1)(2m) 
We can reapply the recurrence formula (4.205) to the coefficient c2-2 on the right 
of (4.205) to get 


= (-1)" Coma 

(2m —1)(2m —3)(2m)(2m — 2) 
Repeat the application of the recurrence formula (4.205) m times and the following 
formula is obtained 


(4.206) 


Com 


=] m 
ee cee (4.207) 
(2m)! 
For odd terms in the series solution, we set k+2 = 2m+1 in (4.205) 
C2m-1 
c =—- aa 4.208 
2m+1 (2m 4 1)(2m) ( ) 
Similarly, reapplying (4.208) m times, we get 
(- 1)” CQ 
Cc = ———. 4.209 
2m+l (2m + 1)! ( ) 


Back substitution of these coefficients into the series solution, we get 
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7 (- 1)” Co 2m 7 (- 1)” Cy m+ 
= x +) ——-—_ x 
_* 2 (2m+1)! 


2 4 3 5 
no}i-EaSenpeg |e Late.| (4.210) 
2! 4! 3! 5! 


= Cy COSX +c, SINX 
This result, of course, agrees with the conclusion that we got in Chapter | that the 
general solution for the harmonic equation is sine and cosine. 


4.5 FUCHSIAN ODE WITH REGULAR SINGULAR POINTS 


Before we consider the Bessel equation, let us consider the series solution of a 
more general second order ordinary differential equation 


(x-a)’ y" +(x-a)a(x)y’ + B(x)y =0 (4.211) 

where a(x) and B(x) are analytic functions of x and thus can be expanded in a 
Taylor series expansion. Rearranging (4.211), we get 

pi Dp Peg (4.212) 

(x-a) (x-a) 

This type of differential equation is called a Fuchsian type ODE. Clearly, this 

differential equation is singular at x = a. However, if the singularity does not 

concur with (4.212), the technique discussed in this chapter does not apply. The 

point x = a is called a regular singular point, and this term was coined by Thomé in 

1873, as remarked earlier. For such equations, a standard procedure of series 

solution exists, which is called the Frobenius series. 
By the assumption of a(x) and B(x) being analytic, we can expand them in 


a(x)= > p,(x-a)", BO) = )og.(x-a)" (4.213) 
r=0 r=0 
And the series solution is assumed in the following form with an extra index pas 
y(x) = ae (x-a)?*” (4.214) 
r=0 


This is called the Frobenius series. This extra index p allows us to satisfy the 
governing equation, and thus it depends on the type of ODE. 


ae (p+ry(ptr—-I(x-ajP" + yp, (= a Sc, (p+ry(x—a)r™ 
r=0 s=0 r=0 (4.215) 


Sasa Sear <0 
s=0 r=0 


Collecting the coefficient for (x—a)? and we find the indicial equation: 
p° +(P) lp +d =0 (4.216) 
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if co # 0. Clearly, there are two roots for the index p, namely p; and p2. The term 
indicial equation was coined by Cayley. Collecting the coefficients for (x-a)?” 
leads to 


¢,(o+ro+r-1+ > [p,(o+r—s)+9,]e,_, =0 (4.217) 
s=0 
Grouping the first term in the infinite sum with the first term of (4.217), we obtain 


clips (ptr-1+ po(ptr)+qol+ D Lp. (ptr-D+qcle-s=0 (4.218) 
s=l 
This recurrence formula allows us to determine the coefficient c,. 
In general, there are two independent solutions for the second order ODE 
(4.212) 


n(x) = Doe (e-ayrr (4.219) 
r=0 

y(x)= De (e-ay” (4.220) 
r=0 


where c, is obtained from (4.218) with p = pi and c,* is obtained from (4.218) with 
P = p2 with co # 0 and co’ # 0. However, special consideration is needed when the 
roots for p are not distinct or when their difference equals an integer. In summary, 
there are three scenarios depending on the roots of p. 


Case 1: p,-—p~, #0, and p,—p, # integer 


y(x)= Ye (x-a)""", y(x)= > 2G Agee (4.221) 
r=0 r=0 
with co # 0 and co’ #0. 
Case 2: —, = p, 
n@)= > ¢,@-ay” (4.222) 
r=0 
V(x) = y, In(x-—a)+(x-a)” Yb, (x-a)’ (4.223) 
r= 


Case 3: p, = P, —n (n = integer) 


yj) = bee =a” (4.224) 
r=0 
yo(x) = gy In(x—a) + (x-a)” )°b, (x=)! (4.225) 
r=0 


where g;, is the coefficient of x, of the expansion of the following term 
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n+l 


tyra)’ 
The proof of this result was outlined in Section 10.15 of Copson (1935) and will be 
proved in detail here. Note that a similar result was given in Sneddon (1956) but 
there are various typos and the results are given without proof. 

If pi-p2 = n and n is an integer or zero, then for n # 0, we have case 3, 
otherwise it is case 2. Therefore, cases 3 and 2 can be considered simultaneously. It 
is a well-known theorem in differential equations that if a solution of an n-th order 
differential equation is known, we can depress the order of the differential equation 
to n—1 (e.g., Forsyth, 1956). Using this idea, we first rewrite (4.212) as 


exp J; nena (4.226) 


y"+ p(x)y'+q(x)y =0 (4.227) 
Then, we define a new unknown v as: 
y= y,@@)v(x) (4.228) 


where yi(x) is the known solution for exponent p; and the purpose is to find the 
unknown function v(x). Then, we have 


d*y Us toy m" 
a =), vt+2yvt+yyv (4.230) 


Substitution of these results into (4.227) gives 


(vy + py tay )vt yw" + (2y/ + py’ =0 (4.231) 
The first term in the bracket is clearly zero since yi(x) is the solution of (4.227). 
Thus, we have 


y"+(p422 jy’ =0 (4.232) 
JI 
Clearly, from Section 3.5.8 we can let 
u=v' (4.233) 
such that 
u'+(p+22 ju =0 (4.234) 
aa 
This first order ODE can be solved exactly because it is separable: 
Glee Ge ese ae (4.235) 
u JI 
Inu =—[(p+2*)dx+C, (4.236) 
JI 
The final solution is 
u = Bexp[- | (p+22)dx] (4.237) 
mal 


Let us simplify the integrand as 
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-[tv+24bya = -[ var-2f as 
7 -| pdx —2\ny, (4.238) 


= | pdx+ Inf} 
y, 
Back substituting of (4.237) and (4.238) into (4.233) and conducting integration 
with respect to the argument, we obtain 


W(x) = A+B ) en exp| p()dgidé (4.239) 


Substitution of (4.239) into (4.228) yields the second solution 


1 
v(x) = 10) | ——expl-[ p(S)aeaae (4.240) 
Jyoyr 4 
In obtaining the above solution, we have dropped the integration constants. 

To further simplify this solution, we recall the characteristic equation from 
(4.216) 


p+ (Do “Vp +4 =0 (4.241) 
We now consider the special case that p2= pi—n into (4.241) gives 
(2, —n)” +(p9 -I(p, -2) + qo = 0 (4.242) 
But, we have from (4.241) that 
go =-Pi - (Po DA; (4.243) 
Substitution of (4.243) into (4.242) gives 
Py =n+1-2p, (4.244) 


Let us consider the Frobenius series for the first solution and recall the Laurent’s 
series expansion of p(x): 


Y, =x"! (ay +a,x+...) (4.245) 
xp(x) = yp, p(x) =2" + p, + poxt... (4.246) 
By virtue of (4. sae and (4.246), the pun of (4.240) becomes 
“por J rt ci 
Pa 7 ror! J [Pe py pyle} (4.247) 


exp{Inx?1"} 


xP fay +ajxt...3 


5 expt fa + Pox +...)dx} 


Therefore, the integrand can be simplified as 
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—l-n 


“Gee exp { fo + poxt...)dx} =x" 2(x) (4.248) 
0 1 eve 


We have 
1 
g0)=-—; (4.249) 
MQ 
For ao # 0, g(x) must be regular and thus g(x) allows a Taylor series expansion as: 
2(x) = > g,,x" (4.250) 
m=0 


Thus, we have 


y@)= co] 8S gate = 10 [ Y yx Me 


m=0 x m=0 


(4.251) 
= 109) Seth 1+ Ye") 
x m=0 m=n-+l 
This can be integrated readily as: 
n-l m-n m-n 
E&m* E&m* 
va) = @){) A +g, Inxt > Su} (4.252) 
m=0 m=n+l 
When n = 0 (i.e., case 2), we have 
yx) = gyi Qx)Inx+ x? Yaya Y Se (4.253) 
k=0 m=1 
Let i = m—1; we have 
i+l ea) i 
ye _ Sin =% Sini¥ (4.254) 
a i+] = i+] 
Thus, the two summations can be soubined as one 
WX) = BoM (@)Inxt xP > by L (4.255) 


m=0 
Since go = g(0) # 0 as given in (4.249), we can divide (4.255) through by go: 


y2(x) = yi(x)Inx +x" Y5,x i (4.256) 
m=1 
When n #0, we have 
Sax! y em — =x" e,x/ =e) x! (4.257) 
k=0 m=0,m#n meh j=0 j=0 


With this, the final solution is of the form: 


y(x) = g(x) In x+ x” ya (4.258) 


260 Theory of Differential Equations in Engineering and Mechanics 


where g, may be zero and therefore, we cannot scale it like (4.256). This completes 
the proof for the case of a = 0 in (4.221) to (4.226), and this analysis can be easily 
extended to the case of nonzero a. 


4.6 BESSEL EQUATION 


We will now consider the following Bessel equation 
xy" +xy'+(x7 -v’\y=0 (4.259) 


where v is called the order of the Bessel equation. Clearly, it falls into our class of 
ODEs with a regular singular point at x = 0. The zero order Bessel equation was 
first considered and solved by Daniel Bernoulli in 1733 when he considered the 
problem of vibrations of chains. The first order Bessel equation was derived by 
Bernoulli when he extended chain vibrations to non-uniform sections. Euler, in 
1739, considered a chain with a weight proportional to x, and arrived at the Bessel 
function of general order. In 1764, Euler also found that the Bessel function is 
related to vibrations of a stretched membrane. Note that the Bessel function with an 
imaginary argument will lead to a modified Bessel function and it will be 
considered in Section 4.6.4. 
Another popular form of the Bessel equation is 


2 2 
¢ a ge aly 2/220 (4.260) 
dp pdp p 


The subscript v is used to highlight the fact that the solution is a function of v. 


4.6.1 Frobenius Series for Non-Integer Order 
A solution in terms of the Frobenius series is sought 
Z,= ap" =p" (antaptap +..tap* +..) (4.261) 
r=0 


Differentiation of this series solution once and twice leads respectively to 


1 dZ,, A-2 2 k 
——=1p’(aj+qptap +..+a,p +...) 
pdp aa ee z (4.262) 


+ p*"(a,+2a,0+...+ka,p*" +...) 
d°Z 


+ =1A-1)p** (ay tap tarp’ +...+a,p* +...) 


+2Ap* (a, +2a,p+...+kap* +...) (4.263) 


+p7 (2a, +6a,9+...+k(k —l)a,p** +...) 
2 
~~.Z, =-v7p* (a) +ap+arp? +..tap* +...) (4.264) 
fo) 
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Substituting these into (4.260) and collecting the coefficients for p*? gives the 
following indicial equation for A 


[A(A-1)+A-v* Jay =0 (4.265) 
For nonzero ao, two roots for the indices in the Frobenius series are obtained 
A=tv (4.266) 
Equating the coefficients for p***? on both sides of the equation gives 
[A(A-1)4+ 2Ak+k(kK-1) +A+k-v" Ja, +a, =0 (4.267) 


Using the indicial value 4 = v obtained in (4.266) gives the recurrence formula for 
the coefficient 


kK(2A+k)a, + ay_5 =0 (4.268) 
In view of (4.266), it can be simplified as 
A-2 
a, = -———_ 4.269 
t k(QA +k) een) 
For even k (i.e., k = 2m), the coefficient of the series solution becomes 
44m-2 4 m—2 1 (- Lay m4 


a 2m(2A + 2m) 4m(A +m) 4m(A +m) (2m—2)(2A+2m—2) 


2 
ms z oma (4.270) 
2° mm—-N(At+m)(A+m-1) 
(-1)" ml) 


22" miAt+m)\(Atm—-l) (At) 
The initial constants ao and a; are arbitrary constants, and thus without loss of 
generality can be set to 
1 1 


oY Fa + ph 
(- 1)” 1 
@2m = Samay 2 
2 m'IT(A+m+l1) 
With these coefficients, the series solution becomes 


— . ele Py2m+v _ 
ty pS er arenray ey J(p) (4.273) 


Aom1 = 0 (4.272) 


m=0 
This gives the solution as J\(p), the Bessel function of the first kind of order v, 
which is plotted in Figure 4.5. In principle, we can also retain odd order terms 
instead of even order terms, but the resulting function is not the Bessel function. 
For the second solution, we can simply make the substitution of 4 =—v 


J ey=>, ee Cy (4.274) 


mIT(-v+m+l) 2 


m=0 
Therefore, the general solution of the Bessel equation is 
Z, = AJ, (p)+ BI, (p) (4.275) 
provided that v # integer. However, this solution is rather limited and is normally 
not used because vis an integer for many physical problems. 


262 Theory of Differential Equations in Engineering and Mechanics 
4.6.2 Bessel Function of Second Kind for Integer Order 


The Bessel function of the negative order reported in the last section breaks down 
when the order is an integer, and thus (4.275) is not the most general solution. 
To illustrate the problem, we see that the recurrence equation becomes for v 
=n 
(k* —2kn)a, +a,_, =0 (4.276) 
Since we have assumed that k is even (or k = 2m, where m = 1,2,3,...), (4.276) can 
be written as 
(4m? —4mn)a, + a,_, =0 (4.277) 
The solution is for & being an infinite series (or m = 1,2,..., 00). Therefore, no matter 
what is the given order in the ODE, m will match n sooner or later. The recurrence 
formula of the series solution breaks down when m = n. 


Figure 4.5 Bessel functions of the first kind 


In fact, for the case of integer order (i.e., v = 1), J-, is not independent of J,. In 
particular, they satisfy the ai identity 


J_,(p)=(-))" J,(p) (4.278) 


To prove this, we first note that the gamma function in the Bessel function becomes 
a factorial function for the case of integers. That is, 


Ze. (p) _ > (-1) (2) 2m+n -> (-1) ae (4.279) 


= ~ mir (atm) 2) 4mi(n+m)! 2 


Similarly, the Bessel function of negative order is 
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(-1)” 2m-n 
4.280 
J_,(p)= anes aa (4.280) 
No matter what the initial value of the integer n is, we encounter the unbounded 
cases of the gamma function when m <n. More specially, we have 


1 1 
J ~ ———_ > — =0, m<n 4.281 
-a(P) IT(m-n+l © ( ) 
Effectively, we can drop all terms with m <n since they are zeros. The solution 
(4.280) becomes 


oa} 


J_,(p) = >» (-l) fy n (4.282) 


4 m\T(m—n+l) 2 


Since the series is infinite, we can shift back the lower index of summation to zero 
by setting k = m-n: 


IS (p)= = ve (2) 2k+n ( vy = 1) (eye 


eae 5 4 (k enytkl 2 
(4.283) 
n ye (2 2mt+n _ ¢_4\n 
=(-1) eae no 


This is (4.278) given above. 

Our main job is now to search for an independent solution for the Bessel 
function for the case of integer order. There is no unique choice in this process. 
Recall that even the choice of Jv is not unique (we have selected a particular value 
of ao such that the definition of Bessel function becomes more compact). The 
following definition by Weber and Schlafli is widely accepted (Watson, 1944): 


Y, (p) = lim Ze P08 Fv) (4.284) 
von Sin V7 
where Y, is called the Bessel function of the second kind of order n. When the 
integer limit is taken, both numerator and denominator approach zero. Since the 
limit is of the form 0/0, L’H6pital’s rule can be applied to (4.284) to arrive at 


BP) css —msinvaJ,(p)— 0-(0) 
Y, (p) = lim —O¥ (4.285) 
von 7 COSVIT 
Note that 
cosnz=(-l)", sinnz =0 (4.286) 
In view of these limiting values, (4.285) becomes 
¥.(p)= li im( Ph 4 yn aJ-y(P), (4.287) 


Ov 
Note that we have applied ne limit eae for sine and cosine functions only. Our 
next step is to find the derivative of Jy with respect to the order v. In particular, the 
power is first rewritten as: 


(Bypmty = elnle/2y" = e(2mev )in(p/2) (4.288) 
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Then, the differentiation can be evaluated as 


ad, (p) = 0 y( 1" 1 el2mrv)in(o/2) 
Ov ov mIV(v+m+)) 


(4.289) 


ice) 


=J,(p)In(p/2)— S°(-1)” —, (p/2yme 


m=0 


Similarly, we have 
aJ_,(p) _ m_ W(vtm+tl) 
Vv 


mIT(-v+m-+1) 


=, (p)In(p/2)+ > (-1) (p/2)2"-" (4.290) 


é 
Now, we can take the limiting case as v > n 
a, (p) . 
hon = In(0)In(9/2)— DY 
m=0 


Both yw and [ are finite and it is clear that for the positive order no special 
treatment is needed. For the negative order, we have 


of 5 
[ (A) =-J_,(p)In(p/2)+ yi "[ 


ov m=0 


- m y(-v+m-+1) 2m-n 
Oem phone 


m=0 


m W(nt+m+l) 


Pp (4501 
a ere al ) ( ) 


[ 


y(-v+m+l) 
mIT(-v+m-+1) 


i (p fi 2) aes 


m=n 


(4.292) 

Both the gamma function and the digamma function in (4.292) approach infinity 

for m <n. Thus, a special treatment is needed for the first sum on the right hand 

side of (4.292). Our objective is to evaluate the ratio of them given in the square 

bracket. Applying the reflection formulas for both digamma and gamma functions, 
we have 

y(v+m+l)=y[l-(v—-m)]=yw(v-m)+zcot[z(v -m)] (4.293) 

T(-v+m+41)=P(1-(v—m)) = —~ : (4.294) 

sin z(v—m) T'(v—m) 


Dividing (4.293) by (4.294) gives 
y(-v+m-+l) 
T(-v+m+l) 


= : [w(v —m)+ 2 cot[z(v —m)]]sin z(v —m)(v—m) 
1 


_w(v-m)sinz(v—m)I(v—m) 


+cosz(v—m)C(v—m) 
1 
(4.295) 
Note that 
sin z(n—m) =0 (4.296) 


pAb 
T( on 1) von 


Therefore, the limit between digamma and gamma functions exists and is finite for 
the first summation term on the right of (4.292). In addition, we want to shift the 


=T(n-m)cos(n—m)az =(-1)" "(n—m-I)!_— (4.297) 
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summation index of the last term in (4.292) back to starting from 0, instead of n. In 
particular, we can shift the index of summation by assuming m = n+k: 


= m y(-v+m-+1) 2m-n 
I eran Te 5) 


m=n 


A i al Hes P\2k+n 
=] D (n+ BI Te+D 2? (4.298) 


n = m I y(m+l) P\2m+n 
ae! GaamiTomst 2 


m=0 
n = im WUMNF)) oP aun 
il 2 ) mint m2? 


Back substitution of these results into (4.292) we oe 


O_, (2), n+ n (n- m— a m-n 
[Fo bon = CD" p(p)In(0/2)+(-D ye (p/2)° 
" (4.299) 
+( yd yn “_ (ai2y"" 


m)! 

m=0 

Finally, the second solution for nM Bessel equation is obtained by substituting 
(4.299) and ae into (4.287) 


Mae : (4.300) 
1 Ds a me 
Lars 0 


Instead of expressing in terms of ee functions, whose evaluation requires 
computer programs or numerical tables, we can express the digamma function in 
finite series as 

m 


y(m+}) ara ar: (4.301) 
To see the validity of this equation, we start at Weierstrass’s canonical form 
given in (4.27) 
1 Z 7 Z\ -z/n 
= ze?” 1+—)e 4.302 
we IT -) ! (4.302) 


Inversion of (4.302) gives the following form of the gamma function 
1 
I'(z) = 


yz = z —z/n 
ze []{+2 } 


n-l 


(4.303) 


Taking the logarithm of the gamma function given in (4.303), we obtain 
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InT(z) =-Inz—yz— "Inf + 2)e2!"]=-Inz-yz- > find + 7)-7] (4.304) 
n=l n n=l ue n 
Differentiation of (4.304) gives 
d T(z) _ 


nl) = ae 


1 1 1 
w2)=-—-7- DI ] (4.305) 


Next, we want to evaluate the special value of w(1) 


“ 1] ae alll 
“yeni 2 eae, 7” id da! > 


n=l n= n=l (4.306) 
= 1 = 1 
med 2, n+l 2, k+1 
Note that we have shifted the index of summation using n = k+1 for the last sum. 
Finally, we recall the recurrence formula: 


1 1 1 1 
y(nt+l)= : +y(n)=—+ +yw(n-l)=—+ +..4¢1+y() 
n n n-l n n-l 


epee 
: Zoo. 


To obtain the final line in (4.307), we have substituted the result obtained in 
(4.306). This gives the required series expansion for the digamma function given in 
(4.301). 

Finally, with substitution of (4.307) into (4.300) we get the commonly seen 
definition of the Bessel function of the second kind or Y,, as 


n-| 
¥,(o)=-> YSN pian 


(4.307) 


m! 
m=0,n>0 , (4.308) 
2x Oe 3) eee rar Sl 
+ 1 In(—) + + 
pout ) mint © d 5 (Quy 2.4)! 
where the Euler or Euler-Mascheroni constant y is defined as 
y= lim Si+-inm =0.5772157 (4.309) 
30) k 


More discussion of Euler constant is given in Appendix E. It was Mascheroni who 
extended Euler’s 16 digit calculation of yto 32 digits in 1790 (although wrong) and 
gave it the symbol “gamma”. Figure 4.6 plots the Bessel function of the second 
kind. Finally, we obtain the general solution, which is valid for both integer and 
non-integer order 

Z, = AJ, (p)+ BY, (p) (4.310) 


The Bessel function is considered one of the most commonly encountered functions 
other than circular and hyperbolic functions. It appears naturally in many branches 
of science and engineering. As summarized by Watson (1944), it is intimately 
related to the Riccati equation. The most comprehensive authority of the Bessel 
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function remains the classic book by Watson (1944). Some people regarded it as 
the best mathematics book of all time. A brief biography of Watson is given in the 


To end this section, we record the first few terms in the series form of the 


back of this book. 
Bessel functions as: 
i= ey oy ee (4.311) 
° die 2: iy 2 Oy? ~ 
x x4 x° 
J)(x)=1 + Paes 
0 2  9?.42 2?.42.6? (4.312) 
2 4 6 , 
x 2# x 
=l1-—+ + 
4 64 2304 
J,(x) ail Cars : Oa C xy el (4.313) 
x)= is 
' 2 1!2! 2 2!3! 2 314! 2 
1 a es rs 
J,(x)= 1 + x +... 4.314 
2(x) Git TTS hi PTS ) ] ( ) 
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Figure 4.6 Bessel functions of the second kind Y, 
2 x 
Yo(x) = Pee, + 7]Jo(x) 
2 x4 x6 (4.315) 
ae (1+ DS fiz) ] 
2 22. 4? 2°3 sie 


a 2 
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4.6.3 Circular Functions and Bessel Functions of Half Order 


The Bessel function of half order can be expressed in terms of circular functions. 
Let us consider the case that v = 1/2: 
2 GC iy” 2mtl/2 


Jp@=>) 3 (4.316) 
a=) g2mtl/2 m WG + m) 
The gamma function can be evaluated as 
re +m) = ie eine +m) = eC +m)(m Linn 4) 
2 2 2 2 2 2 (4.317) 
1 1 1_1 2m+1)(2m-1)-:-3-1 : 
= + m\(m— 5) 51) = 
Substitution of (4.317) into (4.316) gives 
ca) ‘= iV 2mtl/2 qimtl 
Ji2(x) = 
1/2 sear ae re 
7 is j 1 2m 
1X “4 2" m!1-3-5++*(2m +1) 
(4.318) 
_ {2> (- 1)” 2nd 
ax i= (2-4-6:++2m)[1-3-5+-(2m+])] 
7 {2> Cir 7 [2 sinx 
mx“ (2m+1)! mx 
Let us consider the case that v = —1/2 
oO Cir 
Fap@=>) (4.319) 
a0 g2m-W2 m TS + m) 
The gamma function can be evaluated as 
Pm) =(m-=)P (m= 2) = (m= 2\(m-2)P n=) 
2 2 2 2 2 2 (4.320) 
1 3, 311. (2m-1)(2m-3)*3-1 , 
=(m——)\(m—_)»-——T(_-) = 
(m—Sy(m—S)- S50) on Va 


Substitution of (4.320) into (4.319) gives 
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J () 7 x ele (x / go Qm 
NON £4 M1356 Qm—DNr 


-~> Cis” 
WX AA 2" (1-3-5 m)[1-3-5+*(2m—1] 
-(2> (-h"x" 

mx 4 (2-4-6++2m)[1-3-5++(2m 1] 


0 m 2m 
= 2 > ( ) a = Z COS X 
\ mx (2m)! mx 


m=0 


(4.321) 


Therefore, the sine is related to Ji. and the cosine is related to J-12. The following 
example provides another proof of the functional form of the two identities. 
Example 4.7 Show that the solution of the following Bessel equation 
1 
xy" + xy" +(x? “ye (4.322) 
can be expressed as 


Pe ake ree eee (4.323) 


vx vx 
Solution: Assume the following change of variables 
w(x) (4.324) 


aes 


Differentiation of y can be expressed in terms of w as 


' L. 3/2 t 
=)-—x w(x) +—=w (4.325) 
yf pene 
y" a {dena tW sew (4.326) 
4 x 
Substitution of (4.326) and (4.325) into (4.322) results in 
w"+w'=0 (4.327) 
The general solution is of course sine and cosine 
w= Acosx+Bsinx (4.328) 
Thus, we have 
pee ae ee eee (4.329) 


Vx vx 
On the other hand, the solution of (4.322) is Bessel functions of the order 1/2: 
Y= CI _12(4) + CJi2() (4.330) 
This agrees with the results obtained in (4.318) and (4.321). 
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4.6.4 Modified Bessel Function 


When the argument of a Bessel function becomes purely imaginary, we can make 
the following substitution: 9 <-ip or 9<--ip. The Bessel equation becomes the 
so-called modified Bessel equation 


az, dZ, 2 
ge ¥-/14+ |Z, =0 (4.331) 
dp pdp pP 

Its solution is evidently J,(ip) . Since (4.331) is a homogeneous differential 
equation, any constant multiplying the Bessel function of the imaginary argument is 


also a solution of the modified Bessel function. Thus, we can define its solution as: 
I,(p)=i "J, (ip), (-a<argp <7/2) 
=i'J,(-ip), (#/2<argp<z) 
This is called the modified Bessel function of the first kind, and it was defined by 


Basset in 1889. Substitution of <—ip into the series definition of the Bessel 
function gives 


(4.332) 


90 (p/ ay 
I = 4.333 
vp) airy rmaD ( ) 
which is plotted in Figure 4.7. Thus, the general solution can be expressed as: 
Z,(p) = Al,(p)+ BL, (p) (4.334) 


For the case of negative integer order, similar to the case of the Bessel function of 
integer order, these two solutions are no longer independent. In fact, we can show 
that 

I, (p)=L.,(P) (4.335) 


where n is an integer. 
To show this, we first note that 


fon) Qm+n i) 2m+n 
L(p)= > 2p) (4.336) 


mT (n+m-+1) m\(n+m)! 


m=0 m=0 
The modified Bessel function of negative order becomes 
2 /2 2m-n 
L,(p)= ye (4.337) 


m!T(m—n+1) 


m=0 
As we have shown before, the gamma function I'(k) becomes infinite for k = 0, -1, 
—2, ... Thus, all terms with m <n vanish in the summation, or we have 


_ ics) (ai2y"" 
Tal) pa oe eareny oe) 


Now, we can make the following change of summation index: 
k=m-—n (4.339) 
Then, (4.488) can be rewritten as 


Series Solution of Second Order ODEs 271 


7 oo (aioe _ 20 (pia _ 
Fl?) de+miresD 2, kami"? oe) 


This completes the proof. 

For integer order, we therefore need to define another independent solution. 
The following second independent solution is called the modified Bessel function 
of the second kind (Abramowitz and Stegun, 1964): 

GiGi Fp) L,(p) 
2 sin vr 

which was defined by MacDonald in 1899 and therefore it is also known as the 
MacDonald function. For integer v = n, the definition of K, was proposed by Basset 
in 1889. The modified Bessel function of the second kind is plotted in Figure 4.8. 
Applying L’ H6pital’s rule to (4.341) we obtain 


_C" AL) ap) 


(4.341) 


K 4.342 
n (p) 2 { ev Ov ttn ( ) 
Following a similar step in obtaining Y,, we have 
al,(p)_ 0 3 1 p2mevyin(p!2) 
Ov Ov “4 mT (v+m+]) 
mus (4.343) 
- l y(v +m+ 1) 2m+v 
=) In(p/2 /2 
pi yinkp 2) Lim TvemD” 


Similarly, we have 


ol_, (p) Parte 7 1 wcvt+mt)) 2m-v 
eels Liev el2)* > ramet) (p/2) (4.344) 


m=0 


Figure 4.7 Modified Bessel functions of the first kind J, 
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Figure 4.8 Modified Bessel functions of the second kind Kn 


Now, we can take the limiting case as v > n 
(oe (p) 
Ov 


It is clear that for the positive order no special treatment is needed, as the right 
hand side of (4.345) is finite. For the negative order, we have 


= _ . y(nt+m+tl) 2m+n 
lon = 2,()n(p/2) pS erarenry (p/2) (4.345) 


Ol_,(p) ey y(-v+m+l) om-n 
pein te =-[ In(p/2)+ > SS /2 
[ Av W235 —n (p) (p ) a ml! T(-v deiyesh 1) ie (p ) 


ice) 


1 wiv+m+4)) 2m—n 
+ > —[|_———_ /2 
a Terenas ! 


(4.346) 

Our next objective is to find the limit in the square bracket in the first summation 

term in (4.346). Both the gamma function and the digamma function in (4.346) 

approach infinity for m <n. Thus, we have an indeterminate form of 0/oo. By 
applying the reflection formula of gamma and digamma functions, we have: 

y(-v+m+l) 


T(-v+m+l) 


iy = : [{v(v—m) +x cot[z(v —m)]} sin z(v —m)l(v —m)] 
1 


von 


_ ee —m)sin 2(v —m)T'(v —m) 
1 


+cosa(v —m)I(v- m| 
von 
=(-1)" "(n-m-1)! 
(4.347) 
Similar to the calculation for (4.298), we have 
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= 1 y(-v+m+t]) P\2m-n _ = I wk +l) P\2k+n 

2 nas © Labi TED? 

ee Ce a (4.348) 
ay y(m+1) ea) 2m+n 


= —j m\(n +m)! 2 


Back substitution of these results into (4.346) yields 
ay ~ n-m \A—-™M —l)! m-n 
pe. pre ay Xe yee py 
Ov = m! 
(4.349) 
—y(m+))_ + 1) = ip py" 
<4 m\(n+m)! 


+ 


The second ‘slition is called the modified Eeee equation of the second kind 


K,(e)=¢ yy, (p)In( p/2)+ yi 1)” (n—=m— ae i) ae n 
= (4.350) 


ey 0 (aio? 
re ne ae 


Finally, by using the series expansion for digamma given in (4.301) we obtain 
n—l| 


Kip=5 Deyn pian 


m=0,n>0 : (4.351) 
00 /2) m+n p atm 4 m 
+(-1 n+l (p In as + 
(1) Jereren (+r AO yy I 


where Euler’s constant y (= 0.5772157). 
Thus, the general solution of (4.331) is 
Z,(p) = Al, (p) + BK, (p) (4.352) 
To summarize, we simply report the first few terms of the modified Bessel functions 
as 


_ iste VT otg. 2 of 
Iy(x) “oF oa * op rae * opp GF +... (4.353) 
h@)=5 ie teat : Cay : (1x96 4..J (4.354) 


12! 2 213! 2 3!4! 2 


K(x) = (n+ 7)lo(x) + 
(4.355) 
11 o1.4 he le .xg 

ie (5a + (14 Dev ao 
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11 
Ki) = Hn iO) +04 DG 
; 1 ie 11 es 
6 
+++ oa Oot sta a aa”? ttl 


We have shown that Bessel functions of half integer order can be related to 
circular functions (i.e., sine and cosine). We will consider the half integer order of 
modified functions here. 


(p/2ymtl? (alzye" 
: 1 4.357 
2p) = ier oases ela on 


By using (4.317), the gamma function in (4.357) can be rewritten as: 
m\T(m+3/2)2?"" 


qintl 


m-+1 
=(2%mn—o2-? Seca te aE 
(4.358) 


=[2m(2m —2)---4-2][(2m +1)(2m-1)-+-3-1-Vz] 
=(2m+1)!vzr 


Therefore, _— can ae written as 


po 5 
2 2 
I = = |- Hi = | sinh 4.359 
v2(P) pe cae a a + } i p ( ) 


Similarly, the modified Bessel is of —1/2 can be considered following the 
same procedure: 


(p/ P) ais (p/2y" 
T. 1 4.360 

v2le)= ae 1/24) 2 ntona ee 
By using (4.320), the gamma function in (4.360) can be rewritten as: 


m\T(m+1/2)22™ 


m 1 3 5 31 i " 
=e esllee ae em yee Ce (4.361) 


=[2m(2m —2)---4-2][(2m—1\(2m—3)-+:3-1-Vz] 


=(2m)!Vx 


Therefore, (4.360) can es ey as 


pe 
[2 - 2 [2 
L_ip(p)= me soni oo a 7 ae . coshp (4.362) 


In summary, the modified Bessel ne of half integer order can be expressed in 
terms of hyperbolic functions. In the next section, we will consider the integral 
representation of the Bessel function. 
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4.6.5 Helmholtz Equation and Integral Representation 


To consider the integral representation of the Bessel function of the first kind, we 
start from the 2-D wave equation 


Ow Ow 1 O°w_ 


ae Oe eee (4.363) 
We seek a periodic waves or harmonic waves of the form 
w=u(x, ye (4.364) 
Substitution of (4.364) into (4.363) gives the following Helmholtz equation 
a ae (4.365) 
axe Gy? 
where 
2 
Kk? = = (4.366) 
We seek a solution of the form 
u = Ae) (4.367) 
Substitution of (4.367) into (4.365) gives 
a’ +b? =k? (4.368) 
Thus, it is natural to assume a new parameter @ such that 
a=kcosa, b=ksina (4.369) 
This suggests a change of variables in polar form: 
xX=rcosg, y=rsing (4.370) 


Combining these, we obtain the following solution form of (4.367) 
u = Agitr cosacosp+sinasing) = Agikrcos(p-a) (4.371) 
For the special case of ~@= @, we have the case of a plane wave propagating in the 
direction of g = @ (or in the form of a one-dimensional wave). The change of 
variables given in (4.370) results in 
2 22 Or x 
r=(x+ 7 = 
Oo +y") & +yy? 


= COS — a =sing 
oy 


; ig : (4.372) 
jays. e. =six = i ee 0p _ cosg 
Ox lt+(y/x) x+y r oy r 
Application of (4.372) and the chain rule of partial differentiation leads to 
Ou _ Or Ou Og Ou eT ci sing Ou (4.373) 
Ox Ox or Ox 0p or r O@ 
Similarly, we also have 
OH iggy 8 | ONG (4.374) 
oy or r Op 


Application of (4.373) twice, we have 
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ou _ 0 au 
ax? Ox Ox 
acs 0 es sing Ou, sing Oo eee sing Ou, (4.375) 
or or r 0O@ r 09 or r 0g 
ack cos’ ptsin? 9 Lu 1 Hu )+2sin cos (ato! Oru 
eo Ot og) ee Op TF Oper 
oe oe of (4.374) twice leads to 
Ou 
a ts 2 
dy” Oy oy 
oe ro} ie , 208 ay 08g 0 ie _ £089 oy (4.376) 
or or r 0p r 0p or r 0p 
2 au 2 
ae pecs al as rae “) 2sin poos oon = Ou 
or” r ap. r 0p dp r Ogor 


Finally, combining (4.375) and (4.376), we obtain the polar form of the Helmholtz 
equation 
2 2 
hey ao yl, 1 ee, (4.377) 
dy? ar? r or r- oy” 
Applying another round of change of variables of p = kr, we have 
Ou léu 1 eu 


ap : Pon Pee tu=0 (4.378) 
Next, we seek an angular dependence of the form: 
u=Z,(p)e” (4.379) 
Substitution of (4.379) into (4.378) gives 
a pee aa a He20 (4.380) 
dp pdp p 


This is of course the Bessel equation that we discussed earlier. If the boundedness 
of the solution of origin is enforced, the solution of wu can only be written as: 


u=AJ,(p)e”” (4.381) 
Now we can rewrite (4.371) by assuming 
A=Ce"™ (4.382) 
and integrate the solution from £ to y as 
=O) } 7 giPeos(p-a) gina fry (4.383) 
B 
Physically, this corresponds to a bundle of waves with directions varying from a = 
B toa=y. 
We apply the following change of variables: 
A=w+p (4.384) 
to give 


: wo. 5 ea 
u= Ge) el Pos Ww ew dw (4.385) 


Wo 
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where 

wy =B-9 W=r-— (4.386) 
With this change of variables, the angular dependence of @ appears explicitly in 
(4.385) and it closely resembles the exact solution that we got from (4.381). The 
only problem is that the limit of integration cannot be dependent on the angular 
variable @. Thus, we must remove the @ dependence from wo and w;. This is 
probably the most crucial step in this analysis. This can only be achieved if we 
allow both f and y to tend to infinity. Or, we have 

Wy >, Wc (4.387) 

Thus, the issue becomes the study of the convergence of the integral given in 
(4.385) as B and y > . Since physically the wave will decay to zero at infinity, we 
must have a converging solution. In other words, we need to make uw remain finite 
as w — oo. This can be done if we allow w to be a complex number. That is, 


w= ptigq (4.388) 
We note that 
cos w = cos(p+iq) = Gos poostiq) —sin psin(iqg) (4.389) 
= cos pcoshq —isin psinhg 
In obtaining the above equation, we have used the following identities 
cos(iq) =coshg, sin(iq) =isinhg (4.390) 
These identities can be obtained readily by noting Euler’s formula that 
e” =cosO+isind, e'? =cosd—isind (4.391) 
Allowing 0 be purely imaginary (or 0 =i), we get 
e? =cos(id)+isin(ig), e” =cos(id) —isin(i) (4.392) 


Adding these equations gives 
e +e* —cos(id)—isin(id) + cos(id) +isin(id) 
aaa 2 
Similarly, subtracting them gives 
ef-e? _ cos(ig) —isin(ip) —cos(ig) —isin(ig) _ 
2 2 
Thus, for real p in (4.385) we must have 
Re(icos w) = sin psinhg <0 (4.395) 
Referring to the complex plane shown in Figure 4.9, we are searching a path along 
which (4.385) will converge to zero as w > . 
For the upper w-plane, we have g > 0 and we need 


cosh ¢ = =cos(ig) (4.393) 


sinh ¢ = isin(id) (4.394) 


sin p <0, -m<p<0 (4.396) 
For the lower w-plane, we have g < 0 and we need 
sin p > 0, O<p<a (4.397) 


For p > a and p < —1, we can also establish the converging zone according. The 
regions for which the passage for wo and w being infinity is permissible are shaded 
in Figure 4.9. For the case that p is complex, the regions shown in Figure 4.9 only 
shift horizontally, but we will not consider this possibility here (e.g., see 
Sommerfeld, 1949). 
We now consider the special case that 
Wy =atin >, w,=b+ic0—>o (4.398) 
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where 
-m<a<0, 2<b<2z (4.399) 
Let us take the constant c, in (4.385) as 
C, =e? (4.400) 
2a 
Comparison of (4.381) and (4.385) yields the following result 
1 ; 
Se ee ipcosw in(w-7/2) 4.401 
Iylp) = 5 fermen" Maw (4.401) 


where W is the rectangular contour ABCD shown in Figure 4.9 such that 

1 +100 > -—1 > 1 > 1 +i00 (4.402) 
Note that this contour integral over a complex path W has a great advantage over 
the real representation in that it is not limited to only integral values of n but 
remains valid for arbitrary values of n. Consider the path BC and use a change of 
variables of 


B=w-a2/2 (4.403) 
This 8 axis is shown in Figure 4.9. Thus, we have 
cos w=cos(# + 72/2) =cos Pcos(z /2)—sin f sin(z / 2) 
=-sin B 
With this change of variables, the integration path BC of -w2 < w < 32 is 
mapped to becoming —z < #< z. The integral in (4.401) along path BC becomes 


(4.404) 


_ 1 o i(nB—psin 2) 
SWOge =5- [ie psinB) yp (4.405) 
For the path AB, we use a change of variables of 
B=-nt+iy (4.406) 


Thus, we have 
sin £ = sin(—z +iy) =—sin zcos(iv) + sin(iy) cos z 


=-—sin(iy) (4.407) 
For points A and B, we have 
A:B>-m+iew (y>~); B:B>-at (y>0) (4.408) 
The integral in (4.401) along path AB becomes 
J, (lp _ Sof enrtinttosntn ay = Serine (* emrsinsinin gy 
ae ; 2 ° (4.409) 
= el eymepsiahy) gy, 
For the path CD, we use another change of variables of 
B=ntiy (4.410) 
Thus, we have 
sin £ = sin(z +iv) =sin zcos(iy) +sin(iv) cos z (4.411) 
=-—isinh y 
For points D and C, we have 
D:Brnrt+iw (yoo); C:Broa (y>0) (4.412) 


The integral in (4.401) along path CD becomes 
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i ei iy)—psinh i inn [* —(yn+psinhy) 
J _ I elatiy) psinhy yy, — e I e d (4.413 
n(Plep 2n 40 ae 0 ” ) 


Baxi 


D btic 


As) 


= 


Figure 4.9 Permissible regions in the complex plane for contours giving converging integral 
(Sommerfeld’s contour) 


Adding all these results gives the following integral representation of the Bessel 
function of the first kind: 


1 ipcosw_in(w-7 
J,(P) = xed? ee e : Maw = Ji(P| an + InP ne + Ji lap 


-=—[" [cos(nB — psin B)+isin(nfB — psin B)d B - ne e mtpsinhy) gy, 


e ntesinhy) gy 


= =|" cos(n — psin B)d B- 


sin nz al 


-— |" cos(n B — psin Bd B- sane emt esinhy) gy, 
0 


(4.414) 
where n may not be an integer. The imaginary part of (4.414) vanishes because the 
sine function is an odd function from —z to 7 and the limit of the cosine term is 
rewritten from —z to m to 0 to 2 by recognizing the even properties of the cosine 
function. This is Schlafli’s generalized Bessel integral obtained in 1873 (Watson, 
1944). However, the current proof follows from that of Sommerfeld (1949) instead 
of from Watson (1944). Other integral representations were reported by Watson 
(1944). 

For the case of integer order 1, we have sin(7z) = 0 and thus 
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J,(p) =~ |" cosinf — psin Bap (4.415) 
#0 


This is the well-known integral representation of the Bessel function of the first 
kind. 

Let us demonstrate the power of this integral representation. For the Bessel 
function of negative order of integer —n, we have from (4.415) 


J_,(p)= =" cos(nf + psin P)d B= =|" cos[n(z —@)+ psin(z-—a)|da 


= ~f*costna -na+psinalda 
40 

= ~f* cost-na + psina]cos(nz) —sin[-na@ + psina]sin(uz)da (4.416) 
40 


=(-1)” ~ {* cosl na+psinalda 
m 40 


a (-1)” J, (p) 
This, of course, agrees with our earlier result that the Bessel function of the first 
kind of negative integer order is dependent on that of the positive integer order 
given in (4.278). 

This type of integral representation was found important in the investigation 
of light diffraction problems and this is also the prime objective of Sommerfeld 
when he studied these integrals. Sommerfeld published in 1894 an important paper 
on diffraction of light by a screen that improved on the results from Fresnel, 
Kirchhoff, and Poincare on short wavelength limits. His results were confirmed by 
experiments for large and small diffraction angles. This piece of work brought 
considerable fame to Sommerfeld. In wave propagation problems, Sommerfeld’s 
radiation condition is of profound importance. 

We will say a little more about the legendary story of Prof. Arnold 
Sommerfeld here. According to Crawford (2001), Sommerfeld was the king of 
Nobel Prize nominations and had been nominated to receive the Nobel Prize for a 
record of 81 times over a span of 34 years from 1917 to 1950. On average, he 
received 2.38 nominations per year in the 34 years. Clearly, he should receive a 
Guinness World Record certificate for this. On April 26 1951, Sommerfeld was run 
over by a car when he was playing with his grandkids. Unfortunately, he never 
received the prize. Ironically, he himself had made two nominations, one to 
Einstein and one to Planck, and both of them of course received the Nobel Prize. 
Even more ironically, he had taught 7 PhD or post-doctoral students, who 
eventually received the Nobel Prize. These recipients include W. Heisenberg 
(physics, 1932), H. Bethe (physics, 1967), W. Pauli (physics, 1945), P. Debye 
(chemistry, 1936), L. Pauling (chemistry, 1954), M. von Laue (physics, 1914) and 
II. Rabi (physics, 1944). It was reported that Einstein once told Sommerfeld: 
“What I especially admire about you is that you have, as it were, pounded out of 
the soil such a large number of young talents.” 

His series of textbooks, Theoretical Lectures on Physics, made significant 
impacts on the new generation of scientists and on the development of physics. His 
books include: Mechanics (Theoretical Lectures on Physics Vol. 1), Mechanics of 
Deformable Bodies (Theoretical Lectures on Physics Vol. 2), Electrodynamics 
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(Theoretical Lectures on Physics Vol. 3), Optics (Theoretical Lectures on Physics 
Vol. 4), Mathematical Theory of Diffraction, Differential Equations in Physics 
(Vol. 6), Atomic Structures and Spectral Lines, and The Theory of Top Volume III. 
4.7 LOMMEL DIFFERENTIAL EQUATION 


The Lommel differential equation is a special type of nonhomogeneous Bessel 
function: 


xy" + xy" +(x? —v? y= xi (4.417) 
The solutions of this equation are the Lommel functions 
5 ,y(X) = 1, (x) Wee (s)ds— J, (x) i oy (a (4.418) 
gap Lte ty) 
Spy (2) = 54, (2)- ——+—_{y, (x)-cos[a(s—v)/2]¥,(x)} (4.419) 
aT") 


There are other types of nonhomogeneous Bessel equations, including the Anger 
differential equation and the Weber differential equation. Details can be found in 
Abramowitz and Stegun (1964). Lommel did research on meteorology, light and 
physical optics. He was the PhD advisor of J. Stark, Nobel Prize winner in physics 
in 1919. 


4.8 HANKEL FUNCTION 


We have given the solutions of the Bessel equation as Jy and Yy. Actually, the 
general solutions to the Bessel function can also appear in different forms. For 
example, N. Nielsen in 1902 defined the following functions, which are now known 
as Hankel functions of the first and second kinds or Bessel functions of the third 
kind, 

HO (x) = J, 00) +i¥,00), HO (x) = J, (x) -i¥, (0) (4.420) 
The symbol H was chosen by Nielsen to honor the contribution of Hankel on the 
integral representation and asymptotic expansions of Bessel functions. 


Recall from (4.284) that 
¥i)= A y/Mpesvr = T(P) (4.421) 
sin v7 


Substitution of (4.421) into the first equation of (4.420) gives 
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J,(x)cosva —J_, @), 
sin vr 
_ J, (x) sinva + i[J, (x) cosva —J_, (x)] 
sin vz 
J_,(x)—J,(x)(cos va —isin vz) , 


H® (x)= J, (0) +1 


(4.422) 


it 
sin v7 
Je Ie) 
isinvaz 
Similarly, the second equation of (4.420) can be expressed as 
J_(x)-e'" J, (x) 
—isin vz 

Inverting these two equations, we can also express the Bessel function of the first 
kind in terms of the Hankel functions as 
H® (x) + H® (x) a HU te HY (x) 

2 ’ J -v (x) rae 2 


A slightly different form of these formulas can be obtained by recognizing the 
following identity: 


HO (x)= (4.423) 


J,,(x) = (4.424) 


eta a ety (ai/2) = jt2” (4.425) 


Using this identity, we have 


TO ~FvOl pay = TA” ~J. 


CO ¢y, 426) 
Sin V7 sin V7 


Hy Q)= 


Analogously, relations between Hankel functions and Bessel functions closely 
resemble the relation between exponential functions and sine and cosine functions 
as illustrated in Table 4.1. 

The analogy is more vivid if we consider the asymptotic expansion of Bessel 
and Hankel functions: 


J,(x) ¥ = cos(x—-“Z- 4), Y,(x) = sin(x- 2-4) (4.427) 


HO (x) x iB exp(x a “) , B(x) x 2 exp[-(x = oI (4.428) 


With this analogy in mind, it is not difficult to visualize the importance of Hankel 
functions. In fact, for physical problems of wave propagation, the role of 
exponential functions and circular functions in Cartesian coordinates is actually 
reflected by Hankel and Bessel functions in polar cylindrical coordinates. 
Therefore, Hankel functions are commonly encountered in wave propagation 
problems (see Section 9.2.3 in Chapter 9). 

Since Bessel functions of imaginary argument can be expressed as modified 
Bessel functions, clearly we can do the same for Hankel functions of imaginary 
argument. In particular, by substituting x by ix into the first equation of (4.426), we 
have 
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Table 4.1. Analogy between circular functions and Bessel functions 


Function type Analogy between functions 
Exponential function Hankel functions 

1 e AO GO 

2 e* HOG) 

3 cos x J,(x) 

4 sin x Y,(x) 

5 e* =cosx+isinx H® =J,(x)+i¥, (x) 

6 e =cosx—isinx H® =J,(x)-i¥, (x) 

H (ix) =— J, (ini ?” -J_,(i0)] (4.429) 
sin vr 


For —a <argx < 7/2, recall that the definition of a modified Bessel function is 
L(x)=i "J, (x), L(x) =i" J_, (ix), (4.430) 
Substitution of (4.430) into (4.429) yields 


-l-v -—(I+v) 


H® (ix) = - E(x) -L,@)1= 


[Z_, (x) i , (x)] 


sin vr (4.431) 
_2 jy) 2 Uy. (2) =i Ae)] 2 MK (x) 
a 2 sin vz 1 


The last result immediately follows from the definition of the modified Bessel of 
the second kind given in (4.341). Inversely, we can substitute x by —ix into the left 
hand side of (4.431) to give 


HO) = 2K, (a), (Ca /2< arg <2) (4.432) 
a 


Following the same procedure, it is straightforward to show that for the case 
of -z/2<argx<7, we have 


H® (ix) = 2 jv = T_T) 
1 


= PRG) (4.433) 
2 sin vr 1 


Again, substitution of x by ix into the left hand side of (4.433) results in 
H?)(x) = 2 jr K (ix), (-a <argx<z/2) (4.434) 
1 


This provides relations between Hankel functions and modified Bessel functions of 
imaginary argument. 


4.9 KELVIN FUNCTIONS 


A kind of special function closely related to the Bessel function is called the Kelvin 
functions. They were proposed by Lord Kelvin in 1889 when he considered certain 
electrical problems. Actually, Kelvin functions not only appear in electrical 
problems that Kelvin encountered, they also emerged naturally in the bending of a 
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cylindrical shell, in the problems of circular plates on elastic foundations, and in 
symmetrical bending of shallow spherical shells. The Kelvin equation of order v is 

x y"+xy'-(ix? +v7)y =0 (4.435) 
The general solutions of this equation can be written as special forms of Bessel or 
modified Bessel functions: 


y=CJ, Givix)+ Ory, (ivix) (4.436) 
y=C, (Vix) + OK, (Vix) (4.437) 


However, these modified Bessel functions are complex and the real and imaginary 
parts of these functions are called Kelvin functions, which are defined as: 


J, (xe!) = J, (igfix) =i", (fix) = ber, (x) + ibei, (x) (4.438) 
eK (xe™!4) =i" K, (Vix) = ker, (x) + ikei, (x) (4.439) 
Note that these Kelvin functions are real. For the case of zero orders, we have 
Iy(Vix) = ber(x) + ibei(x) (4.440) 
Ky (Vix) = ker(x) + ikei(x) (4.441) 


It is customary not to write the subscript “0” for the case of zero order, because 
only the zero order was actually proposed by Kelvin in 1889. Kelvin proposed the 
name of ber(x) and bei(x) because they closely resemble the roles of circular 
functions cos(x) and sin(x). Actually ker(x) and kei(x) were defined by Russell in 
1909 while all higher order Kelvin functions were proposed by Whitehead in 1911 
(Watson, 1944). With the Kelvin functions, the solution of (4.435) becomes 

y = A(ber, (x) +ibei, (x)]+ Biker, (x) + ikei, (x)] (4.442) 
Kelvin functions bery(x) and bei(x) are plotted in Figures 4.10 and 4.11 
respectively. For the case of moving loads on a circular ice plate on water, the 
following equation is encountered (Wyman, 1950; Nevel, 1959; Assur, 1959): 


a’ ld da ied 
VVE+6= +i oC =0 4.443 
ee te (4.443) 
where 
= Ywvil4 
=r( 4.444 
r se ( ) 


and y,, is the unit weight of water and D is the bending stiffness of the plate. The 
solution of (4.443) is 

6 =cber(r) + cybei(r) +c3ker(7) + cykei(7) (4.445) 
A similar equation is also obtained for the case of bending of spherical shallow 
shells (see Timoshenko and Woinowsky-Krieger, 1959). 

Equation (4.443) is not normally named in the literature, and we call it the 
Kelvin equation, as its solutions are Kelvin functions. The role of the Kelvin 
equation versus the biharmonic equation is similar to the role of the Helmholtz 
equation versus the Laplace equation. This is illustrated in Table 4.2. We can see 
that there is a close resemblance of the Laplace and Helmholtz equations versus the 
biharmonic and Kelvin equations. 

In series form, Kelvin functions of the first kind are defined as 
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4g eID . &i2y 
ber(x) =1 ae + ie we (4.446) 
inn pee GI2) Giz" ver 
bei(x) 6) 2 + 5p (4.447) 
/ | 
Ber, (x) / Ber,(x) | 
Pi H 
—7 ne | 
J \ | 
. ” _ 6 . \ ij . 8 
oe K x 
: \ - oh 
My \ / \ 
TN \4Z \ 
"Ber, (x)\ 
> \ 
Ber(x)\— \ 
Figure 4.10 Kelvin function ber, 
Bei, (x) | 


10| 


~10! 


ee 
a 
- ~~ *% \ / 
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Figure 4.11 Kelvin function bei, 
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Table 4.2. Analogy between Laplace and biharmonic equations 


Function type Analogy between Laplace and Biharmonic equations 
Laplace type equations Biharmonic type equations 
Basic equation V’°é =0 (Laplace) V’V’°é = 0 (biharmonic) 
Additional term V°€é+¢ =0 (Helmholtz) VV°E +E =0 (Kelvin) 
Polar form 6 =CJ(r)+CyY,(*) ¢ =Cber(r) + C,bei(r) 
solution +C,ker(r) + C,kei(r) 
i) 2k+v 
ber, (x)= >" ig (4.448) 
aj kIT tk+)) 4 
ca 2k+v 
bei, (x)= >> 2) ig 2 (4.449) 
akilvt+k+l) 4 
Kelvin functions of the second kind can be expressed in series form as: 
4m 
ker(x) =—In(x/2)ber(x) +— 7 bei(x) + 53 CY" PY" yom +1) (4.450) 
m=0 [(2m) a 
jE i (x/ aye 
kei(x) = —In(x / 2)be1(x) — “ber(x) + > Tama YO" +2) (4.451) 
m=0 


ker, (x) = —In(x / 2)ber, (x) + “bei, (x) 


n-l 2k-n 
SS k- die! zy os 32 ee (4.452) 


n+2k 
era a) {w(k+l+y(n+k+1)}cos 


(3n+2k)r 
4 (n+k)\k! 4 


kei, (x) = —{In(x/ 2) + 7}bei, (x) ber, (x) 


n-l 2k-n 
3 (n=k=I'(x/ 2)" 0 Gnt2k)x (4.453) 


= k! 4 
n+2k 
ae re RTD +k D}sin OD 


where the digamma function has been defined (4.307) as 
n 1 
y(nt+)=-vy+ >) — (4.454) 
ar: 


We can use the Kelvin equation to derive the formula for the second differentiation 
of the Kelvin function. For example, the zero order of (4.435) is 
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x’y"+xy'—ix’y =0 (4.455) 


One of its solution is 
y = ber(x) + ibei(x) (4.456) 
Substitution of (4.456) into (4.455) gives 
x*ber"(x) + xber'(x) + x*bei(x) + [.x7bei(x) + xbei'(x)—x7ber(x)]=0 (4.457) 
Setting the real part and imaginary part of (4.457) to zeros gives 


ber’"(x) = = hess —bei(x) (4.458) 
xX 


ee me (4.459) 
Xx 


Another solution of (4.455) is 
y =ker(x) +ikei(x) (4.460) 
Following a similar procedure in getting (4.458) and (4.459), we obtain 


ker" (x) = ~~ ker (2) —kei(x) (4.461) 
kei(x) = ker(x) -— a kei'(x) (4.462) 
Differentiating (4.459) once more gives 7 
bei’ (x) = ber'(x) + ~ bei'(x) - ~bei"(x) (4.463) 
Substitution of (4.459) into (4.463) ee 
bei’"(x) = ber'(x) + = bei'(x)-— ber(x) (4.464) 
Similarly, we have the following identities for other Kelvin functions 
ber’"(x) =—bei'(x) = ber'(x) ; bei(x) (4.465) 
ker’"(x) =—kei'(x) = ker’(x) ; kei(x) (4.466) 
x 
kei’’(x) = ker'(x) + a kei'(x) - ~ker() (4.467) 
x 


Next, we will consider some simple formulas of integrals involving Kelvin 
functions. More specifically, we differentiate the following term: 


4 Tabet] = bei'(x) + xbei"(x) 
dx 


i (4.468) 
= bei’(x) + x[ber(x) — — bei'(x)] = xber(x) 
x 
Integrating both side, we have 
[ xbercayax = xbei'(x) (4.469) 


Similarly, one can easily obtain the following formulas 
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[xbeicae = —xber’(x) (4.470) 
[ xker(ayax = xkei'(x) (4.471) 
[xkei@xyde = —xker'(x) (4.472) 


Note also the following recurrence relations for differentiation (Abramowitz and 
Stegun, 1964) 


zs 


ber'(x) = 5 [ber, (x) + bei, (x)] (4.473) 
bei'(x) = syle (x) + bei, (x)] (4.474) 
ker'(x) = gylen (x) + kei, (x)] (4.475) 
kei'(x) = syle (x) + kei, (x)] (4.476) 


Expansion for the cross product can be found in Abramowitz and Stegun (1964): 
1 (x? /4)* 


ioe) 


ber? + bei; =(x/ 2)” 4.477 
“J ee) DtesksDry sD k! ( ) 
For the special case of v = 0, we have (p. 82 of Watson, 1944) 
4 8 12 16 
be eberGjais OO ee. aa 
oi 4-41 67.6! 87.9! 
More extensive formulas about Kelvin functions can be found in Abramowitz and 


Stegun (1964). 
In closing, we report the following limiting values of Kelvin functions at zero 
argument: 
ber(0) =1, bei(0) = 0, ker(0) > «©, kei(0) = —7/4, 
ber, (0) = 0, bei, (0) = 0, ker, (0) + —c0, kei, (0) + —00 
In predicting the onset of diffuse mode bifurcations of thick-walled hollow 
cylinders of geomaterials, Chau and Choi (1998) found that the evaluations of 
Bessel functions of the first and second kinds of complex arguments are necessary. 
However, Fortran subroutine for such calculations is not readily available in 
standard textbooks (e.g., Press at el., 1992). To check the accuracy of their Fortran 
subroutine, it was found that numerical tables of Kelvin functions given in 
Abramowitz and Stegun (1964) could be used. 


(4.479) 


4.10 LEGENDRE EQUATION 


The Legendre equation is resulted from the spherical form of the Helmholtz 
equation. The solution of the Legendre equation is called the Legendre polynomials 
and it is closely related to spherical harmonics, which is the basic eigenfunction 
expansion of spherical coordinates. Legendre polynomials were first proposed by 
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Legendre in 1785 when he expand Newton’s gravitational potential in series 
expansion. Laplace in 1782 also studied them in relation to the gravitational field 
due to a spherical planet. But, later mathematicians, like Jacobi, Dirichlet and 
Heine agreed that credit should go to Legendre. First, we recall the Helmholtz 
equation in spherical coordinates 


Vy +k?y =0 (4.480) 
In terms of spherical coordinates, we can write the Helmholtz equation as 


10 ,0y lL 85.304 1 @y .4 
+ 0 + +k°yw=0 (4.481 
a” oF Pande a6) Pane ie ear 


Let us assume the following separation of variables 
y(r,0,9) = R(r)O(A)B() (4.482) 
Substitution of (4.482) into (4.481) gives 


nD : 2 
sin Gd (2 dR) 308 4 (in 9 20) 4 424? sin? 9 =-— 2 =m? (4.483) 
R dr dr © dO do ® df 
Then we have 

2 

oe m’® =0 (4.484) 
dg 
The solution of (4.484) is of course 

® = Acosm¢+ Bsin md (4.485) 
Rearranging (4.483) we can rewrite 

1 d d®.  m 1 d_ dR 


(sin@ —) (> —)-k’r? =-B? (4.486) 


@sindd0- dO sin2@ ~Radr- dr 
Thus, the theta-dependent function can be expressed as 


1 d_. ,d®,.... m 
sin 0 + (Br @=0 4.487 
sin@ do dd sin? ry 
Applying the following change of variables, 
x=cos@, P(x)=@(6) (4.488) 
we get 
d 2, dP > m 
1-x P=0 4.489 
ee) ee (4.489) 
Further simplification of this equation by assuming £” = n(n+1) leads to 
d°P . dP m° 
(l-x*) 2x—+[n(n+]) |P =0 (4.490) 
dx? dx lg" 


This is the associated Legendre equation. For the special case of m = 0 (i.e., y is 
independent of ¢), we have the following Legendre equation: 


2 
(1 ys 2x2 + n(n+1)P=0 (4.491) 
X 


We see that the Legendre equation results from the separation of variables of the 
Helmholtz equation. 
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4.10.1 Series Solution 


Let us consider the solution in terms of infinite series as: 


P= ya’ (4.492) 
m=0 
Differentiation of this proposed form gives 
dP _ . m-| d’P = . m-2 
a Dent rr Ze —l)a,,x (4.493) 


Substitution of (4.492) and (4.493) into (4.491) gives 


ice) ice} ice} 
(l =x) }) m(m—1a,, x" 2x) ma,x" +n(n +)>) a,x" =0 (4.494) 
m=2 m=1 m=0 
Multiplying the non-constant coefficients into the series gives 
ive} ioe) ioe) ioe} 
» m(m—l)a,,x"? — > m(m—l)a,,x"" — > 2ma,,x” + Sinn +1)a,,x" =0 
m=2 m=2 m=1 m=0 
(4.495) 
As the summation is for infinite terms, we can always shift the summation to start 
from m= 0. More specifically, we can assume k = m+2 in the first sum and write 
the series as: 


sy (m+2)(m+ la, ox" — x m(m—I)a,,x"" — Sarge” + > n(n +])a,,x"" =0 


m=0 m=2 m=1 m=0 
(4.496) 
Writing out the first few terms explicitly, we have 
2a, + 6a,x—2a,x+n(n+l)ay+n(n+Yax 
(4.497) 


ioe) 
+5 "[(m +2)(m+1)a,,.5 —m(m—Na,, —2ma,, +n(n+l)a,, |x” =0 
m=2 
Thus, setting the coefficients of the zero, first, and general m-order terms to zero, 
we obtain 


2a, +n(n+l)ay =0 (4.498) 
6a, —2a, +n(n+l)a, =0 (4.499) 
(m+2)(m+))a,,.. —[m(m—-1) + 2m—n(n+)Ja,, =0 (4.500) 
These three equations give 
n(n +1) 
ay ie a (4.501) 
6 
Gene (m—n)\m+n+l) (4.503) 


a 
(m+2)(m+1) ” 
Therefore, for m = 2 we have 


Series Solution of Second Order ODEs 291 


y= (2—n)(2+n+]) om (2—nj(2+n+)n(n+)) a (4.504) 
For m = 3, we have 
-_ (3—n)\(4+n) a (3-n)(4+n)(n—-1)(1+ 2) a, (4.505) 
20 5! 
Using (4.503) to (4.505), we can express all terms using only two constants: 
P(x) = doy +a Vo gene) 
where 
= nat) 2, (n= 2)nn + (043) 4 (4.507) 
2 4! 
peal = +2) 3, (2-3\(n Yen eae (4.508) 


This provides the general solution for the Legendre equation. From (4.500), if ” is 
an integer, we have am+2 = 0, for m =n, n+2, .... That is, the series becomes finite 
and is of order n (for both even and odd). We can work backward to express all 
terms in terms of dp. In particular, we first rewrite (4.500) as: 


(m+2)(m +1) 


a > m<n-2 4.509 
e (n—m)(m+n-+l1) me ( ) 
Letting m =n-2 in (4.509) gives 
Oy > = qt, (4.510) 
2(2n-1) 


Applying this expression twice, we get 


a MADO=D (ey? MA=DM= BO“. 5 
4(2n-3) 2-4-(2n—-1)(2n-3) 
Application of the recursive formula m times results in 
a5, =(-1)" n(n—1)(n—2)(n—3)---(n-2m+]) a (4.512) 
2-4---2m(2n—-1)(2n—3)---(2n—2m+1) 
With this recursive formula in mind, the function P(x) becomes 
POjaa2" Hiak  He a Hem, (4.513) 
for n even and 
PQ) =2:0" 0,58"? 46,8" 4a.90% (4.514) 
for n odd. These expressions can be rewritten as a single expression: 
M 
1 2 oo 2m+1 
P(x) = a, >a 1)” n(n yn Yn 3) (n m+ ) xt 2m (4.515) 
— 2-4---2m(2n -1)(2n —3)---(2n-2m+1) 
where 
M=n/2, neven 
(4.516) 


=(n-1)/2, nodd 
To further simplify this form, we first derive the following identities: 
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n(n—1)(n—2)---(n—2m +1) 
(n—2m)(n—-2m-—1)-+-3+2+1 


=n(n—1)(n—2)---(n—2m+1) (n—2m)-3-2-1 (4.517) 
n!} 
~ (n—2m)! 
2-4---2m=2" m! (4.518) 
(2n -1)(2n-3)---(2n-2m +1) 
_ 2n(2n—1)(2n— 2)(2n-3)-++(2n—-2m +1) (2n—2m)! 
7 2n(2n —2)((2n—-4)---(Qn-—2m+2)  (2n—-2m)! 
a Ce (4.519) 
2” n(n-1)(n—-2)-+-(n—m+1)(2n-2m)! 
7 (2n)! (n—m)! 
2" (2n—2m)!n(n—1)(n—2)-+(n—m +1) (n—m)! 
_ (2n)\(n—m)! 
2" (2n—2m)!n! 
Substitution of (4.517)-(4.519) into (4.515) arrives at 
P(x) =a, 3 Cox (2n-2m)!n! nam 
“ny (at 2m)!2" m\(2n)\(n —m)! (4.520) 


sa ys (-1)” (n!)° (2n-—2m)! 2m 
" m=0 m \(n ~ 2m) \(n _ m)\(2n)! 


Since a, is an arbitrary constant, we can choose any value for it. For the following 
choice 


_ (Qn)! 


a,= ; (4.521) 
2" (nl) 
we can express the solution as 
M m 
—l)"(2n-—2m)! 
P (x) = > ( ) ( n m) tom (4.522) 


“9 2 m\(n—m)\(n—2m)! 


This is the Legendre polynomial for the case of integer n. The Legendre 
polynomial given by (4.522) is finite for all values of -1 <x < 1. Forn being a non- 
integer, we have P,(x) > «© as x + +1. The first few terms of the Legendre 
polynomials are: 
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Fy(x) =1, 
F(x) =x, 
P,(x)= 5G" 1), (4.523) 


P(x) = 56x" ~3x), 


P,(x) = 505s" —30x* +3) 


The first five Legendre polynomials (except zero order) are plotted in Figure 4.12. 
Since (4.491) is a linear second order ODE, we need to find another independent 
solution in additional to the Legendre polynomials obtained in (4.522). To do so, 
let us assume the other independent solution in the form: 

P= A(x)P,(x) (4.524) 
where A(x) is the unknown function to be found. Again, (4.524) results from a 
standard theorem stated in Forsyth (1956). Substitution of (4.524) into (4.191) 
leads to 


Figure 4.12 Legendre polynomials of the first kind Pn 


(l-x’)(AP," +24'P! + A"P,)—2x(AP, + A'P,) +n(n+1)AP,=0 (4.525) 
This can be regrouped as 
A{(l—x?)P,” —2xP! +n(n+))P,}+(1-x’)(24'P, + A"P,)—2x4'P, =0 (4.526) 
The first term is evidently zero as the Legendre polynomial is the solution of 
(4.491), and thus we are left with 
P, A" 2x 


24+ = 4.527 
Bod tae (4.527) 


n 


This can be rewritten by assuming u = A’ 
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dP, du 2xdx 
2 + 


=0 4.528 
P, u  1-x’ 
Integrating term by term, we have 
In P? +Inu+In(i—x”)=C, (4.529) 
This can be written as 
u= Se ie (4.530) 


dx (1—x°)P? 
Thus, integration leads to the result for A and the final solution results in Legendre 
polynomials of the second kind: 


dx 
On(3) = | pe (4.531) 


As examples, we demonstrate the first few terms of the Legendre polynomials of 
the second kind. For Po(x) = 1, the Legendre polynomials of the second kind of 
zero order are 


dx a L giles 


CNN Ged oly te eg 


= =i +x)-In(l—x)]= =i) 


(4.532) 


For P(x) = x, the Legendre eaaeey of the ee he of first order are 


dx 1 1 
a=] Tae =x|(—5+)de 


l-x" x 
f 


(4.533) 
l+x 


= Fino) -1 


More generally, Legendre ae ae of the second kind can be evaluated as 
(Abramowitz and aoe 1964) 


Q,(0)=5 Py(x)In( =) ye nC) P,m(X) |x]<I 
m= (4.534) 


SPA) YAP, Ppa) f]>1 
m=1 


where P,,(x) = 0. Thus, we have 


1 
Oy (x) = Py(x)Qy(x) Stas Gx? n=) = (4.535) 
5 2 1 1 5 w) 
O3(x) = P,(x)Qp (x) ae rics 3x)In(- =) oo (4.536) 
Thus, the general solution of (4.491) becomes 
P(x) = AP, (x) + BO, (x) (4.537) 


Note, however, that Q,(1) 00 and for solid spheres containing the poles (i.e., 0 
=0,n or x = +1 at poles) we normally require B = 0 based on physical grounds. 
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4.10.2 Rodrigues Formula 


Actually, Legendre polynomials can also be generated by using the Rodrigues 
formula: 
dq” 
2"n! dx" 
The formula was discovered by Rodrigues in 1816, and rediscovered by Ivory in 
1824 and Jacobi in 1827. The current name was given by Heine in 1878 in his 
book. To prove this identity, we can start with the binominal theorem for the 
function under the differentiation sign: 


n (— 1)"n! 28 2m n m 2n-2m 
-1 =) C,C1 4.539 
Nee > ear > iCnt-Ds (4.539) 
Substitution of (4.539): info the left hand side of (4.538) yields 


P.(x)= (x =1)" (4.538) 


n n 1)" yt n 
I d x2 hi _ I ( 1) ni d ona 
Onl ax" 2 nl = miin—m) as" 


The differentiation term on the right hand side is zero if 2n—2m <n and thus we 
must have n < 2m or n/2 < m for even n and (n—1)/2 < m for odd n. Note also that 


a ! 
x? = p(p-(p—2)-\(p=nt lx?" =—P— xP (4.541) 
dx" (p-n)! 
Thus, differentiation of (4.540) becomes 
d" 2n-2m (2n ~ 2m)! n-2m 
x = x 
dx" (n—2m)! 
Back substitution of (4.542) into (4.540) gives 
n M = m — ! 
1 d (x? =)" = > ( 1) (2n 2m)! 2m _ P (x) (4.543) 
2” n!\ dx" “29 2 m!\(n—m)!(n—2m)! 


where MM is defined in (4.516). This completes the proof of the Rodrigues formula. 
Legendre polynomials are of fundamental importance in the analysis of spherical 
coordinates, and are the marrow of spherical harmonics. They also provide the 
basis of eigenfunctions for spheres. 


(4.540) 


(4.542) 


4.11 ASSOCIATED LEGENDRE EQUATION 


For the case that m # 0 in (4.490), it is the associated Legendre equation: 


dw. dw 
fx" 2x—+4[n(n+1 4.544 
(ax?) 2 + Io (4.544) 
Let us start from the Legendre function (i.e., m= 0) that 
dV. dV 
1-x? 2x— +n(n+l)V =0 4.545 
(l-x") Pe " (n+) (4.545) 


Differentiating this equation m times, we have 


296 Theory of Differential Equations in Engineering and Mechanics 


m 2 m m 
d id ei s d O28 +n(n4 i E=6 (4.546) 
dx" dx dx" dx dx" 
Recalling the Leibniz rule for differentiation, we have 
m m ¥ m-r m ! r m-r 
d may eaee r=) m! dud” 'y (4.547) 
dx” ar dx" dx” = (m —ry)!r!} dx’ dx” 
Only a finite number of terms remain in (4.546) by recognizing u and v as 
2 
u =(1—x’), —— (4.548) 
dx 


Thus, we have 


m 2 m+2 m+1 = m 
ae ee “j=( gg rT asic) 
dx dx dxit*2 1! dx! 2! dx 
m m+1 m 
#6. 52" ao (4.550) 
dx™ dx dx! 1! adx™ 
Finally, we have 
2 
(1 eyo as 2x(m4 ee +[n(n+1)—m(m+)]U =0 (4.551) 
dx? dx 
where 
yoo 2 (4.552) 


dx” ax” 
The last part of (4.552) is obtained by recognizing that the solution of (4.545) is the 
Legendre polynomial. 
Let us now introduce the following change of variables 


W=(-x°y"U (4.553) 
Thus, the derivative terms of (4.552) can be expressed as: 
OF = mel?" 4) (4.554) 
dx abe 
2 
a’ = = mx(1—x? "2-7 WI — x7) + (m+ 2)x7]+2mx(1— x7)" an 
“ 2 * (4.555) 
+(l— x2)? d ii 
dx 


Substitution of (4.554) and (4.555) into (4.551) gives 

dw . dw m 
le 2x—+[n(n+1 W=0 4.556 
(1x?) 284 [nn 1-5) (4.556) 


This is precisely our associated Legendre equation. Thus, finally we have the 
solution of (4.556) as the associated Legendre polynomial: 
dP 
P(x) =W= a op ua 8 _ ary?" : nO) (4.557) 
x 
This associated Legendre function of the first kind is also known as Ferrers 
functions, proposed by British mathematician Norman Ferrers (Olver, 2012). 
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Clearly, the zero order associated polynomials (1.e., m = 0) become the Legendre 
polynomials. It is straightforward to see that this procedure is equally applicable to 
the associated Legendre function of the second kind, and thus we have 


m/2 d”Q, (x) 


m 


O” (x) =W =(1-x°)"U =(1-x”) 


(4.558) 


Combining these two solutions, we have the general solution for the associated 
Legendre equation as 


W(x) = AF," (x) + BO? (x) (4.559) 
Similar to the observations for Legendre polynomials, for solid spheres containing 
the poles (i.e., 8 =0,2 or x = +1 at poles) the associated Legendre function of the 


second kind becomes infinite. Thus, for such problems we normally require B = 0 
based on physical grounds. 


4.12 HYPERGEOMETRIC FUNCTION 


The hypergeometric series or functions had been studied by mathematicians before 
its governing equation was known (or called the hypergeometric equation). It has 
been considered by Wallis, Euler, Gauss, Kummer, and Riemann. Its application is 
related to the analyses of a weightless cable containing point masses, of particle 
physics, and of fluctuation in electric circuit. The term hypergeometric series was 
coined by J.F. Pfaff (1765-1825), who was the advisor of Gauss. Unlike Bessel 
functions, the hypergeometric function is one of the topics of special functions that 
has been commonly left out in the syllabus of engineering mathematics. In the 
bending theory of a shell having a surface of revolution subjected to axisymmetric 
loadings, the hypergeometric equation and function appear naturally (e.g., Chapter 
16 of Timoshenko and Woinowsky-Krieger, 1959). However, such topics are not 
normally covered even in graduate courses in "the theory of plates and shells" 
because the hypergeometric function is considered too complicated for graduate 
students or even for engineering professors. In fact, if we do not go too deep into 
the mathematical theory of hypergeometric function, it is not so formidable. In this 
section, we will illustrate that it is just another series solution with three parameters 
and one variable. If one tries to find the analytic solution of a complicated integral 
using the software Mathematica, it is very likely that Mathematica’s solution will 
involve the hypergeometric function. Therefore, one should be familiar with this 
special function. 


4.12.1 Frobenius Series Solution 


The following hypergeometric equation was discovered by Gauss: 

z(l—z)y"+[y-(a+ B+)z]y'-aBy =0 (4.560) 
where o, B, and y are parameters of the equation. Near z = 0, let us seek a solution 
in terms of the Frobenius series 


y=2" (ay +ajztayz” +. Fae +0) (4.561) 
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Thus, we have 
y' =Aagz* + (A4Na,z* +(A42)agz""" +...4(A+ba,z* 41 +... (4.562) 
y" =A(A=Nagz*? + A(At Nayz* + (A+ 2A +4)ayz* 


+..+(A+hK\k+A-Daz,z***? 4... 
Substitution of these into (4.560) results in 
z(1—z)[A(A—Nayz*? + A(A41)ayz* | + (A4+2)(A 4)ayz* 


+..4(Atkh\(k+A-laz,z**? +.) 
Hy —(a+ B+1)z] {Aagz*? +(A+)ayz* +(A+2)a,2*" (4.564) 


(4.563) 


k+A-1 


tut Atkja,z +...} 


~af(ayz* +a)2z*"| +52"? +...4.a,24* +...) =0 
Collecting the coefficients for z*"', we have 
A(A-1+y7)=0 (4.565) 
There are two solutions for the index 2 
A=0, A=1-y7 (4.566) 
Collecting the coefficients for z***, we have 
[(A+k+)A+kh)+7A+k+ Jaz.) 


=[(A+hAtk-)+(at+ft+DA+k)+ aB a, 
For the case of A = 0, the recursive formula for the coefficients is 
(a+k\(B+k) 
Cee =e aN 
(A+1)(k+7) 

We further set ao = 1, and the first solution is Gauss hypergeometric series 
op ea+)BB+l) >, is69) 


ly L2yy4D 

where F(a,B,y,z) is called the hypergeometric series or hypergeometric function, 
and the first three arguments indicate the indices of the ODE and the last one is the 
variable. This hypergeometric function can be defined in a more compact form: 


(4.567) 


a (4.568) 


yv=y, =F(a,B,7,2) =14 


. (2). (Pek 
F(a, B,y,z)= ) ——~~“2z (4.570) 
2, (Y)xk! 
where the Pochhammer symbol is defined as 
CO Cee ne eae (4.571) 
T(a@) 


It was proposed by Prussian mathematician Pochhammer, and its name was coined 
by Appell in 1880. Thus, we can define the hypergeometric series in terms of the 
gamma function 


ee 362) T(atkl(B+k) z* 
F(a, B,7,2) rare 2 aor ae (4.572) 


For the second root of A in (4.566), A = 1—-y, we have 
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y=y, =2''F(a-ytl,B-ytl,2-y,2) (4.573) 
Therefore, the general solution of the hypergeometric equation is 
y = AF(a, B,y,z)+ Bz!’ F(a—y +1, B-y +1,2-7,2) (4.574) 
This solution is only valid for 
-l<z<l, y-(a+f)>-1l (4.575) 


Note that the differentiation of the hypergeometric series is 
d 
£F(a,p.72)= Eras p+by +2) (4.576) 
Zz y 
To show this, can find that 


Pe prigisats (@+I(B+l) (ata +2V(B+W(B+2) 9 | 
oa v+l 1-2-(y+I)(y +2) 


(4.577) 


af 


Seti pag ee, 
y 


vr+)) 
a+ Wat QB(B+B+2) 2 
1-2-y(y +7 +2) 
The function on the right hand side is precisely the derivative of the 
hypergeometric series. This completes the proof. 


(4.578) 


4.12.2 Confluent Hypergeometric Function 


A related function is called the confluent hypergeometric function or Kummer 
function, which is defined as: 
ioe} 


GP CIN eo >= (4.579) 
yl! yv(v+t) 2! a Ont! 


which was introduced by Kummer in 1837. It can be obtained as a special case of 
the hypergeometric function that 
Brow z>0, Pz>-p (4.580) 


Using the following change of variables 


M(a@,y,p)=1+ 


p 
zZ=— (4.581) 
B 
we have 
d dy d° d° 
tug, SY git (4.582) 
dz dp dz dp 
Substitution of these into (4.560) gives 
Pi Py pd y Pip 
(=) +[y-(a+ B+1)—|B —— aBy =0 (4.583) 
BB dp” B dp 


This can further be simplified as 
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2 
p(B-p)<% +98 (a+ B+ p| apy =0 (4.584) 
dp dp 


Finally, we can take the limit B — oo and we obtain the following confluent 
hypergeometric equation or Kummer’s equation 
2 


d*y dy 
+(y-p)——-ay=0 (4.585) 
"ap ap 


The second independent solution of this second ODE was introduced by Tricomi in 
1947 
rd 

d=7) Ce aa RG dee |) 
T(a-y+l) (a aa 
which is also known as the Tricomi function. The hypergeometric series also gives 
the solution of Riemann’s P-equation or the Papperitz equation (see chapter 10 of 
Whittaker and Watson, 1927). 

The appearance of the solution form in (4.574) is not unique. If the range of z 
is not constrained by (4.575), the solution of the hypergeometric equation will 
appear in different forms. The next section will present the classification of 
solutions proposed by Kummer. 


O(a, 7,P) = p’M(a-y+l,2-y,p) (4.586) 


4.12.3. Kummer’s Classification of Hypergeometric Series 


Actually, there are three singular points at z = 0, 1, © for the hypergeometric 
equation. The solution near singular points z = 1, and z > oo will appear 
differently. Near each of the three singular points, there are always two linearly 
independent solutions. There are many different ways to express these solutions. 
The following Kummer classification is presented by Goursat in his thesis and was 
translated to English in Chapter 7 of Craig’s (1889) book. For the solution near the 
point z = 0, we have two characteristic roots (as given in the last section), and the 
situation is the same for indicial equations at the other two singular points. The 
indicial equations at z= 1 and z = ~ are respectively: 


Mu-yt+atPB)y=0 (4.587) 
r(r-l)-r(at+ B-1)+afB=0 (4.588) 
Similar to (4.565), these equations are also quadratic and thus there exist two 


solutions. This is summarized in Table 4.3 below. To show the validity of (4.588), 
we introduce the following change of variables 


= (4.589) 


u 
Table 4.3. Roots of the indicial equation near different singular points 


Singular point Roots of indicial equation 
M =0 1) = 1-y 
1 Li=0 ba= y-a-B 


oO FLO r2=8 
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Note that the singular point z > oo is mapped to u = 0. The derivatives of the new 
variables are: 


dy _ dy du | pe 


(4.590) 
dz du dz du 
2 2 
PLS pO 598 ey (4.591) 
dz? du du’ dz du du? 
Substitution of these derivatives into the hypergeometric equation (4.560) gives 
d 
w?(u-1)—* +ul(at B-I)—-(y-2ul] Bs op 6 (4.592) 
du du 


Thus, a power series in u about u = 0 corresponds to the singular point z > o. 
Near u = 0, let us seek a solution in terms of the Frobenius series 


y=u' (ay tautau’ +...tau" +...) (4.593) 
Thus, we have 
y =ragu’ | +(r+ Dau" +(rt2)agu" +..4(r+ haut +... (4.594) 
yp" =r(r—-Dagu’? +r(r + Nau’! +(r+2)(r+ Dau” oo 
+4(r+k\rt+k—-Nau' +... 


Substitution of these into (4.592) yields 
u?(u—l)[r(r lagu’? +r(r + Dau" +(r+2\(r + Dau" 
+..4(rth\(kt+r—Nawui 


+u[(a + B-1)—(y —2)u] frau” + (rt law" +(r+2)au™™ (4.596) 


+...] 


+..4(r+ka,ui 


+...} 
-~aB(au’ tau"! +a,u"*? +...4a,u"™ +...) =0 
0 1 2 k 


Collecting the coefficient of the u” term, we obtain for nonzero ao 


r(r-l)-r(at+ B-1)+afB=0 (4.597) 
This is precisely (4.588). The two roots are r = @ £. For the case of r= a, we have 
(a+k)1+k) 


(4.598) 


Ags 


Bk—l)—a(k +1) —(k-D(k+1) 
To show the validity of (4.587), we introduce the following change of variables 
z=l-u (4.599) 
Note that the singular point z = | is mapped to u = 0. The hypergeometric equation 
becomes 


2 
u(L ns » t[(at+ B+1-y) u(a+ B+I))”—apy=0 (4.600) 
lu u 
Near u = 0, let us seek a solution in terms of the Frobenius series 
y=u" (ay tautayu’ +...+a,ué +...) (4.601) 


Thus, we have 
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y= Magu +(+ lau" + (ut 2)anu" +...4 (ut bau** 1 +... (4.602) 


y= uu Maguh? + wet Nau"! + (+ 2+ Nau" 


ae (4.603) 


tit (utky(k + u-la,u hase 

Substitution of these into (4.600) leads to 
u(l—u)[ ue lagu” + wut Nau" + (0 +2)(ut aru" 
+..4+(utkh(k+ ula! 


+H(at+ B-ytl)-ulat+ B+) ]{uagu" | + (ut Yau" + (+ 2)ayu“*! (4.604) 
k+u-1 


+...] 
t..+¢(u+k)a,u +...} 


—aB(ayu" +aué! + au"? +...4a,u"** +...) =0 
Collecting the coefficients of the u’ term, we obtain for nonzero ao 


Muta+B-y)=0 (4.605) 
This is (4.587). The two roots are = 0 and y—-a—f. For the case of 4 = 0, we have 
A = sala 2d a (4.606) 


(k+D(k+B+a-y+l * 
In general, the solution forms near the singular points can be expressed in the 
solutions summarized in Table 4.4. 


Table 4.4. General solution of hypergeometric functions near different singular points 


Singular point General solution form of Frobenius series 
0 y=C,2*U, +C,2*U, 
1 y=C(l-z)"V,+C,0-z)”V, 
co yeQz"W, +z", 


Therefore, there are six types of solution forms, and they are U1, U2, Vi, V2, Mi, 
and W2, which, we have the following functional forms 


U;,U, =U,(z),U2(2) (4.607) 
VV, =V(2-D,Vy(z-D (4.608) 
W,,W, =W,(1/z),W,(1/z) (4.609) 


They can be expressed in terms of hypergeometric functions. The hypergeometric 

equation can be rewritten in different forms, depending on whether the solution is 

being sought near z = 0, | or o. There are six different ways of transforming an 

ODE to the standard form by the following transformation: 
va! +b 


~ cutd 
Mapping of this form is also called bilinear transformation. More specifically, we 
can transform the differential equation to different forms by using the mapping 
shown in Table 4.5. 


(4.610) 
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Table 4.5. Six mappings that can rewrite the differential equations in different forms near 3 
singular points 


Mapping Value of u 
z=0 z=1 Z=o 
Z=u 0 1 00 
z=u/(u-1l) 0 00 1 
z=l-u 1 0 00 
z=(u-l)/u 1 00 0) 
z=1/(1-u) 00 0 1 
z=l/u 00 1 0 


The third and sixth mappings have been applied earlier to obtain the 
hypergeometric equations near the singular at x 0 and x = 1. For the solution 
near each singular point, we can express the solution as: 

y=z*(1-z) f(z) (4.611) 
where f(z) is the hypergeometric function of certain parameters. Since there are two 
roots for both A and u, four ways of expressing the solution exist. Each solution of 
these different forms can be represented by six different forms (see (4.407) to 
(4.409)). According to this system, we have 24 solution forms. This is called 
Kummer’s 24 solutions for hypergeometric equations. The results are summarized 
in Table 4.6 for the solutions near the singular point z = 0, in Table 4.7 for the 
solutions near the singular point z = 1, and in Table 4.8 for the solutions near the 
singular point z > ©. 

The first four Kummer solutions given in Table 4.6 are equivalent whereas 
the fifth to eighth are equal. That is why they were grouped into solutions y; and yo. 
Near the point z = 0, the general solution of the hypergeometric equation is 

y = Ay,(2)+ By, (2) (4.612) 
Altogether we have six groups of solutions with four each as shown in Tables 
4.6—4.8. Thus, a total of 24 solutions can be obtained. 

Near the point z = 1, the general solution of the hypergeometric equation is 


y = Ay3(z)+ By,(z) (4.613) 
Near the point z > ©, the general solution of the hypergeometric equation is 
y = Ays(z) + Bye(z) (4.614) 


It can be shown that all solutions of 3, v4, ys, and ye are dependent functions of 
and y2 (Craig, 1889). 


4.12.4 Hypergeometric Series versus Other Functions 


The hypergeometric series is considered as one of the most general functions that 
can embrace many functions, which can be expressed in terms of infinite series 
with rational coefficients, as its special case. Here are some examples: 


F(-n, B, B,-x) =(1+x)" (4.615) 
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lim F(1,n,1,x/n) =e" (4.616) 
n—-oo 
Fd, £, B,x) =1/(1-x) (4.617) 
fii. (4.618) 
Zz 
lim F(m,n,1/2, z/ (4mn)) = cosh z (4.619) 
nym—o 
iin KOS Oe VaR (4.620) 
n,m—o 
lim F(m,n,3/2,-22 /(4mn)) ==? (4.621) 
nym—->o Zz. 
Table 4.6. General solution of hypergeometric functions near z = 0 
Number _| Solution Type region 
1 F(a, B,Y,2) aa z=0 
2 (l-z)"**? F(y-a@,y—B, 7,2) 
3 lag)" Fy = 9.9527 (2=1)) 
4 (l-z) *F(B,y—a,7,z/(z—-))) 
5 z’’F(a—-y+l,B—-y+1,2-7,z) Vy z=0 
6 z'(1-zy ** Fd-a,1-B,2-y,2) 
7 z’?(1—z)"*" F(a—y+11-£,2-y7,2/(z-1)) 
8 z'7(1-z) ?" F(B-y4+1,1-a,2-y7,2/(z-])) 
Table 4.7. General solution of hypergeometric functions near z = 1 
Number | Solution Type | region 
1 F(a, Pp,a+ B-yt+l,1—-z) V3 z=l1 
2 z'’F(a-y+l,B-y+la+B-y+l,1-z) 
3 z°F(a,a-yvt+lat+f-yv+1(z-)/z) 
a z*F(B,B-y+l,a+B-y+1,(z-1)/2) 
5 =z)" "Fy ay -p.7-@-P+1,1=2) yy | z=l 
6 (l-z)’**z'’Fd-a,1-B,y-a-B+1,1-z) 
7 (l-z)”**z*’F(l-a,y—a,y—a—B+4+1,(1—-z)/2) 
8 (l=zy* "2" "Fl =£.7- B.y-@=6 +1,0=2)/z) 
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Table 4.8. General solution of hypergeometric functions near z > 0 


Number | Solution Type | region 
1 z°F(a,a-y+la—-£+1,1/z) Vs Oe oe 
2 z (1-1/2) ** Fd-£,y-B,a—B-y+11/z) 
3 z*(l-1/z)* F(a,y—B,a—£+1,1/(1-2z)) 
4 z*(1-1/z)"*" F(a—y+1,1—f,a—641,1/(1—z)) 
5 z°F(B,B-v+1,8-atl11/z) Ms se 
6 z*(l-1/z)"** Fl-a,y—a, B—a+l,1/z) 
7 z*(-1/z)" F(6,y—-a, B—at1,1/(1—-z)) 
8 z*(-1/z)*'F(6-y+11-a, B-—a+1,1/(—-z)) 

lim F0n,n,1/2, ~z’ /(4mn)) =cosz (4.622) 

F(-n,n+1,1,(1—x)/2) = P,(x) (4.623) 
(n—m)!2"m! on 


F(m-n,m+n+lm+b(1—x)/2)= 


(n+m)\(—x?)""? Fy (x) (4.624) 


F(-n,n,1/2,(1—x)/2) =T, (x) (4.625) 
U,, (x) 


v1-x? 


where 7,(x) and U,(x) are Chebyshev polynomials of the first and second kinds 
(Abramowitz and Stegun, 1964). 


F(-n+1,n+1,3/2,(1-x)/2)= (4.626) 


4.13 GENERALIZED HYPERGEOMETRIC EQUATION 


A related function is called the generalized hypergeometric function and is defined 
as 


= hide) ‘(Oe 
(B,)- (Pr), Bader! 


= — the co generalized hpargcomeri ie 


F,,(Q,@p,.. Ons Bi, Pr-B, (4.627) 


Lolth —1)-- +B, -l- nora) ot 4 mY =0 (4.628) 
This Ou can be recast into es form as 


Loe y-sf]oray)oo-0 (4.629) 


j=l j=l 
where the differential operator D is the Euler derivative and is defined as: 
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D=x £ (4.630) 
dx 

Note the following special property of this operator 

Dx" ag Ghat (4.631) 
dx 
We now assume a power series for y in (4.629) 
y(x) = yaa (4.632) 
k=0 


Substitution of (4.632) into (4.629) we have 


» Hts —le, Perf =0 (4.633) 


k=0 j=l j=l 


Thus, the recursive formula for the two successive constant coefficients is 
m 


[ [+2 
Gg == Oj (4.634) 
k | (k+B,-1) 
j=l 
This recursive formula made it possible to formulate the series solution without 


much difficulty. Reapplying this recursive formula n times, we have 
m m m 


[[@+2)[ [@-1+2)--[]a+e) 
j=l j=l j=l 


Cy, = — - ——¢y (4.635) 
kk] [+ B, -)] [&+2; -2)..[ [B; 
j=l j=l j=l 


Reshuffling the terms and setting co = 1, we have the very compact form of cx in 
terms of the Pochhammer symbol as 


m 


[]@: 


¢, =| — (4.636) 
a Bay 
j=l 
Finally, substitution of (4.636) into (4.632) leads to 


—are 1 (Q) (Mp), (Gy) 
V(x) = \ xk = r mr 
k=O eT] (B.), = Pade Taal! (4.637) 


=m BQ Og 530 ys Bis Pas Bas X) 
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The compact notation given in (4.637) was introduced by Pochhammer in 1890 
whereas the notation of ,,F;, with one subscript before and one after F' was 
introduced by Barnes in 1907. This verifies that the solution of ODE (4.628) is 
indeed the generalized hypergeometric function given in (4.627). 

For the special case of m = 2 and n = 1, we have the hypergeometric function 


Se ee — (a,), (Ay), _ Say, t+NT (ay + r7)T(B,)x" 
ee 2, (B),7! 2 Ta )(an (A +r)r! (4.638) 
= F(a, Q, f,,x) 


For the special case of m = 1 and n = 1, we have the confluent hypergeometric 
function or Kummer’s function M(a,B,y) being recovered 


(a,),x" ST (a, +)T(B)x" 
Fi (%3 Bx) = > “ (B), “ee T(a@)I(B,+nr! (4.639) 


= M _ »f%) 
There are also a number of important functions that can be expressed in terms of 
the generalized hypergeometric function 


Fi(n4+1/2,2n+1,2ix) = y'nteS,(2) (4.640) 
F(-n1/2,x*) =(-1)" a eran (4.641) 
iFi(-n, 3/2, x 2) = (- 1)” i Sai (x) (4.642) 


where H,(x) is the Hermite polynomial (Abramowitz and Stegun, 1964). Note there 
is symmetry between a and £ for 2F) that we obtain 


F(a, By.) =) Fy(a, Bayo) = yee (4.643) 
r=0 res 


4.14 MOVABLE SINGULARITIES AND PAINLEVE EQUATIONS 


In Section 3.2.8, we have seen that the Riccati equation can be converted to a linear 
ODE. Fuchs realized that the Riccati equation only has movable poles. Whenever a 
nonlinear ODE that has a pole as its only movable singularity, we say that this type 
of ODE is the Painlevé type. More importantly, it was found that only this kind of 
nonlinear ODE with movable singularities can be transformed to a linear ODE, just 
like the Riccati equation. By definition, a singularity is called a movable singularity 
if its location depends on the initial conditions of the ODE. That is, the location of 
the singularity is not fixed solely by the coefficients of the ODE. 

To illustrate the idea of the movable simple pole, we consider the following 
ODE: 
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Mew, wz =0)=m (4.644) 
dz 
It can be shown that the solution of (4.644) is 
ae (4.645) 
a Zo 


We see that the location of the singularity is at zo, which is a function of the boundary 
condition wo and is not fixed by the ODE alone. 

The Painlevé type of nonlinear ODEs are known to be linearized and perhaps 
can be solved exactly. Indeed, Painlevé conducted an extensive analysis on what 
kind of second order nonlinear ODEs can be convertible to linear ODEs, like the 
Riccati equation. In particular, Painlevé investigated the following second order 
ODE: 


2 
We ys (4.646) 
IZ 


He examined the second order ODE of this form that only has poles as moving 
singularities. He found a total of fifty ODEs, and all of them can be reduced to (a) 
linear ODEs, (b) Riccati equations, (c) equations satisfied by elliptic functions, and 
(d) six "new" equations. Painlevé discovered that these six equations are not 
reducible to "known" differential equations. These are called Painlevé 
transcendents: 


@ 


Pr a =6w' +z (4.647) 
Z 
2 
Pir: ‘ ~ =2w > +2w+a (4.648) 
dz 
dw (Wy w (aw+b) 3 d 
Pim: = + +cw +— (4.649) 
dz” Ww vA Zz Ww 
dw (wy) 3w i eB b 
Py: = +4zw +2(z° —a)w4 (4.650) 
dz” 2w w 
2 
1 1 
be 22 P= wy os as (aw+ >). 2 ae” (4.651) 
dz 2w w-l w-l 
2, 
fw ato, 4 1 Jw")? by Ly 1 Jul 
Put dz w w- Ww-Z Z 2= W-Z (4.652) 
ia 1)(w-2) bz c(z-V) _ =) 
+ See vs 5 
Zz “(n= ie w (w- 1° Cee) 


where w'= dw/dz and a, b, c, and d are constants. These equations have been shown 
to be convertible to linear integral equations. But such transformations involve 
complicated complex analysis and are out of the scope of this chapter. Recent 
research shows that some very important nonlinear PDE can be reduced to Painlevé 
transcendents. They include the KdV equation, Sine-Gordon equation, and 
Boussinesq equation, and these are all soliton types of equations. If any PDE can 
be reduced to Painlevé transcendents, we say that it satisfies the Painlevé property. 
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More importantly, they can be solved by using “inverse scattering transform” and 
their solutions are of the soliton type (Ablowitz and Clarkson, 1991). 


4.15 SUMMARY AND FURTHER READING 


The series solution method is a major topic in the theory of ODE. The fact that a 
solution of a second ODE can be expressed in terms of an infinite series allows the 
evaluation of the solutions. Many so-called "special" functions are actually a 
particular type of series solution of ODE, including the Bessel functions, modified 
Bessel functions, Kelvin functions, Legendre polynomials, associated Legendre 
polynomials, hypergeometric functions, confluent hypergeometric functions, and 
general hypergeometric functions discussed here. The study of ODEs with regular 
singular points leads to the investigation of the Fuchsian type ODE. Among these 
special functions, the general hypergeometric function appears to be the most 
general and powerful, and it covers nearly all of the special functions as its special 
case. If you try to do complicated integration by a symbolic manipulation program 
such as Maple and Mathematica, it is very likely that you could get an analytic 
result in terms of some sorts of general hypergeometric functions. This is one of 
the major reasons that it should be covered in a chapter on series solutions. More 
discussion on hypergeometric functions can be found in Craig (1889), Bateman 
(1918), Piaggio (1920), Copson (1935), Poole (1936), Erdelyi (1953), Spiegel 
(1968), Lebedev et al. (1965), and Lebedev (1972). 

The investigation of special functions probably started with Bernoulli and 
Euler in 1700s. They include elliptic integrals and Bessel functions. We have given 
an introduction on elliptic integrals in Chapter 2 when we discussed the pendulum 
problem. Further information on Jacobi’s elliptic integral is also given in Appendix 
A. Euler introduced gamma functions as a non-integer continuation of factorial and 
studied elliptic integrals related to pendulums, and Bessel functions related to 
vibrations of circular drums. Nearly all of his investigations are driven by everyday 
applications. Related to celestial mechanics and potential theory, Legendre 
polynomials emerged. There are also other special functions in terms of 
polynomials, such as Hermite polynomials, Laguerre polynomials, Chebyshev 
polynomials, and Jacobi polynomials. Because of the orthogonal properties of these 
polynomials, they are important as the basis of eigenfunction expansion (see 
Chapter 10). Some of them relate to the theory of probabilities, quantum mechanics 
and wave scattering theory. 

One of the best coverages of special functions remains the classic book by 
Erdelyi (1953) (i.e., the Higher Transcendental Functions of the Bateman 
manuscript project). Other textbooks on special functions include Sneddon (1961), 
Lebedev (1972), and Bell (1968). For Bessel functions, the number one authority is 
still the classic book by Watson (1944). Serious readers should consult these 
books. The number one reference book on special functions is Abramowitz and 
Stegun (1964) and its updated version by Olver et al. (2010). It was reported by 
Biosvert and Lozier (2001) that Abramowitz and Stegun (1964) was cited more 
than 2000 times in 2009 alone by journal articles. In view of the importance of the 
use of special functions in engineering and industry, the National Institute of 
Standards and Technology (NIST) also published an online version called the 
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Digital Library of Mathematical Functions, which provides a link to software for 
evaluating many of the known special functions (http://dlmf.nist.gov/software/). 
The handbook of Gradshteyn and Ryzhik (1980) also compiled many results of 
integrations related to special functions. 


4.16 PROBLEMS 
Problem 4.1 Prove the following reflection formula for the gamma function 
1 1 
hare = — (4.653) 
2 2 COS 2Z 


Hint: Use appropriate substitution of the argument into the original reflection 
formula. 


Problem 4.2 Prove the following reflection formula for the digamma function 
1 1 
Wore i tae (4.654) 


Hint: Use the result of Problem 4.1. 


Problem 4.3 Prove the following identity for the gamma function 


PCr v2n (4.655) 
4 4 (-1)" 
where n is an integer. 
Hint: Use the reflection formula. 
Problem 4.4 Prove the following identity for the digamma function 
1 3 1—tan zx 
Wi +x)-W(F x) = —a( ) (4.656) 
4 4 1+tan 2x 
where n is an integer. 
Hint: Use the reflection formula. 
Problem 4.5 Show that 
1 
ve-n) a at (4.657) 
where n is an integer. 
Problem 4.6 Show that 
1 
ta) ae ee re es (4.658) 
T(z-n) 


where v is an integer. 


Problem 4.7 Show that 
1 
I(- BL =-2/a (4.659) 


Series Solution of Second Order ODEs 311 
Hint: Use the recurrence formula. 


Problem 4.8 Show that 


We 


rR >)= 2 2oN7. (4.660) 


Hint: Use the recurrence formula. 


Problem 4.9 Show that 
WG) =-7-2In2+2 (4.661) 
Hint: Use the recurrence formula. 
Problem 4.10 Show that 
WG) =-y—-2In2 (4.662) 
Hint: Use the recurrence formula. 


Problem 4.11 Show that 


640, z 
1e pos =) (4.663) 
iW fi 


Hint: Use the multiplication formula of Gauss with m = 3. 


Problem 4.12 Prove the following identity for the n-th derivative of the digamma 
function 


s 7 —1)"n! 
y (z+) =y' y+ (4.664) 
Zz 
Hint: Use the recurrence formula. 
Problem 4.13 Prove the following identity for the gamma function 
ly (41/ny 
T(z)= 4.665 
®) Zz I] (1+z/n) ( ) 


Hint: Try to show that the right hand side equals the factorial definition given in 
Section 4.2.2. 


Problem 4.14 Prove the following identity for the gamma function 


rares=-—— (4.666) 
ZSIN 77Z 
Problem 4.15 Prove the following identity 
1 
1 oe ee ds (4.667) 


2 2 1 x MX 
I —+— = 
G ai 2) 
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Hint: Use the reflection formula for the gamma function. 


Problem 4.16 Show that U and V defined in (4.146) can be expressed as 
Qx/2 Qx/2 ] 
U =-—sin(% EG *) Vat 09 r(— +4) (4.668) 
a 1 2° 2.2 
Hint: Use the reflection noah . the gamma function and the result of Problem 
4.15. 


Problem 4.17 Use the results obtained in Section 4.3 to prove the following 
formulas for the integer argument of the Hadamard factorial function: 


2m 2m+1 
Hm) = N72 ro) , HQm+l)= v2 au (4.669) 
; r(-m) r(,-m) 


where m = 1,2,3,... 


Problem 4.18 Prove the following formula of the Hadamard factorial function: 


A@HED= aie sinzx, g(x)g(1—x) (4.670) 


mx x(1—x) 
where P(x) and g(x) were defined in (4.191). 


Problem 4.19 Prove the following formula: 
Td+x)Td-x) = g(x)+g(-x)-1 (4.671) 
where g(x) were defined in (4.191). 


Problem 4.20 Prove the following formula: 
1 2 
reLeyyre i) = (4.672) 
4 4 coshzy+isinhzy 


Problem 4.21 Prove the following asymptotic formula as y > oo 


Re[InIT'(iy)] - = In(2n) - sin yr sy (4.673) 


Problem 4.22 Prove the following formula 


v(m)! 


T(m+ = (4.674) 
2?" m! 
Hint: Use the Legendre duplication uae 
Problem 4.23 Prove the following formula 
pl! 
Pim+=)= a ae (4.675) 


where 
(2m —1)!!=1-3-5++-(2m—3)(2m -1) (4.676) 
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Problem 4.24 Prove the following formulas 


if sin(y Int)e“‘dt = (TW) +T(-)} (4.677) 


in sinh(yInt)e“dt = strc y)+T(-y)} (4.678) 


Problem 4.25 Use the definition of the Pochhammer symbol to show that 
i 
i (—n), =(n+r)!/n! (4.679) 
(ii) 
(1), =r! (4.680) 
Problem 4.26 Show that the following hypergeometric equation can be converted 
to a Riccati type equation 


d°w dw 

e-D TY Hy (a+ B+ —afw=0 (4.681) 

by 
"s a, P,é)dé 
wae re (4.682) 

Ans: 

MS gee) PAPA, OF ng (4.683) 

dt t(t-1) (t-1) 


Problem 4.27 Prove that the error function discussed in Section 4.2.8 can be 
expressed as: 


2 £2 2 
erf (2) == } exp(-1?)adt (4.684) 
Vix #0 
Problem 4.28 Prove the Gauss multiplication formula given in (4.76): 
T(nz) = (22) 0? n™ 1? T (2) (z+ re + ey T(z+ a (4.685) 
n n n 
(i) First show that this is equal to 


n-l 
T(nz) = (27)ePneeTT fee (4.686) 
k=0 a 
(11) Use the recursive formula given in (4.53) and Euler’s factorial form (4.25) to 


show that 
m IZ tkin-l 


Pee) tim (4.687) 
n 


m>o k k k 
z+— |) z+—41}---} z+—--l+m 
n n n 


(iii) Use the Stirling formula given in (4.109) to show that 
k : J2m(mn |e)" m7*k/ "V2 
T(z+—) = lim 
n m0 (nz+k)(nz+k+n)---(nz+k—n+mn) 


(4.688) 


(iv) Take the product function of (4.686) from & = 0 to n—1, and change the index 
by replacing mn with n to show 
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[[re+t 7 ibe (J22) (m/e) m2 hla 4 689) 
ea m0 (nz+k)(nz+k+n)+(nz+k—-n+mn) 


(v) Reapply the Stirling formula given in (4.109) and Euler’s factorial form given 
in (4.25) to show 
n-l 
[[re+* =) =(22))? n!" (nz) (4.690) 
k=0 
(vi) Finally, prove (4.685) by using (4.690). 


Problem 4.29 Using (4.167) and (4.159) to show that 
H())=1 (4.691) 


Problem 4.30 Solve the following ODE by ie a change of variable of z = x? 


d’u 1 du 
—+—— 44g? a= 0 4.692 
ere ) (4.692) 
Ans: 
u = AJ, (x*)+ BY, (x”) (4.693) 
Problem 4.31 Solve the following ODE by assuming a change of variable of z = 
x1? 
du ldu 1 v? 
+ + 1 u = 0 4.694 
dx? xdx ag x ) ( ) 
Ans: 
u = AJ, (Vx) +BY, (Vx) (4.695) 


Problem 4.32 Solve the following ODE by assuming a change of variable of z = 
Bx" 


d’u 1 du va 
es +o tax" -2 a ju=0 (4.696) 
Ans: 
u= AJ, (Bx*)+ BY, (Bx*) (4.697) 


Problem 4.33 Solve the following ODE by assuming two change of variables: 
(i) Z=x"?u 


(i) G=x!” 
2 —t 
cleats cen (4.698) 
dx x dx 4x 
Ans: 
w= x"? (AS, (vx) + BY, (Vx)] (4.699) 


Problem 4.34 Solve the following ODE by assuming two change of variables: 
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(i) Z=xPYy 
(ii) C = yx? 
2 — — 
ae = ie “ (ByxP ay? +2 ee 26 (4.700) 
XxX 
Ans: 


u=xP"*T AS, (yx?) + BY, (yx )] (4.701) 


Problem 4.35 Solve the following ODE by assuming two change of variables: 
(i) Z= Vxu 


(li) y = Bx 
du a 4s! 
+ (8 Ju =0 (4.702) 
dx? 4x? 
Ans: 
u=x[AJ, (Bx) + BY, (Bx)] (4.703) 
Problem 4.36 Solve the following ODE by assuming two change of variables: 
(i) Z=x%u 
(ii) G = Bx" 
d’u 1—2a du 1.2 ae -y?y’ 
— —+ x7)" + —+_]u = 0 4,704 
7 Bye) ae (4.704) 
Ans: 
u=x" (AJ, (Bx")+ BY, (Bx’ )] (4.705) 
Problem 4.37 Solve the following ODE by assuming two change of variables: 
(i) Z= xu 
(ii) y= Bx!” 
2 2 02 
Ae en alla ne (4.706) 
dx? 4x 4x? 
Ans: 
uw =Vx[AJ, (BV x) + BY, (BVx)] (4.707) 


Problem 4.38 Solve the following ODE by assuming two change of variables: 
(i) Z =Vxu 
Gi) y = yx* 
2 
+(Byx?)Pu =0 (4.708) 
dx 
Ans: 
w= Vx ASiyap) 7x") + BY 2p (7x" JI (4.709) 


Problem 4.39 Solve the following ODE by assuming the following change of 
variables: y= Be* 
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ee e -~v’)u=0 (4.710) 


Ans: 
u= AJ, (Be*)+ BY, (Be*) (4.711) 


Problem 4.40 Solve the following Airy equation in terms of modified Bessel 
functions of the first and second kinds by assuming two change of variables: 

(i) Z=Vxu 

(ii) € =(2/3)x*” 


2. 
ayaa =0 (4.712) 
xX 
Ans: 
2 3/2 2 3/2 
u=Vx[Ahys(* )+ BKy3(Gx"”)] (4.713) 


Problem 4.41 Solve the following equation by assuming two change of variables: 
(i) Z =Vxu 
(ii) € =(2/3)x*” 


2 
FH vy =0 (4.714) 
dx 
Ans: 
2 2 
u= VxtAJis(G x7) + BY G ae | (4.715) 


Problem 4.42 Solve the following equation by assuming two change of variables: 
(i) Z =Vxu 
(ii) € =(4/3)x*" 


d’u , U9 (4.716) 
dx” Vx 
Ans: 
w= Vet Ads(G 4) + BY sa) (4.717) 


Problem 4.43 Solve the following equation by assuming two change of variables: 
(i) Z= xu 
(ii) € =(4/3)x*" 


d’u_u _ (4.718) 
dx? Vx , 
Ans: 
4 4 
u= Vx Alo xy + BRas5 x | (4.719) 


Problem 4.44 Solve the following equation 
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u=0 (4.720) 
Ans: 
u =x AS, (e!*)+ BY, (e"*)) (4.721) 


Problem 4.45 Solve the following equation by assuming a change of 
variables: u = sec x Z(x) 


du du _v* tanx 
f 2tanx 1 =0 4.722 
ue os, s (4.722) 
Ans: 
u =sec x[ AJ, (x) + BY, (x)] (4.723) 


Problem 4.46 Solve the following equation by assuming a change of 
variables: u = csc x Z(x) 


ro) ae du v’ cotx 
} + 2cotx u=0 4.724 
ae ome (4.724) 
Ans: 
u =cscex[ AJ, (x)+ BY,(x)] (4.725) 


Problem 4.47 Solve the following equation 
2 " ' ’ " ' 
du (LO cy FO) LOM, 8'O, oy yO) 
i 0, IO). LO) eet 
+2 =0 
Fa! Fo) fog OD 


Ans: 
u=f(x\[g(~)) TA, (g(@) + BY, (g())] (4.727) 
Problem 4.48 Solve the following equation 
Pu fod, 3S @Op_1/'@)_3 8" 1a" 
de fade 4 FG) 2F0 48) “280 ang) 
2 o- Loe (x)e: 
Heo (x)-v wey ju=0 


Ans: 


- LB 44, e09)+ BY (e(0) (4.729) 


Problem 4.49 Solve the following equation 
ay, AF"). 3-7" @)p 


oe 2 Fa) a Fay! +f? (x)-v" ral 


fy 
f(x) 


Pyw=0 = (4.730) 


Ans: 
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u= || PD Ay, (£00) + BY, (FI (4.731) 
f'@) 


Problem 4.50 Solve the following 2nd order ODE with non-constant coefficients 


du f"(x) f(x), du 2 2 2 f'(),2 _ 
a a) +(2u-1) Fx) de el ee Ol! u=0 (4.732) 
Ans: 
u=[f(x)TA, FQ)) + BY, (F())] (4.733) 
Problem 4.51 Solve the following 4-th order ODE 
d‘u 2d°u 2v?+1d7u_2v? +1 du vv? —4) 
= 4.734 
de x de oa x dx . ees ks 
Ans: 
u = AJ, (x) + BY, (x) +CI,(x)+ DK, (x) (4.735) 


Problem 4.52 Find the general solution of the following Airy equation in terms of 
Bessel functions: 
2 


arty =0 (4.736) 
Hint: By assuming 
u= VaZG vx?) (4.737) 
Ans: 
u = (Ay sGve?) BY, s(Gvx"?)} (4.738) 


Problem 4.53 Prove the validity of (4.465) to (4.467). 
Problem 4.54 Prove the validity of (4.470) to (4.472). 


Problem 4.55 Prove the validity of the following identity: 


Va _2:2,4-4, 6-6, 8-8. 
Ge aa a 8) (4.739) 


Hint: Refer to (4.3) and choose a proper value of n. 


Problem 4.56 Prove the validity of the following identity: 


va)=-7+ PE -— (4.740) 


n+x-l 


n=l 


Problem 4.57 Prove the validity of the following identity: 
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x-l 


1 feu 
= —d 4.741 
SO)=F5 I, mri (4.741) 
where C(x) is the Euler-Riemann zeta function defined as 
1 
6= >) — (4.742) 
n 


n=l 

Hints: Apply the change of variables ¢ = nu to the definition of gamma function 
given in (4.4) and sum 1 from | to ©, the reverse order of integration and 
summation, and finally sum the infinite series inside the integral by geometric 
series. For real x, (4.742) was studied by Euler in 1737 and was extended to 
complex x by Riemann in 1857. Note that this Euler-Riemann zeta function is 
of fundamental significance to prime number distribution and is linked to the 
celebrated problem of the Riemann hypothesis (e.g., see Havil, 2003; 
Sabbagh, 2003). In short, the hypothesis links to the distribution of primes 
(e.g., 2,3,5,7,11,13,17,..., 15,485,863, ... are primes) for large integers to the 
tnontrivial roots of the Euler-Riemann zeta function of the complex argument. 
The Clay Mathematics Institute set US$ 1 million as the reward for proving 
or disproving this Riemann hypothesis as one of the Millennium Problems. 


Problem 4.58 Rederive the following reflection formula (i.e., (4.35)) by answering 
the following sub-problems: 
1 


T(1-z)1(z) = (4.743) 


sin zz 
(1) Apply the recurrence formula of the gamma function (4.53) and the Weierstrass 
canonical form of the gamma function (4.302) to prove the following identity 


i 4 z 

=z] [,(0-—) (4.744) 

re) Tez) | me 

(11) Expand sinx in terms of infinite series, observe the infinite roots of sinx being 
na, and show that 


sin x x x x? = 
=(1 — ral =.-T{o- 


# n=l 


x2 
4.745 
= | (4.745) 


(iii) Combine the results in parts (1) and (ii) to derive (4.743). 


Note: This proof was given by Euler in 1739 and was reported in Whittaker and 
Watson (1927). This proof is simpler than that given in Section 4.2.4. 
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Taylor & Francis Group 


http://taylorandfrancis.com 


CHAPTER FIVE 


Systems of First Order Differential 
Equations 


5.1 INTRODUCTION 


In this chapter, we consider a system of coupled first order ODEs. This is a relatively 
modern topic, which evolves into the present form with the invention and 
popularization of the computer and the extensive use of numerical analysis after the 
Second World War in the 1940s. For example, it is not covered in the classic 
textbooks of Forsyth in the early twentieth century. Most of the modern computer 
software for solving ODEs, especially nonlinear ODEs, are formulated as a system of 
first order ODE. It is because all systems of differential equations, regardless of the 
order and size, can be recast in the standard form of a system of first order ODEs. It 
turns out that the solution of such systems involves solving the matrix eigenvalue 
problem. This chapter mainly deals with the linear system of ODEs for which analytic 
solutions can be obtained. For nonlinear ODEs, a more advanced perturbation method 
for studying the stability of the evolving systems of ODEs focuses mainly on the first 
order ODE system. 

In particular, we will show that all ODE systems can be recast into the 
following standard form: 

My = FLX], Xp +++ X,) 


Xq = Fy (t,%1,X95+++X,) (5.1) 


X), =F (t,%4, Xp 5+++X,) 

where x; (i = 1,2,...,7) are the unknown functions and fis the variable. If the functional 
form of F; is nonlinear, numerical analysis is normally used. For example, the Runge- 
Kutta method can be used to integrate this nonlinear system and its discussion will be 
given in Chapter 15. However, nonlinear systems of ODEs may have more than one 
solution at certain values of the variable ¢. That is, no unique solution can be 
guaranteed for a nonlinear system. The solutions may sometimes evolve in an 
unpredictable chaotic manner. 

This branch of mathematics has evolved into a major branch of applied 
mathematics, such as the theory of catastrophes, application of topologies, and chaos 
theory in solving nonlinear ODEs. In 1961, MIT meteorologist Edward Lorenz 
studied the chaotic behavior of the convective weather system and found that there is 
a strange attractor for chaotic solutions of the nonlinear ODE system. The shape of 
the strange attractor of Lorenz closely resembles a butterfly. The term "butterfly 
effect" emerges and becomes a fashionable term. It suggests the notion that a butterfly 
flipping its wings will induce small uncertainties in the initial condition that make 
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error cascades upward through a chain of turbulent features in a nonlinear system. It is 
“exaggerated” that storm systems on the other side of the globe months later are a 
consequence of this butterfly effect. The bottom line is that a small error in the initial 
condition can make a nonlinear system totally unpredictable. This contradicts the so- 
called "Laplace hypothesis" that the world is predictable if all physical laws and initial 
conditions are known. In 1966, Thom proposed a theory of catastrophe that includes 
the study of the geometry and topology of chaos structures. He believed that 
complicated chaotic structure at singularities (such as the butterfly-shaped Lorenz 
attractor) can be unfolded using studies of differentiable manifolds. Terms like cusp, 
bifurcation, and jump were introduced. In 1975, a Los Alamos based scientist, 
Mitchell Feigenbaum, made a major breakthrough in modern nonlinear analysis. He 
studied Robert May’s “‘period-doubling” of biology populations. May discovered that 
chaotic solutions emerge through a series of bifurcations, and at each bifurcation point 
the period doubles and one cycle splits into two cycles, and at the next bifurcation 
from two cycles to four cycles, and so on. This splitting makes a fascinating pattern. 
Feigenbaum discovered that the splitting appears to come at a faster and faster rate, 
but there is a scaling that the rate of splitting is a constant. To his amazement, no 
matter how different the nonlinear system that he started with, he arrived at the same 
or a “universal” convergence rate. The number is approximately 4.669201609103 and 
now this number is called the Feigenbaum number. More importantly, he discovered 
an order within the so-called chaos. This sheds light on the hope that there is order 
within chaos. In other words, chaotic solution is predictable, and it is just our limited 
knowledge or ignorance of patterns in the so-called chaos. This also leads to the 
revitalization of scaling studies of fractal dimension, the Julia set and the Mandelbrot 
set. Unfortunately, there has not been another major breakthrough since Feigenbaum's 
discovery. 

Returning to linear systems of ODEs in this chapter, we will show how 
systems of ODE of arbitrary order and with an arbitrary number of unknowns are 
transformed into a system of first order ODEs. We will also summarize the 
uncoupling of ODEs for multiple unknowns involving higher derivatives originated 
by Jacobi and Chrystal. The determinacy of ODE systems and its relation to the 
number of arbitrary constants in the solution are considered. The solution of ODE 
system with constant coefficients will be presented in the context of a matrix 
eigenvalue problem. The solution form is classified into the cases of distinct 
eigenvalue, repeated eigenvalues for Hermitian and non-Hermitian matrices, and 
complex conjugate pairs of eigenvalues. 

Systems of differential equations have been formulated by Euler, D’Alembert, 
Lagrange, and Laplace. The development of the matrix method for solving a system 
of first order ODEs can trace back to the time of Laplace, Lagrange and Cauchy, 
when they considered the eigenvalue problems and formulating problem in celestial 
mechanics. They were also concerned with the stability of the solution of 
differential equations. Lagrange successfully dealt with the case of repeated 
eigenvalues, but it was Laplace who recognized the importance of the symmetric 
matrix on eigenvalues. However, Laplace, Lagrange, and Cauchy were not aware of 
the relation between eigenvalue problems and solving systems of first order ODE. 
It was J.C.F. Sturm, a student of Fourier and co-founder of the Sturm-Liouville 
problem, who pointed out to Cauchy in 1828 that the eigenvalue problem is related 
to solving the system of first order ODEs. Later contributors to the development of 
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the matrix method for systems of ODEs include Cayley, Weierstrass, Dirichlet, 
Jordan, and Frobenius, among others. As summarized by Hawkins (1975a-b, 1977), 
the development of matrix theory is, in fact, closely related to the solution of 
systems of first order ODEs. 


5.2) REDUCTION OF N-TH ORDER ODE TO SYSTEM OF EQUATIONS 


Consider an n-th order ODE given symbolically as 
y =F (ty, 2 yee y” | (5.2) 


This ODE can be recast as a system of n ODEs. In particular, we make the following 
identifications: 


X =Vs Xp = V's Hy = "one, = YO (5.3) 
where the new unknowns are the original unknown y and its higher derivatives up to 
order n—1. Therefore, one unknown is split into unknowns, and the first n—1 
definitions in (5.3) actually also provide the first n—1 ODEs of the system: 


Uy _ 
x =X 
x) =X3 
: (5.4) 
t 
Xn-1 = Xn 


Xp, = F(t, %,Xp,0++X,,) 


Thus, (5.4) is the equivalent n ODE system for the single n-th order ODE given in 
(5.2). 

Let us consider a practical problem of two oscillators connected through a 
string and a dashpot (or a mechanical damper) given in Figure 5.1. 


Figure 5.1 Two connected movable masses as a system of oscillators 


Newton's second law of force equilibrium can be applied to formulate the equations 
of motion for the two masses. In this formulation, we will neglect the frictions (both 
static and kinematic or dynamic) between the base of the mass and the horizontal 
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ground that the two masses rest on. The damping forces are assumed proportional 
to the velocity of the damper. We have the following equations of motion: 


dx, 


d(x.—%) ay 
m, —— =k,(x,—-x,)-kx, +¢ Cc +Fi(t 
1p 2(X2 — X41) — kx, +5 di 1h i(t) 
hh ikea tele ee Re 
dt dt 
P at (5.5) 
X45 Xy —X 
m =—-k,(x,—-Xx,)-c + F(t 
2p 2(%2 -%)—C2 i 2(t) 
dx dx 
Shek =k + —_ - oe SR 
a es ae ae >(¢) 
Similar to our earlier discussion, we can redefine the unknown as: 
dx, 
=x, = 5.6 
aA 1> J2 dt (5.6) 
AX, 
=X, = 5.7 
¥3=%22 Ya (5.7) 


Using these definitions, (5.4) and (5.5) can be recast as a system of first order 
ODEs 


W=)2 


, 1 
Va5= thi thy) yy thy y3 (+) 2 tO, + HO | 
1 


; (5.8) 
V3 = 4 
, 1 
V4 = pmlc —hyV3 + 6,V. - OV, + Fy) | 
2 
Putting (5.8) in matrix form, we have 
[ 0 1 0 0 | 
y 1 1 1 1 y 0 
: ——(k,+ky) -—(q+e) —k, —oe, 
djy2|_| ™ m my ai v2 JAO (5.9) 
dt | ¥3 0 0 0 1 V3 0 
1 1 1 1 Fy(t 
V4 —ky — 05 -—k, -—c Va (0) 
L My My my m | 
Symbolically, the first order ODE can be written in matrix form as 
y =Ay+F (5.10) 


Therefore, it is clear that all coupled ODEs can be expressed as a system of first 
order ODEs. This is particularly important if the ODE is nonlinear and an exact 
solution cannot be found. Nowadays, nearly all computer software for numerical 
analyses deals exclusively with systems of first order ODEs (e.g., Press et al., 
1992). One of the most popular methods is the fourth order Rung-Kutta method, 
which will be discussed in more detail in Chapter 15. 
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5.3 ANALYTIC SOLUTION BY ELIMINATION 


To illustrate the idea of solving a system of equations by elimination, we consider the 
case of two linear coupled ODEs as: 


F,, (D)x+Fy(D)y =f 


(5.11) 
Fy, (D)x+ Fy (D)y =fy 
where F’ are differential operators defined as: 
Fi, (D)x=(a,D‘ +a,D*" +...4.4,)x (5.12) 
Fin (D) y= (byD* +b, D‘" +...4+b,)y (5.13) 


Similarly, F'; and F22 can be defined. We will restrict our discussions to ODEs with 
constant coefficients, but we will not restrict ourselves to first order, which is the 
focus for the rest of this chapter. Differentiation with respect to ¢ is denoted 
symbolically by D as: 


= (5.14) 


To eliminate x from (5.11), we can first take differential operator F'1; on the second 
equation of (5.11) and take differential operator F2; on the first of (5.11). Then, the 
difference of these two resulting equations gives: 


(Fy Fin — Fi Fao) = Fa fi -Fitho (5.15) 
Similarly, we can also eliminate y to give 

(Fa Fi2 — Fu Fa2)x = Fao fi + Piotr (5.16) 
where 

Ar Fo =-+(F,,F,, — FF) #0 (5.17) 

Fy Ep 212 — 11422 : 


This determinant is called the operational determinant (Edwards and Penney, 2005), 
the characteristic determinant (Ince, 1956), or simply the determinant of the system 
(Tenenbaum and Pollard, 1963). If the determinant on (5.17) is zero, the system may 
have either no solution or infinite solutions. Thus, (5.11) can be uncoupled to give 
two ODE for x and y separately. Note that the operators on the left of (5.15) and 
(5.16) are the same. Both of them are nonhomogeneous linear ODEs and thus can be 
solved by the technique covered in an earlier chapter. 

However, the resulting uncoupled ODE is of higher order, and thus the 
elimination process induces a number of unnecessary and undesirable arbitrary 
constants. They need to be eliminated by substituting these solutions back into the 
original ODE system. In other words, the uncoupled ODE is not equivalent to the 
original system. The following examples illustrate this problem. 


Example 5.1 A system of two second order ODEs is given as 
d* d’y : 

( 2)x =-—2sin it 

dt dt’ 

2 2 


(5.18) 
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Find the solution of the system. 


Solution: Adding these two ODEs gives 
2 


FX «sin Jt (5.19) 
dt 
Integrating twice, we obtain 
x = sin J2t+ At+ A, (5.20) 
Substitution of (5.20) into the first equation of (5.18) gives 
2 
fy 2 549% (5.21) 
dt 


Direct integration on both sides yields the solution of y 
1 
y= “40 aur tREER, (5.22) 


Since both x and y are second order in the system of (5.18), it is expected that there 
should be four unknown constants. 


Example 5.2 A system of two second order ODEs is given as 
d° d* 


(<-2)x+ <= ~2sin 21 
eo ut (5.23) 
d* d’y 
(eet a0 
dt dt 
Find the solution of the system. 
Solution: Subtracting these two ODEs gives 
x= sin f2t (5.24) 
Substitution of this result into (5.23) leads to 
2 
= =0 (5.25) 
dt 
Substitution of (5.25) into the first equation of (5.23) gives 
y=Bt+B, (5.26) 


Thus, there are only two unknown constants. Thus, the corresponding boundary 
conditions must be given as 


dy 
=y, t=t 5.27 
” Vy 0 (5.27) 


Note in this case that we only have two unknown constants instead of four as 
obtained in Example 5.1. We will discuss this peculiar result in the next section. 


Y=Vo> 


Example 5.3 A system of two second order ODEs is given as 
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d? d* 
(—-2)x oem =—4sin /2t 
dt (5.28) 
il +2)x+—= =0 
dt” 
Find the solution of the system. 
Solution: Subtracting these two ODEs gives 
x+y=sinJ2t (5.29) 
Differentiation of (5.29) leads to 
2 
id a - ~2sin /2t (5.30) 
t 
Substitution of (5.30) into the second equation of (5.28) gives 
x =sin J2t (5.31) 
Consequently, we must have 
y=0 (5.32) 


We have just obtained another peculiar result that there is no arbitrary unknown 
constant in the solution. Or, we cannot impose any initial condition. Again, we will 
discuss this in the next section. 


Example 5.4 A system of two second order ODEs is given as 
i yx as =—2sint 
dt’ de dt? 


ge Pp (5.33) 
+—)x+ Y=0 
dt? dt dt? 
Find the solution of the system. 
Solution: Subtracting the first of (5.33) from the second of (5.33) gives 
d dy : 
—+1)x+—=2sint 5.34 
( it ) rf (5.34) 
Differentiation of (5.34) with respect to ¢ gives 
dd. d’y 
(—+—)x+ =2cost (5.35) 
dt? dt dt’ 


This ODE is however inconsistent with the second equation of (5.33). Thus, this 
inconsistency leads to no solution. 


Example 5.5 A system of three equations is 
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a ae d dz 
— -2— +3)x+(—-ly+— =0 
Ce dt ms Ch y dt 
d dy dz 
3—+1)x-3 =1 5.36 
Ca m dt dt ( ) 
2x-2y—-z=-4 


Find the solution of the system. 


Solution: Differentiation of the last equation of (5.36) results in 
iy a (5.37) 
dt dt = dt 
Substitution of (5.37) into the first two of (5.36) leads to a system of ODEs for x 
and y only: 


2 
ea +3)x ea +)ly=0 
dt dt (5.38) 
ce +1)x- ay = 
dt dt 
Elimination of y from (5.38) gives 
z z @ 1)x=-1 (5.39) 
dt dt dt 
Assuming an exponent solution for the homogeneous ODE of (5.39), we obtain 
2? 44-D= 0-17" +)=0 (5.40) 
Thus, the homogeneous solution is 
x), = Ce’ +C, cost +C; sint (5.41) 
It is straightforward to see that the particular solution for (5.39) is 
x,=1 (5.42) 
Therefore, we have 
x=1+Ce' +C, cost+C; sint (5.43) 
Substitution of this result into the second equation of (5.38) leads to 
i 7 aa +)x-1 
dt dt (5.44) 
=(C;, +C,)cost+(C, —C,)sint+ 2C,e 
Integration of (5.44) gives 
y =(C3 +C,)sint —(C, —C,)cost+2C,e' +C, (5.45) 
Substitution of these solutions of x and y into the first equation of (5.38) leads to 
Cy, =3 (5.46) 


Finally, substitution of (5.43) and (5.45) into the third equation of (5.36) gives the 
solution of z 

z= -2C, sint + 2C; cost —2C,e’ (5.47) 
This gives the complete solution for the system of ODEs given in (5.36) and note 
that there are only three unknown constants. 
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It can be seen from Example 5.5 that one of the constants needs to be fixed by 
the original ODE. Instead of back substitution into the original system, we can also 
make the transformed ODE system equivalent to the original one by using a multiplier 
system during the elimination procedure (Ince, 1956). As remarked by Ince (1956), 
this topic was first considered by Jacobi in 1865 and was refined by Chrystal in 1895. 

Let us consider two systems of coupled ODEs of 7 unknowns 9; (i =1,...,72) 


U, = F,(D) y, + Feg(D) yy +--+ Fg (Dn — f= 0 (5.48) 

V,. = G4 (D)», +G,.(D)y2 +..+G,,(D)y, — g,(t) =0 (5.49) 
where r =1,2,...,.1 and Fi; and Gi are polynomials in terms of differential operator D 
and with constant coefficients. For the case of constant coefficient polynomials of D, 
it is straightforward to see that these operators are commutative, associative, and 
distributive (Tenenbaum and Pollard, 1963), but the proofs of them will not be 
covered here. 

Now, we are looking for the condition that these two ODE systems are 
equivalent (i.e., the solutions of (5.49) are also solution of (5.48) and vice versa). If 
every solution of U satisfies V, we have 

V, =T,(D)U, +...+T,, (DU, 
(5.50) 


V,, =T,(D)U, +...+T, 


i An 


(D)U 
where 7; are polynomial operators. It is clear that if U satisfies (5.48), V defined in 
(5.50) would also satisfy (5.49). If (5.50) is written in matrix form, we must have 


n 


Vil (hy 2 Bp i 
2 : (5.51) 
7 ee | oe 
Te 2 Ty 
A=|: .. 2/40 (5.52) 
 H 


The determinant is nonzero since all V are independent. Ince (1956) called 7; the 
multiplier system. If every solution of V satisfies U, we also have, by the theory of the 
determinant (Ince, 1956), 


AU, = 4, (D)K +...+4,, (DV, 


= (5.53) 
AU, = 4 (D)V, +--+ Sin (DV, 
where 
fn + 
—=/: .. 1140 5.54 
a ; (5.54) 
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In fact, (5.53) can be regarded as the inversion of the system of ODEs given in (5.51). 
If V satisfies (5.49), we have 

AU, =0,...,4U,, =0 (5.55) 
Thus, if A is a constant, every solution of V = 0 satisfies U = 0. In other words, A 
defined in (5.54) cannot be a function of D. This is the condition in which (5.48) and 
(5.49) are equivalent. 

A natural question would be how to find the multiplier system. The first row of 
it is naturally chosen by our aim of eliminating one of the unknowns of the system. 
The other rows must be found in such a way that the resulting determinant (5.52) will 
become a constant. We will illustrate more specifically the elimination process for the 
first two equations of (5.48). Written explicitly, these equations are 


U, =F, (D)», + Fi(D)y, tact Fi, D)y, — f,) =0 (5.56) 
U, = Fy (D)y, + Foy (D) yy +--+ Fog (D) Yn — fy) = 0 (5.57) 
Let us assume that y; can be eliminated from (5.56) and (5.57) to get 
LU, +MU, =0 (5.58) 
LU, +M"U, =0 (5.59) 


In matrix form, this equivalent pair of ODEs becomes 


eae tet S40 


Since both U; and U2 are identical zero, we have the determinant of the operator 
coefficient being nonzero: 
LM* — ML" = constant (5.61) 
Assume further that there is a common polynomial factor /"between F\; and F; such 
that 
F,\(D)=I®, Fy(D)=T¥, (5.62) 
Clearly, to eliminate ; we should let 
L=¥, M=-@ (5.63) 
where L and M are relatively prime with respect to D. Equation (5.61) provides a 
condition to find the last two operators of the multiplier system L* and M*, which 
were called adjoint operators (Golomb and Shanks, 1965). However, it can easily be 
confused with the adjoint ODE and we will not use such terminology in this book. 
We now illustrate this technique of the multiplier system. 


Example 5.6 Use the method of the multiplier system to solve the following ODE 
system 
(D+1)y, + D’y, +(D+)y, =0 
(D-l)y, + Dy, +(D-y =0 (5.64) 
Wy +Y2 + Dy; =0 
Solution: To eliminate yz from the second and third equations of (5.64), we can sum 
the result of multiplying —1 to the second equation of (5.64) and the result of applying 


D to the third equation of (5.64). That is, we have L = —1 and M = 0. The associated 
multiplier system can be found by inspection as: 
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[ i . il 
= (5.65) 
L* M*) |-1 D-1 


It is straightforward to show that its determinant is | (although any constant will do, 
we have chosen to use 1), and thus (5.61) is satisfied. Thus, we have found L* and M* 
in (5.65) by inspection. The resulting ODE becomes 


(D+ly, + D’y, +(D+Dy, =0 
y, +(D* —D+1)y; =0 (5.66) 
=y, +(D* =2D+1)y, =0 
To eliminate y, from the first and second equations of (5.64), we can use another 


multiplier system: 

-1 D+l1 

(5.67) 
-1 D 


The first row of (5.67) is the pairs of operators need in eliminating y; and the second 
row is the corresponding operators to make the determinant of (5.67) being 1 (see 
(5.61)) and it is obtained by inspection. The new system of ODEs becomes 


—y, -D*y, +(D’ -D* -1)y; =0 
—~D’ y, +(D? —D)y; =0 (5.68) 
~y, +(D* -2D+1)y, =0 


Finally, we eliminate y2 from the second and third equations of (5.68) by using the 
following multiplier system: 


0 1 
° # (5.69) 
Eventually, we have 
-~y, -D’y, +(D’ -D* -1)y, =0 
~y, +(D’ -2D+1)y3 =0 (5.70) 


(D‘ -3D? + D? + D)y; =0 
3 


The last of (5.70) is uncoupled and can be solved by exponential function (i.e., y3= 
e”). The corresponding characteristics equation for y3 is 


At 303 +17 +4=0 (5.71) 
Without going through the details, one can show that it can be factorized as 
AA-DA= 1-42 )(4-1442) <0 (5.72) 


The general solution can be expressed as 
R=CHC 2 Cer Lee (5.73) 
Substitution of (5.73) into the rest of (5.70) yields 
Vp =C, + B+ V2) Ce?” + (3-2) ye" (5.74) 
y= 26-6 2623 2yCe™" 96-32)" (5,75) 
This completes the full solution. 
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To understand the scenarios that we have observed from Examples 5.1—5.6, we will 
consider the determinacy of system of ODEs in the next section. 


5.4 DETERMINACY OF SYSTEM OF EQUATIONS 


Let us consider the following system of ODEs symbolically: 
P(D)x+Q,(D)y + R(D)z =F, 
P,(D)x+Q,(D)y + R,(D)z = F, (5.76) 
P(D)x+0,(D)y + Ry(D)2 = Fy 
where P;, Q; and R; are differential operators. This system of coupled ODEs can be 
written in matrix form as: 
R(D) OW) RWD)](x} [A 


PD) Q(D) RD) hyp =4% (5.77) 
P(D) OD) RD) (2) U3 
[A(D)]{x} ={F} or A(D)x =F (5.78) 


Let the determinant of the matrix of differential operators be A(D) = det | A(D) | ; 
where D is defined in (5.14). The system of ODEs is determinate (1.e., solution can be 
found) if A(D) # 0 and the system is indeterminate if A(D) = 0. The degree in D of 
the determinant A(D) indicates the number of arbitrary unknown constants involved in 
the solution. Therefore, it is necessary to check the determinacy of the system before 
attempting to solve it. 

Let us check the determinacy of each of the examples considered in the last 
section. The A(D) of Example 5.1 is 


p=? =p 
D?+2 D* 
Thus, A(D) # 0 and this system is determinate. The degree of D is four and we have 


four unknown constants. This agrees with the result in Example 5.1. The determinant 
for the system given in Example 5.2 is 


Dp =k De 
D2 DF 
Hence, it is determinate and there are two unknown constants, and it is what we found 


in Example 5.2. The determinacy of the system given in Example 5.3 can be 
determined as: 


A(D) = =2p4 (5.79) 


A(D) = =-4)° (5.80) 


P=) De =A 
P42 Dp? 


Thus, there is no unknown constant like we found in Example 5.3. For Example 5.4, 
we have A(D) 


A(D) = =8 (5.81) 
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D1 D=p 
D+D D 
Therefore, it is indeterminate and no solution can be found. Finally, the determinacy 
of the system in Example 5.5 is 

D?-2D+3 D-1 D 
A(D) =| 3D+1 3D D|=D*-D?+D-1 (5.83) 
2 —2 -l 


A(D) = =0 (5.82) 


There are three unknown constants for the system of ODEs, and this agrees with our 
results. Therefore, it is advisable to check the determinacy of the system before we 
actually solve it. 

Finally, the characteristics determinant of Example 5.6 becomes 


D+1 D? D+1 
A(D)=|D-1 D D-1)=-D*+3D*>-D?+D (5.84) 
Ll & @ 


This suggests four unknown constants and this agrees with the result of Example 5.6. 


5.5 REVIEW ON MATRIX 


It will be shown that the solution of the system of first order ODEs involves solving 
the eigenvalue problem in a matrix. Some fundamental matrix results will be 
summarized here. Consider an mxn matrix A 


ay Ay Qn, 
47, Ag ay 

A= ” (5.85) 
Amt U2 ** Ann 


The matrix is sometimes denoted as A = (aj). The transpose of it is A? = (aj) or more 
explicitly: 


mi 

a2 2 Gm? 
A= ” (5.86) 

Gn Ayn *** Ann 


That is, the first column becomes the first row of the transpose matrix and similarly 
for other rows. A zero matrix is defined as 0 =(0) or all entries in the matrix are zeros. 
If two matrices are the same, all elements must be identical: 


A=B, a,=b; (5.87) 
Matrix addition is defined as: 
A+B=C, (cj) =(a;+5;) (5.88) 
Scalar multiplication is defined as: 
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kA = (kaj) (5.89) 
Matrix multiplication is defined as AB = C, or 
6 = > anby (5.90) 
k=l 


where matrix A is mxn, matrix B is nxr, and the resulting matrix C is mxr. In other 
words, the i-th row multiplying the j-th column yields the element of cj. In general, 
matrix multiplication is not commutative, that is AB # BA. In addition, we may also 
have AB = 0 even though A # 0 and Bz 0. 


Example 5.7 Show that the multiplication of the following matrices is not 


commutative: 
1 2 1 3 
A= , B= (5.91) 
3 4 2 4 


Solution: Applying the rule of matrix multiplication given in (5.91), we get 
1 2)\(1 3 Ixl+2x2 1x3+2x4 5 ll 
AB= = = (5.92) 
3 4)\2 4 3x1+4x2 3x3+4x4 11 25 
On the other hand, 
1 3\f1 2 Ixl+3x3 1x24+3x4 10 14 
BA= = = (5.93) 
2 4)\3 4 2x1+4x3 2x2+4x4 14 20 
From this example, it is clear that matrix multiplication is, in general, not 
commutative. 


A unit matrix can be defined as: 


Oo - 0 
0 1: 0 
ar (5.94) 
0 Ov 1 
In addition, it is straightforward to illustrate that for any square matrix A 
IA=AI=A (5.95) 
Then, the inverse of a matrix A can be defined as 
AA'=A'A=I (5.96) 
The inverse exists as long as the determinant of A is nonzero. We also have 
(A'y1=A4 (5.97) 


The dot product between vectors can be expressed in terms of matrix multiplication of 
a row vector (or an nx1 matrix) with a column matrix (or a 1x matrix) as: 


x’y=) xy, (5.98) 
i=l 


More generally, if the elements in the vectors are complex, we can define an inner 
product (which is an extension of the dot product to the complex case): 
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(uy)=x’y= ay, (5.99) 
i=l 


which is in essence the same as the dot product except for taking the complex 
conjugate of vector y. The dot product of a vector and itself forms the square of the 
length of the vector. If the dot product or inner product of two vectors (real or 
complex) equals zero, these vectors are called orthogonal. The word "orthogonal" is 
equivalent to the term perpendicular the case of 2-D or 3-D vectors (1.e., vector with 2 
or 3 elements). The adjective perpendicular is not very meaningful, if we go beyond 
3-D cases. This is because perpendicularity is normally visualized in a geometric 
sense that cannot be applied to a 4-D or higher dimensional space. 

As a side story, we should also mention that when the great mathematician 
Hilbert tried solving eigenvalue problems of integral equations in 1901, he 
inaugurated the study of spectrum theory (i.e., the spectrum of eigenvalues and 
eigenfunctions of integral equations). Hilbert also showed that differential equations 
could be converted to integral equations. His studies prompted other mathematicians, 
like Schmidt, Riesz, Volterra, Fischer, Lebesgue, Frechet, and Banach, to introduce 
and develop the concept of the more abstract analysis of functional space or vector 
space. Actually, the inner product defined in (5.99) is one of the major tools used in 
functional analysis. It has been shown that the differential operator needs a different 
definition of inner product such that the resulting eigenfunctions (in terms of vector) 
are orthogonal. Nowadays, the Hilbert space and Banach space concepts become very 
important in numerical analysis of complex differential equations. However, these 
topics are considered abstract for most engineers and are out of the scope of the 
present book. 


5.5.1 Hermitian Matrix 


For real matrices, if A = A’, it is called symmetric. For matrices with complex 
elements, if 

A=A’™ (5.100) 
it is called self-adjoint. The bar over matrix A indicates that all elements in the matrix 
are replaced by their complex conjugates. Self-adjoint matrices are also known as 
Hermitian matrices, named after French mathematician Hermite. In particular, if a 
matrix is Hermitian, all eigenvalues of the matrix are real and all eigenvectors are 
independent. For Hermitian matrices with an eigenvalue of algebraic multiplicity m, it 
is possible to choose m mutually orthogonal eigenvectors. This is important in the 
sense that the solution form in terms of eigenvectors may depend on whether A is 
Hermitian. This will be discussed shortly in this chapter. 

There is also a close resemblance between the eigenvalue problem of the 
Hermitian matrix and the eigenfunction expansion of a homogeneous boundary value 
problem of ODEs. In fact, if an ODE is self-adjoint (note that this terminology is the 
same as the one used in (5.100) for matrix), the eigenvalues of the boundary value 
problem are also real and all corresponding eigenfunctions are orthogonal. 
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5.5.2 Eigenvalue Problem 


The eigenvalue of a matrix is of profound importance as its applications include the 
calculation of vibration frequency of mechanical systems or solids, of principal 
stresses in solids, and of dynamic stability of systems. The prefix eigen- is adopted 
from the German “eigen” for “self.” Matrix multiplication can be considered as a 
function evaluation or mapping. For example, if a matrix A is multiplied by a 
vector x and the result is another vector y: 

Ax=y (5.101) 


It is a linear function transformation between vector spaces. The eigenvalue 
problem is related to a transformation such that a vector x is mapped to itself 
except for a scalar multiple 4, which is called the eigenvalue of the matrix. That is, 

Ax =Ax (5.102) 
It turns out that it is, in general, not possible to find such a system except for some 
particular values of 2. The number of the eigenvalue depends on the dimension of 
the matrix A. Rearranging (5.102) we have 


Ax —Ax =(A-AI)x =0 (5.103) 
If vector x is not zero, the determinant of the matrix in the bracket must be zero 
det(A— AT) =0 (5.104) 


The solution of this algebraic equation gives the required so-called eigenvalue A of 
the matrix A. The corresponding vector x is called the eigenvector. 


Example 5.8 Find the eigenvalue and vector of the following matrix: 
3 -l 
A= 5.105 
Cs (5.108) 
Solution: The eigenvalue of this matrix A can be evaluated as: 


de(a~at)= a (3 Bee Dre ] 
4 -2 0 1 4 -2-A (5.106) 


=A? -4-2=(A-2)(A41) 
It is straightforward to see that a 2x2 matrix leads to an algebraic equation of 
second order. Thus, we have two eigenvalues as 
A=2,A=-1 (5.107) 


For A = 2, applying (5.103) we have 


3-2 =T)\f 4 | _ (9 5.108 
4 -2-2)}\x,} (Oo One) 


This matrix equation implies 


X, =X (5.109) 


Thus, the eigenvector for A = 2 is 


x -(2}-¢))-(1] (5.110) 
2 
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where c is an arbitrary constant. In the last part of (5.106), we have set c = | to get 
the eigenvector, but .other values of c can also be used. 
For A =-—1, the eigenvector can be calculated as 


3+1 —l)\f x, 0 4 -l)\( x, 0 
= o = (5.111) 
4 -24+1)\x, 0 4 -1)\ x, 0 
This implies that 
X, =4x, (5.112) 


x) (ae }a)-(4) (5.113) 
1 


Again, we have set c = | to get the last eigenvector. Clearly, the eigenvector is not 
unique. 


This gives 


5.5.3 Differentiation of Matrix 


More generally, the elements of a matrix can be a function of the variable: 


a(t) a(t) -* a, (t) 
A(t) = a0 a(t . Aan() (5.114) 
Am) (t) An? (t) Ann (t) 


Therefore, sometimes it is desirable to consider the calculus on matrices or vectors. 
Some essential formulas are reported here: 


dA (day) 5? b 
=) f Awe= | .(t)dt 5.115 
7 (ef (0) [fay (5.115) 
For a constant matrix C and a constant c, we have this differentiation rule 
d(CA 
d(CA) _ dA (5.116) 
dt dt 
d(cA 
(cA) _ aA (5.117) 
dt dt 


Similarly, we have the following distributive rules of differentiation: 
d(A+B) | dA dB 
dt dt dt 


d(AB) -(4)p | a( 2) (5.119) 


dt dt dt 


(5.118) 


5.6 HOMOGENEOUS SYSTEM WITH CONSTANT COEFFICIENTS 


A system of n linear first order ODEs can be expressed as: 
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X] = Ay, (C1) + Ay (OX +--- +4, (OX, +O) 
X= Ay, (t)Xy + Ay) (1)Xq +--- + Ay, (OX, + yO) (5.120) 
Xi, =A, (t)X, +4,9(t)X> +---+4,, (Ox, +f, (0) 


where fis the nonhomogeneous term. This system can be expressed in matrix form 
as 


A= AX+F or X’= AX+F (5.121) 
where 
x, (t) a(t) a(t) s* sa, (t) AO 
X(t) An\(t) y(t) st 1 a, (t) Sy) 
X= : , A= : : "e : , F= : (5.122) 
X, (t) Gi) aall) o* = -a,,@) Si 


If F = 0, the system is homogeneous. If matrix A # A(f), the system is called 
autonomous. The initial condition of this system can be given as 


x; (to) y 
X(t) ry 

X(t))= : =X = : (5.123) 
Xn (t) Th 


If A and F are continuous in an interval containing point f, the initial value 
problem has a unique solution in this interval. For a nxn matrix A and F = 0, there 
exists n independent solution vectors X; (i = 1,2,...,.1) and the general solution for 
(5.121) with F = 0 is given by 
X=c,Xy +cyXq +0008 +c¢,Xq (5.124) 

where ci, ..., Cn are arbitrary constants. The specific form of X; will be discussed 
later. Note that there are n unknown constants in the general solution. This set of 
independent solution vectors forms a fundamental set of solutions of the system. 

Similar to our discussion on ODEs, we can use the Wronskian to check their 
independence: 


X(t) 2(t) vr y(t) 
1 (t) Xn (¢) lane Xan (#) 


W (X1,Xq 000! X,)=det} : oP (5.125) 


a(t) ott) 2 - on) 


where 
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x11 (t) X(t) Xin (t) 
a (#) ‘oa (0) *2n (t) 


a (| Ee (t) ait 


(5.126) 


If these solutions are independent, the Wronskian must be nonzero, or W(X1, X>, 


veeey Xp) #0. 


Example 5.9 Check whether the following system 
dx 


“aoe 
dy 
—==5x+3 
ape 


has the following two independent solutions 
24 7 6t 
e 1 3e 3 
gett = 505 5 


Solution: First , we rewrite the system in matrix form as 


1 3 x 
X'=AX = X, X= 
[5 i H 


Differentiation of the first solution in (5.128) gives 


de! 
X= 
Qe! 
On the other hand, we have 


_f1 3] e% |_| 26" 
ie 5 3i|_.2 | | 9,28 


(5.127) 


(5.128) 


(5.129) 


(5.130) 


(5.131) 


Comparison of (5.130) and (5.131) shows that the first vector given in (5.128) is 


indeed a solution of the system. 


Similarly, the differentiation of the second vector in (5.128) is 


; a 
X= 


The right hand side of the ODE system is 


1 31/3e%] |18e% 
AX, = 6t | 6t 
5 3]]5e 30e 


(5.132) 


(5.133) 


This again shows the validity of the second solution vector in (5.128). To check the 


independence, we can form the Wronskian as 
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Ee 3e"||4 3 
et 56] |-1 5 


Therefore, the independence of these solutions is demonstrated. Thus, they do form 
the fundamental solution set. 


WX) Ca =8e" #0 (5.134) 


Now let us consider the special case of an ODE system with a constant coefficient 
and no nonhomogeneous term F in (5.121). We seek a solution of exponential 
form 


_ he wie; a 
kye™" k, 
X=| : |=| i le4 =Ke* (5.135) 
ke [An | 
Differentiation of (5.135) results in 
kAe™ 
kde“ 
X’=| 3 |=Kre* (5.136) 
| k, Ae” | 
Substitution of (5.135) and (5.136) into (5.121) gives 
Kie™ = AKe™ (5.137) 


Since the solution should be true for all values of variable t, we can cancel the 
exponential function on both sides of (5.137). Putting all terms on the left, we 
obtain an eigenvalue problem as: 


(A-ADK =0 (5.138) 
Since K is not zero, we must require 
det(A — AI) =0 (5.139) 


Thus, solving a system of ODEs reduces to solving the matrix eigenvalue problem. 
The general solution form depends on what kind of roots we get for the eigenvalue. 
We will consider all scenarios one by one. 


5.6.1 Case 1: Distinct Eigenvalues 


If all eigenvalues are real and distinct, the general solution is 

X=¢K,e™ +¢,K,e" +, Kye +--+, Ke" (5.140) 
where vector K is the corresponding eigenvector for the eigenvalue i. This is the 
simplest scenario. 
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5.6.2 Case 2: Repeated Eigenvalues 


If one of the eigenvalue 1, has a multiplicity of m whilst all other eigenvalues are 
distinct and real, the solution can be expressed in two different forms depending on 
whether A is Hermitian. 


Case 2.1: If A is Hermitian, we would be able to find m independent eigenvectors 
for the same eigenvalue. The general solution is 

X=¢ Ke" +...4¢,, K€" + G1 Kerr’ He +0, Ke (5.141) 
where vectors Kj (i = 1,2,...,m) are the independent eigenvectors corresponding to 
repeated eigenvalue Am. 


Case 2.2: If A is not Hermitian, the general solution is 
X= CX yp + CoXqy $e + Gy Xam Cnet Kern teste, Kye (5.142) 


m aym + 


where solution vectors for the repeated eigenvalues are 


Xai = K,,¢" 
X,2 =K,,te*" +K,) e* 
(5.143) 
_ ti" 1 oli! pn at Jt 
Xam =K a1 (m—)! pie .. thao F reiieichaiks + Kam 
The vectors K are calculated by 
(A - AA) Kas =0 
A-AI)K,,=K 
(AAT) Kua = Kas (5.144) 


(A ~ Al) Kam = Ky mt 
If we have two or more repeated eigenvalues for the matrix A, we have to add more 
solutions for the repeated eigenvalues similar to that given in (5.142). 
Proof: The proof of this formula will be sketched here. For any p =1,2,...,m, we 
have the solution as 


p-2 
Xap = Ka opi = eM + Kya oe t vieess +K,,, e*" (5.145) 
Differentiation of (5.145) gives 


a je) 3 fe ks 
Xap = 4%aKar oom” “+(Kyy TAK IEP j 


K AK The 
+( ar ‘a Gaal © 


1 
treet (Kapa t AKaps) ye" +(Kyp1t4Kyp)e™" 
(5.146) 
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Substitution of (5.145) and (5.146) into the ODE system gives 
AX, ~- Rs 
p-2 (5.147) 
4! 4 [(A-A,DK,, -K,,] —— ce +...=0 
"AG Coy +[( ) alta © ale 
The function of ¢ is in general nonzero, and therefore, their coefficients must be 
Zero: 


=(A-A, DK 


(A-/,1)K,) =0 


a 


A-ZI)K,, =K 
( al) a,2 a,l (5.148) 


(A- Al) Kam =K 
which is (5.144). This completes the proof. 


a,m-1 


5.6.3 Case 3: Complex Eigenvalues 


If there is a pair of complex eigenvalues, the two complex eigenvalues must be in 
conjugate form. Let us assume they are 


A,=atiB, 2,=a-iff (5.149) 
And their corresponding eigenvalues are 
K, =B,+iB,, K>=B,-iB, (5.150) 
The general solution is 
X=¢X, +c)X, +¢,K3,e™ +...+¢,K,e" (5.151) 
where 


X, =[B, cos Zt-B, sin Bt] e*’, 
X, =[B, cos ft +B, sin Br]e*’ 


If there is more than one pair of complex conjugate eigenvalues, we can add 
another pair of eigenvectors accordingly. 


(5.152) 


Proof: Let a pair of complex eigenvalues of the real matrix A be A; = ati and A2 = 
aif, and further assume that the eigenvector for Aj is: 


K, = B,+iB, (5.153) 
By definition, we must have K; satisfy 

AK, =/,K, (5.154) 
Taking the complex conjugate of (5.154), we obtain 

AK, =A, K, (5.155) 
However, A is real and the complex conjugate of A; is A2 and (5.155) becomes 

AK, =4,K, (5.156) 
By definition again, we must have the eigenvector of Az satisfy 

AK, =2,K, (5.157) 


Comparison of (5.156) and (5.157) shows that 
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K, = K, =B,-iB, (5.158) 
Thus, the general solution due to the complex conjugate pair is: 
X =c,(B, +iB,)e'*'™ + ¢, (By -iB,)e 
=c,e"'(B, +iB,)(cos Bt + isin Bt) +c,e~' (B, —iB,)(cos Bt—-isin Br) (5.159) 
=(c,+¢,)e™ (B, cos Bt -B, sin Bt) + i(c, —c, )e“' (B, sin Bt + B, cos ft) 
Since we start with a real ODE system, we should expect the solution to be real. 


This can be done easily by choosing cs as a complex conjugate of ca. Thus, we can 
express the solution as 


X =c,e“'(B, cos ft —B, sin Bt) + c,e“' (B, sin Br + B, cos Br) (5.160) 
where c, and c2 are real constants. This completes the proof of (1.151). 


Example 5.10 Find the general solution of the following system of ODEs 


= =2x+3y 
(5.161) 
a =2x+ 
dt - 
Solution: The matrix form of the system is 
, 2 3 x 
X'= AX = xX, X= (5.162) 
2. y 


The eigenvalue of the system is 

2-A 3 
2 I1-a 

The eigenvalues are 4 = —1 and 4. We have two distinct real roots for the 


eigenvalues, and this corresponds to case | discussed in Section 5.6.1. For A =—1, 
the eigenvector can be calculated as 


det(A — AI) = 


=2 —3A-4=(A+1(A-4)=0 (5.163) 


3 3k, 
(A-ADK, = =0 (5.164) 
2 2\/k, 
Both of these equations lead to 
kK, +k, =0 (5.165) 
By choosing k; = 1, we have kz = —1 and the eigenvector becomes 
1 
K, -| ’ (5.166) 


Note that (5.166) is clearly not the only choice. In fact, any combination of k; and 
ky that satisfies (5.165) will give an appropriate eigenvector. The corresponding 
solution vector is 


1 
xX, =K,2" -| ie (5.167) 


For A = 4, the eigenvector can be determined from the following system 
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(A-ADK, = me SI ag (5.168) 
PD: Sell eel = 
This provides a relation between the two components of the eigenvector: 
2k, —3k, =0 (5.169) 
Choosing k; = 3, we have kz = 2 and obtain the eigenvector as 
3 
Kk, -| (5.170) 
2 
The solution vector is 
3 
X, =K,e" = le (5.171) 
Finally, adding these solutions with two unknown constants gives 
1 3 
X=c,X,+c,X, =¢, j e+e, Hl ef (5.172) 


In a later section, we will discuss the behavior of this solution as t > ©. 


Example 5.11 Find the general solution of the following system of ODEs 


a =-4x+y+z 
dy 
== = 5.173 
apr +5y-z ( ) 
dz _ = 
dt nse 
Solution: Putting the system in matrix form yields 
4 1 1 x 
X'=AX =) 1 5 -1/X, X=|y (5.174) 
0 1 -3 Z 
The eigenvalue of the system can be determined from 
det (A-AI) =-(2+3)(A+4)(A—5) =0 (5.175) 


The eigenvalues are 1 = —3, —4, and 5. We have three distinct real roots for the 
eigenvalues, and this again corresponds to case | of Section 5.6.1. For A = —3, the 
eigenvector can be calculated as 
-1 1 1]/4 0 
(A-ADK, =| 1 8 -1)/ ‘A, |=/0 (5.176) 
0 1 O|fk, 0 
The third row of the system gives kz = 0. The first row gives 
-k, +k, =0 (5.177) 


Therefore, we can set the eigenvector as 
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1 
K, =| 0 (5.178) 
1 
For A = —4, the eigenvector can be calculated as 
01 144, 0 
(A-ADK, =|1 9 -1l|k, |=] 0 (5.179) 
0 1 Lik, 0 
The first and third rows of the system give 
k, +k, =0 (5.180) 
The second gives 
k, + 9k, —kz =0 (5.181) 


Setting 43 = 1, we have kz = —1. Substituting these values into (5.181) gives k; = 10 
and the eigenvector becomes 


10 
Kk, =|-l (5.182) 
1 
For A = 5, the eigenvector can be calculated as 
9 1 lik 0 
(A-ADK, =) 1 0 -1|| 4, |=/0 (5.183) 
0 1 -8]| k; 0 
The second row of this system gives 
k,-k, =0 (5.184) 
The third row of this system gives 
k, —8k, =0 (5.185) 
Taking k; = k3 = | into (5.185) gives kz = 8, and thus yields 
1 
K, =| 8 (5.186) 
1 
Combining all these eigenvectors, we finally obtain the general solution as 
1 10 1 
X =| 0|e* +e,| lle“ +.0,| 8 |e* (5.187) 
1 1 1 


Example 5.12 Find the general solution of the following system of ODEs 
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1 —2 2 
M =AV SS 1 Se (5.188) 
2 —2 1 
Solution: Solving the eigenvalue problem gives 
det (A —AI) =-(A+1) (2-5) (5.189) 


The eigenvalues are A = —1, —1 and 5. Since A is Hermitian (or symmetric), this is 
Case 2.1 with a repeated eigenvalue. We should be able to obtain two independent 
eigenvectors for the repeated root for A =—1. 
For A =-—1, the eigenvector can be calculated as 
k, 2 2 2 \|k 0 
(A-AD]} k, |=|-2 2 -2]|/k, |=] 0 (5.190) 
k, 2 2 2\||h 0 
This system implies 
k, -k, +k, =0 (5.191) 
There are three unknowns in (5.191) and two different solutions can be found from 
it. In particular, for the first solution, we set k; = 0 and kz = 1, then (5.191) gives k3 


= |. Alternatively, we set 4; = 1 and kp = 0, then (5.191) gives k3 = —1. Thus, we 
have 


0 
K,=|1|, K,=] 0 (5.192) 
1 -1 
To check the independence of these two vectors, we require 
cK, +c) K, =0 (5.193) 
be satisfied only for c; = c2 = 0. Substitution of (5.192) into (5.193) gives 
0 1 0 
c} l}+e,} 0 |=] 0 (5.194) 
1 -1 0 


The first equation of it implies cz = 0, the second implies c; = 0, and the third 
requires cz = c;. Thus, our random choice of picking two eigenvectors by (5.191) 
does arrive at the independent eigenvectors for the same repeated eigenvalue. In 
fact, setting 4; = 0 and kz = | and vice versa to find the independent eigenvectors is 
one of the best choices. There are many different forms of these independent 
vectors K and the solutions may appear differently, depending on the choices for ky 
and kj. However, one can show that when initial conditions are imposed, all of 
them will lead to the same solution (e.g., see Problem 5.15). The forms obtained in 
(5.192) are among the most efficient forms in satisfying the initial conditions. 
For A = 5, the eigenvector can be calculated as 
k, 4 2 2)/k, 0 
(A-AD| k, |=|-2 -4 -2]|| 4, |=] 0 (5.195) 
k, 2 2 -4/\k 0 
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The first row adding —2 multiplying the second row gives 

k, +k, =0 (5.196) 
The first row adding 2 multiplying the third row arrives at the same equation. We 
can set k3 = 1 and ky = —1. The first row finally yields k; = 1 and, thus, the 
eigenvector becomes 


1 
K=|-1 (5.197) 
1 
The general solution becomes 
0 1 1 
X=¢|1le‘+c| 0 |e* +e,|-1|e™ (5.198) 
1 =] 1 


Example 5.13 Find the general solution of the following system of ODEs 


2 1 6 
X'=AX=|0 2 5/X (5.199) 
0 0 2 
Solution: Solving the eigenvalue problem gives 
det (A—AI) =(2-Ay (5.200) 


The eigenvalues are A = 2, 2, and 2. Since A is non-Hermitian (or unsymmetrical), 
this is Case 2.2 with an eigenvalue with multiplicity of m being 3. Let us illustrate 
how to find the eigenvectors. 

With A = 2, the matrix equation becomes 


k, 0 1 6|k 0 
(A-2D|k,}=|0 0 Sj{k, |=|0 (5.201) 
k, 0 0 O|/k, 0 
This system implies 
5k, =0, k,+6k, =0 (5.202) 
This yields both kz = k3 = 0, and subsequently we have 
1 
K,, =|0 (5.203) 
0 
Recall from the second equation of (5.144) that 
(A-2DK,, =K,, (5.204) 


Substitution of (5.203) into (5.204) results in 
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0 1 6||k, 1 
0 0 5S|/k,]=/0 (5.205) 
0 0 Olk, 0 
This system gives 
5k, =0, k,+6k,=1, k, =0 (5.206) 
The corresponding vector is 
0 
K,, =| 1 (5.207) 
0 
Similar to (5.204) we can continue the calculation using 
(A—2DK,,; =K,) (5.208) 
Using (5.207) in (5.208), we arrive at 
0 1 6\| kh 0 
0 0 SIA, J=}1 (5.209) 
0 0 Ollk, 0 
This is equivalent to 
5k, =1, k,+6k, =0 (5.210) 
With these constraints, we can choose 
0 
K, 3 =| -6/5 (5.211) 
1/5 


Recall (5.142) that the general solution can be se alia as: 


X=¢K, ye" +¢,(K, le” 14K e")+6,(K,, & Fo +K, ste" +K, 3e7) (5.212) 


2 
The general solution is therefore: 


1 1 0 1] , 0 0 
X=c,| 0 fe” +c, 4] 0 [te +] 1 fe b+c,4| 0 met 1 |te7’ +| -6/5 |e”! 
0 0 0 0 0 1/5 


(5.213) 
Note that the matrix A is unsymmetrical and thus we could not find 3 linearly 
independent eigenvectors. 


Example 5.14 Find the general solution of the following system of ODEs 


2 4 
x aaxe|* A: (5.214) 


Solution: Solving the eigenvalue problem gives 
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2-A 8 > 
det(A — AD) = =A°+4=0 (5.215) 
-l1 -2-A 
The eigenvalues are A = 2i, and —2i. For 4 = 2i, the eigenvector can be evaluated 
as: 
2-2i 8 k, 0 
(A-ADK = = (5.216) 
-1 -2-2i||k, 0 
The second equation of (5.216) requires 
—-k, -2(1+i)k, =0 (5.217) 
Let ky = —1; we have k, = 2+2i. Thus, the corresponding eigenvector is 
2+2i 
K, -| ; (5.218) 


Actually, it is straightforward to show that (5.217) is equivalent to the first equation 
of (5.216). To see this, we can multiply (5.217) by —2 (1—i) to get 
2(1-1)k, +4(1-DU+ Ak, = (2-2i)k, + 8k, =0 (5.219) 
Thus, we arrive at the first equation of (5.216). Recall that the eigenvector is 
defined as: 
K, = B, +iB, (5.220) 
Comparison of (5.218) and (5.220) gives 


: -l > 0 ‘ 


X, =[B, cos St—B, sin Btle", X, =[B, cos Bt +B, sin Br]e*’ (5.222) 
Finally, the general solution is obtained by combining (5.221) and (5.222) 


2 2]. 2 Zell; 
X=¢ {2 cos 2t — Hl sin 2} +, {¢ cos 2t + *] sin 2} (5.223) 


5.7 NONHOMOGENEOUS SYSTEM WITH CONSTANT COEFFICIENT 


So far, we have only considered the special case of the homogeneous ODE system. 
If F is nonzero, the general solution comprises two parts, just as the case of the n- 
th order ODE: 
XXX, 
= c,Xy +C)Xq te +c,X, +X 


n*n P 


(5.224) 


where the complementary solution or homogeneous solution is denoted by X, and 
the particular solution for the nonhomogeneous equation can be denoted by X;. 
Written explicitly, the complementary solution can be expressed as 
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x1) X49 (t) x, (t) 
v5 malt ) m = (t) Bice : 2 (t) 
Xnl (t) Xn2 (t) ae (t) 


(5.225) 
CX, 1 (t) + Co X19 (1) +... + 6,41, (0) 


_ Je (t) + €pX9 (1) +...4+.6,.%9, (1) 
Xn] (t) + CoX 42 (¢) +t CnXnn (t) 


Before we continue to consider the particular solution, we note that the 
complementary solution can be written in terms a fundamental matrix defined as: 


mit) X20) XC) 
(1) = “alo *al) - an) (5.226) 
Xnl (t) Xn2 (t) Xan (t) 


which is formed by combining the complementary solutions. The first column of ® 
is formed by the first vector solution and so on. More specifically, we can see that 


XQ) AO - MH, |) q 
Xx, =@(1)C= mall a ~ an) C2 
Xnl (¢) Xn2 (t) sth Xan (t) Ch 


(5.227) 
C11 (2) + CoX9 (Ot) +... +O, X1y (0) 


C1Xq1 (1) + Co Xp9 (1) +... Fy Xp (D) 


CXn1 (¢) Bg CoXn2 (¢) +...+ CnXnn (¢) 
which is clearly equivalent to (5.225). 


5.7.1 Undetermined Coefficients 


The method of undetermined coefficients establishes a number of rules of how to 
assume the particular solution. Table 5.1 summarizes some simple rules for 
assuming a proper particular solution for various forms of nonhomogeneous terms. 
These terms involve a constant vector, polynomials of variables with a constant 
vector, an exponential function with a constant vector, a sine or cosine function 
with a constant vector, or finite terms or products of these functions. For other 
functions, there are no rules for guessing particular solutions, and we have to use 
the method of variation of parameters, which will be discussed in the next section. 
Note that the last four rows in Table 5.1 are for the cases of nonhomogeneous terms 
matching with the homogeneous solution. 
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Table 5.1. Table for the method of undetermined coefficients 


root F(t) X, (0) 
C A 
Ct” At" +..t At+ Ap 
C cos bt, C sin bt Acosbt +B sin bt 
cre eo (A,t" +...4+ At t+ Ap) 
Ce cos bt, Ce“ sin bt e“ (Acosbt + Bsin bt) 
Ct" cos bt, Ct" sin bt cos bi(A,t" +...+ At+ Ap) 
+sinbt(B,t" +...+ Bit+ Bo) 
tid Ccos At, C sin At (A, +tA,) cos At + (B, +tB,) sin At 
A Ce™ (At + Bye 
A Cte e"[(A, +B, t)t" +...+(A, + Bit + (Ay + Bot)] 
atiB | Ce* cos Bt, Ce™ sin Bt e“[(A, + A,t) cos Bt +(B, + Bot)sin Bt] 


To illustrate this method, some examples are considered. 


Example 5.15 Find the general solution of the following system of ODEs 


B= 5x43y—2e* +1 
: (5.228) 
pe —5t+7 


Solution: First, we need to solve for the complimentary solution of the 
homogeneous system: 


X/ = AX, (5.229) 
where 
A -| ; 1 (5.230) 
-1 1 
The eigenvalues of the system are easily determined as 
5-A 3 > 
det (A — AI) = 4 =A°-6A+8=(A-2)A-4)=0 = (5.231) 
Without going into detail, it is straightforward to obtain the following 
complimentary solution: 
X,=¢ [Je +, Je" (5.232) 


The nonhomogeneous term can be written as: 


FO oer 


(5.233) 


+ 
| 
aA & 
—— | 
~ 
+ 
a | 
= ob 
ss | 
S) 
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Clearly, there is no match of the homogeneous solution and according to the 
guidelines given in Table 5.1, we can assume the particular solution as 


+att+a,e" 
X, (1) |e fds | een ee ee (5.234) 
bh | |b b; b, +bot+b,e" 
Substitution of this particular solution into the original system gives 
|* —a,e" | 7 * +3, + (Sa +3b,)t+ (Sa; +3b;)e~" | . —2e‘ +1 | (5.235) 


b, —b,e" —a, +b, + (-ay +b, t+ (az +b, )e~ e'—5t+7 
By matching the coefficients on both sides of the first equation of (5.235) gives 


—5Sa,+a,—3b,=1, Sa,+3b,=0, 6a, +3b; =2 (5.236) 
The second equation of (5.235) leads to 
a,—-b,+b, =7, -a,+b,=5, a,—2b; =1 (5.237) 


The solution for this system of equations gives 
a =35, b=, a =-B, bh =B, a =5, b=-4 6.238) 


32’ 32’ 8 8° 15’ 
Finally, the solution of the system becomes 
; A 35 15 7 
Me.) let ee;| lee Oo be ea e038) 
-1 -1 89 25 4 
32 8 IS 


Example 5.16 Find the general solution of the following system of ODEs 


1 1 -8 
X'=AX+F, A= , F= (5.240) 
1 1 3 
Solution: The eigenvalues of the system are easily determined as 
1-A 1 7 
det (A—AI) = iA =(1-4)* -1=A-2)=0 (5.241) 
The eigenvalues are A = 0, 2 and the corresponding homogeneous solution is 
1 1 
X,=¢ i +5 Hl ev (5.242) 


The nonhomogeneous vector F is a constant vector, which matches the solution 
corresponding to 4 = 0 (see the first vector on the right hand side of (5.242)). 
Therefore, we seek the particular solution in the following form 


aq a 
X, (t) = + t 5.243 
O-Ls Le 620 
Substitution of (5.243) into (5.240) gives 
a |_[Hth | Ja +e) 8 (5.244) 
by a, +b, a, +b, 3 


This equation leads to 
ay =a, +b, —-8, b, =a,+b,+3, a,+b, =0 (5.245) 


Systems of First Order Differential Equations 353 


Subtraction of the second equation from the first equation gives 


a, —b, =-11 (5.246) 
Solving this and the last equation of (5.245) simultaneously, we obtain 
a, =—-U (5.247) 
2 
by = u (5.248) 
Back substitution of (5.247) and (5.248) into (5.245) gives 
a, +b, =3 (5.249) 
Choosing a; = 0 (this is chosen for the sake of simplicity), we have 
i 2 (5.250) 
In summary, we have the following constants 
5) 11 11 
a, = 0, Ds a eT by = 5 (5.251) 
Finally, the solution of the system becomes 
iat 
l He J bal 2 
X=¢, ; +€, i ew +] 5 |+ fi t (5.252) 


Note again that this solution form is not unique, depending on what we assume for 
a. However, there is a unique solution if initial conditions are prescribed. 


If the given nonhomogeneous terms do not match those given in Table 5.1, we have 
no choice but to use the method of variation of parameters, which will be discussed 
next. 


5.7.2 Variation of Parameters 


The general solution has two parts, a complimentary solution with a constant vector 
C and a particular solution in terms of integration: 


X= O(1)C+ (0) [O" ()F(0)at (5.253) 
where @is the fundamental matrix defined in (5.226) and reported here again as 
1) M2) so 4, C 
®(t)= *ai(t) a) ” “ant he ©2 (5.254) 
Xnl (¢) Xn2 (¢) Xn (t) Ch 
For the case that initial values are given 
X(t) = Xp, (5.255) 


the general solution becomes 
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X(t)= (1) (1) )Xp + (2) ik @' (c)F(r)dz (5.256) 


To prove these results, we first recall from (5.227) that the homogeneous 
solution can be written as 
X,(t)=@(t)C (5.257) 
This gives the first term in (5.253). The method of variation of parameters assumes 
that the particular solution of the system can be written as 


X,, (t) = (2) U(¢) (5.258) 
where U is an unknown vector function. Differentiating (5.258) gives 
X’, = @'(¢)U(t)+ B(¢)U(z) (5.259) 
Substitution of (5.259) into (5.121) leads to 
@'(t)U(t)+@(t)U'(¢) = A®(t) U(t) + F(t) (5.260) 
Since the fundamental matrix ® is formed by the homogeneous solution, we have 
®'(t) = A®(r) (5.261) 
In view of (5.261), (5.260) becomes 
®(t)U'(t) = F(t) (5.262) 
Multiplying both sides by the inverse of the fundamental matrix, we find 
U'(t)=@' (¢) F(t) (5.263) 


Thus, U can be found by integrating both sides with respect to ¢ and this result leads 
to 


X,, (1) = (0) U(e) = (0) [O" (e) Fe) (5.264) 


Summation of (5.257) and (5.264) gives the final required result. 
For initial value problems with given condition 


the integration of the general solution can be revised as 
t 
X(1)=@(1)C+@(0)[ O(c) F(r)dz (5.266) 
lo 
Substitution of the initial condition yields 
Multiplying the inverse of the fundamental matrix gives 
C=@'(t)) Xp (5.268) 
Back substitution of this constant vector into the solution finally gives 
t 
X(1)= @(1) ® (i )Xp + O(c) f O'(e) P(e) dz (5.269) 
i) 


This completes the proof of (5.256). 


Example 5.17 Find the general solution of the following system of ODEs by 
variation of parameters 
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| > 4} -[a} x-[ 
X'=AX+F, A= , F=|_ |, x)= (5.270) 
2 e! 0 


4 
Solution: The eigenvalues of the system are easily determined as 
-3-A 1 
det (A —AI) = 5 j 1 =A? +7A410 =(A4+5)(A+2)=0 (5.271) 
The eigenvalues are A = —2, —5 and the corresponding homogeneous solution is 
1 1 
X.() =< le +0, lee (5.272) 
The fundamental matrix can then be formulated as 
-2t —5t 
@(r)=|" © (5.273) 
ett 26 
The inverse of the fundamental matrix is 
1 yet pit — fe" 
©" (t)=—__ = (5.274) 
det(@(t))| ee 1 ot al 
3 3 
This inverse can be used to evaluate 
Se fen 3t 2te*" +3! 
@' (1) F(t)= = (5.275) 
1 pst _1 pst e! te aly 
3 3 3 
Finally, the particular solution is obtained as 
= 
X, (1) = (0) [1 ()F()ar 
= a, lo 6, 27,1, 
_ E os | (210% +4e')at [3-354 (5.276) 
—2t —5t 
e —2e te! —Let*)\y SpoZle hy 
J Joet Zeya | | 3-456 
The final solution becomes 
6, _27,1,- 
= —ft Pee ay — 
= _ 1] 5 1 | 5, 5 50 z 4 
X(t)=X, (4)+X, (th=q|, |e +e, e+ (5.277) 
- 1| —2 J¢221. 1 
5 50 2 
Finally, we apply the initial condition to (5.277) to get 
1 1 1 274.1 1 1 a 
=e | ees! ee Se ae, | lal. 2. aR 
0 1 2) |.214.1 1 —2 oe, 
L 50 2 25 
The solution for the constants is 
121 137 
c= » C= 5.279 
200’ 7 200 a 


The final solution becomes 
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1 1 6,27 4 I yt 
KGa lee) letale. 203 (5.280) 
200[1|° * 200| -2 55 Ot gD 
50 2 


Note that there is no unknown constant in this solution because the initial 
conditions have been satisfied. 


5.8 SYSTEM OF NONLINEAR ODE 


So far, we have assumed that the system of first order ODEs is linear. Now we can 
extend our consideration to the more general case of a nonlinear system: 

dix dy 

®—Axy), S= foxy) (5.281) 
where fi and f2 are generally a nonlinear function of the unknown functions x and y. 
It is straightforward to extend the system to equations with order higher than 
second. For equilibrium solutions not changing with time, we must have 


dx dy 
aX_g “S 5.282 
dt ° dt : ( ) 
Thus, the equilibrium solution can be found by solving: 
Ai(tXe.¥e)=90, foltx.,¥.) =0 (5.283) 


Note, however, that (5.283) may not be easily solved to get the closed form 
equilibrium solution, depending on the given functions f; and /2. 


5.9 STABILITY OF AUTONOMOUS SYSTEM 


The stability of an autonomous system was considered mainly by Poincare in 1881 
and Lyapunov in 1892. If fi and f2 in (5.283) are independent of ¢, the coupled 
system is called autonomous because its functions on the right hand side of (5.281) 
are independent of time (or are not affected by time variation and thus lead to the 
term autonomous). We can first rewrite the system of ODEs about the equilibrium 
solutions as: 

§=xX—-%X; 17=V—Ve (5.284) 
The magnitude of these variables indicates how far the current solution is from the 
equilibrium solution. Thus, as ¢ > o if both € and n — 0, the solution is clearly 
stable. The system of ODEs in the perturbation solution (in the sense that it is 
perturbed from the equilibrium solution) is 


d d 
oR én, HR En) (5.285) 
The stability of the solution of this system around an equilibrium solution can be 
examined by investigating the eigenvalue of the linearized system of (5.285). First, 
we can linearize (5.285) by expanding the functions on the right of (5.285) in 
polynomials of & and n. Dropping all higher order terms other than linear, we have 
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dé dn 
a a6 +977, a y16 + Ago (5.286) 


where aj are constants. This is precisely the system of first order ODEs that we 
have discussed so far. Let us consider the eigenvalue of the system as: 


-A 
det(A — A) =|“! | 
Ay, Ay —A 
= 2 — (ayy +ayy)A + ayy — 44749, (5.287) 
=/—pl+q=0 
The eigenvalues are: 
+] p* -4 = 
Ayy = PNP TL pia (5.288) 


2 2 
As we have shown previously, the general solution of the perturbation around the 
equilibrium solution is 
X =K,e“" +K,e” (5.289) 

Let us now consider the possible long-term behavior of the perturbation solution. If 
A1 and A are real, the solution either increases indefinitely with time or decays with 
time, depending on whether the eigenvalue is positive or negative. If 11 and Az are a 
complex conjugate pair, the solution increases or decays depending totally on 
whether the real part of the eigenvalue is positive or negative. The imaginary part 
of the eigenvalue clearly will only lead to oscillating solutions of sine and cosine 
(recall Euler's formula). 

A number of scenarios are possible. The following terminology of the 
stability of ODEs was introduced by Henri Poincare in 1881: 


(1) If g > 0 and p < 0, the roots are either complex conjugates with negative real 
parts, or both real and negative. The solution is "asymptotically" stable, in the sense 
that the analysis of stability is valid if the domain being considered is so close to 
the equilibrium point that the effects of all nonlinear terms can be neglected. 
Therefore, such analysis is called linear stability analysis. If A = 0, the eigenvalue is 
repeated and both of them are negative. Such an equilibrium point is called a stable 
proper or stable improper node, depending on whether the equal eigenvalue system 
has two or one independent eigenvectors (recalling from the fact that A is either a 
Hermitian matrix or non-Hermitian). If A > 0, it is an asymptotically stable node. 
If A <0, the eigenvalues are complex conjugate pairs and it is an asymptotically 
stable spiral point. It is called spiral because of the oscillations accompanying with 
the decay. 


(11) If g > 0 and p > 0 with A > 0, both the roots are positive. The equilibrium point 
is called an unstable node. If A = 0, it is called an unstable proper or unstable 
improper node, depending on whether there are two or one independent 
eigenvectors (Hermitian or non-Hermitian A). If A <0, the equilibrium point is an 
unstable spiral point. 
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(iii) If g > 0 and p = 0, then the roots are purely imaginary. The solution will 
neither increase nor decay. In fact, the solution is a sine or cosine function and it is 
a purely oscillating function. The equilibrium point is called a stable centre. 


(iv) If g < 0, then the roots are real, and we must have A > 0. If p > 0, both 
eigenvalues are real with one of them positive. If p < 0, both eigenvalues are real 
with one positive and one negative root. In either case, the behavior of the 
equilibrium point is the same and is called an unstable saddle point. It is called a 
saddle point in the sense that if the initial condition is given such that a solution is 
precisely in the direction of the eigenvector of the negative root, the solution will 
decay to zero. If any initial condition leads to a slight deviation from the direction 
of this eigenvector of the negative eigenvalue, then the solution is unstable. The 
situation is similar to walking on the ridge leading to a saddle of a mountain. Any 
step slightly deviated from the ridge will lead to falling down the cliff (unstable) 
and you will never travel safely to the saddle (stable). 
These scenarios are summarized in Figure 5.2 in the p-q space. 


Unstable spiral 
Asymp. stable q point 
spiral point 


(©) stable tenter ©) 


A=p-4q=0 


Asymp. stable Unstable node 


node 


Figure 5.2 Stability classifications for 2-unknown-system 


Table 5.2 summarizes the type of stability for an equilibrium point for the case of 
two unknowns. This system is proposed by Poincare in 1881. A system of three 
equations and three unknowns can be classified using a similar idea, but the 
classification is more complicated and difficult to present in graphical form like 
Figure 5.2. Reyn (1964) presented a systematic classification of the three 
differential equation systems. It has been successfully applied to analyzing 
landslides by Chau (1995, 1999b) using bifurcation theory on creeping slopes 
obeying a two-state variable friction law. 
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Table 5.2. Stability classification for 2-D autonomous system 


q 2) A Stability type of the equilibrium point 

>0 <0 = Stable proper or improper node 
>0 Asymptotically stable node 
<0 Asymptotically stable spiral point 

>0 >0 = Stable proper or improper node 
>0 Unstable proper or improper node 
<0 Unstable spiral point 

>0 =0 <0 Stable center 

<0 >0 Unstable saddle point 


Example 5.18 Consider the stability of the following system 


a =2x+3y 
(5.290) 
2 =2x+ 
dt - 
Solution: This solution is given in Example 5.10 as 
1 3 
X=¢X, +¢,X, =a] le +0, ple (5.291) 


For this case, we have 

P=(@1, +47) =3, GF =A Ay) — Aya), =2-6=-4 (5.292) 
This is an unstable saddle point. The behavior of the solution is depicted in Figure 
3. 


cq <0, c,=0 c, <0, cy, >0 


c,>0, c,>0 


e <0, ec <0 Xx, 


4 
c,=0, c, <0 


Figure 5.3 Stability at the equilibrium point for Example 5.17 
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If the initial condition of the system is such that it locates along the direction of the 
first eigenvector (1, —1), the solution is stable. As shown in Figure 5.3, the solution 
returns to the origin. However, if the initial condition does not locate exactly along 
this direction, the solution is unstable. Figure 5.3 shows that as t > oo the solution 
goes to infinity along the direction of the second eigenvector of (3, 2). Note the 
exponential function for the first eigenvector is an exponentially decaying function 
whereas the exponential function for the second eigenvector is an exponentially 
increasing function. This is like a saddle at a mountain ridge. You will not fall 
down the cliff only if you walk along the edge of the saddle. Therefore, Poincare 
termed this an unstable saddle point. 


5.10 SUMMARY AND FURTHER READING 


This chapter discusses the solution technique for solving a system of first order 
ODE. It is important since all systems of ODEs, no matter the order and the number 
of coupled ODEs, can always be converted into a system of first order ODEs. For 
nonlinear systems, when we apply a numerical technique (such as the fourth order 
Runge-Kutta method discussed in Chapter 15) to solve the system of ODEs, it is 
always advisable to rewrite it as a system of first order ODE. 

First, we demonstrate the technique of elimination for solving a system of 
ODEs analytically. The concept of a multiplier system for elimination of unknowns 
in a system of equations is introduced. The determinacy of a system of equations is 
also discussed, and using this technique, we can identify whether the solution is 
possible for a system of higher order ODEs. For a linear system of first order 
ODEs, we review the Hermitian matrix (which has a profound effect on the solution 
form of the system) and the eigenvalue problem. It is shown that a system of first 
order ODEs with constant coefficients can always be converted to solving an 
eigenvalue problem of a matrix. Four scenarios of the solutions are discussed. They 
are real distinct eigenvalues, repeated eigenvalues for Hermitian and non-Hermitian 
matrices, and complex conjugate eigenvalues. All solution forms are proved 
analytically before examples are presented for each scenario. For a 
nonhomogeneous system of first order ODEs, we present both the undetermined 
coefficient method and the method of variation of parameters. Linear stability of an 
autonomous system is briefly discussed. 

For those readers who are interested in the historical development of matrix 
theory and its relation to solving differential equations, we highly recommend the 
review paper by Hawkins (1975a,b; 1977) on matrix theory. 


5.11 PROBLEMS 


Problem 5.1 Solve the following system of lst order ODEs 
oe sax (5.293) 
dt 


where 
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A= F S| X(0) = a (5.294) 
2 -9 |’ 1 , 
Ans: 
X= le = a ie (5.295) 
Problem 5.2 Solve the following system of 1st order ODEs 
EE 5x (5.296) 
dt 
where 
A= an 2 
7 ; (5.297) 
Ans: Note the solution is not unique if we do not normalize the eigenvectors 
1 | 3] 6 
X=¢ [le +, 2 (5.298) 
Problem 5.3 Solve the following system of Ist order ODEs 
aS AX (5.299) 
dt 
where 
a-[> A xo=[)| (5.300) 
—3 5 2 
Ans: 
: / 21 Hl 24 
X=|_ |e +3] Ite (5.301) 
2 1 
Problem 5.4 Solve the following system of Ist order ODEs 
EX AX (5.302) 
dt 
where 
al 
= ; ; (5.303) 
Ans: 


1 0}. 4 0 1]. 4 
X=c, i cos 2t — 5 sin 2t re +c) : cos 2+ i sin2tre (5.304) 


Problem 5.5 Solve the following system of 1st order ODEs 
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2 5 = 4x, -x, 
7 _ (5.305) 
3 al + 2 = 9y, Xo 
dt dt 
Ans: 
ree Cee 
X=c¢, A +Cy fe (5.306) 
Problem 5.6 Solve the following system of Ist order ODEs 
Ping (5.307) 
dt 
where 
gel 2 (5.308 
1 -2 oe) 
Ans: 
1 —3t 1 —t 
X=¢ i e+, ; e (5.309) 


Problem 5.7 Solve the following system of Ist order ODEs by variation of 
parameters 


a (5.310) 
dt 
where 
—2 1 me 
A= wears (5.311) 
1 -2 3t 
Ans: 


X=<¢, je +0, fle + a} aH te’ +) 45) (5.312) 


Problem 5.8 Solve the following system of 1st order ODEs 


OR AX (5.313) 
dt 


where 


gel MF sel 5.314 
hae ast Ol ies 


x=[ Jetsa] fe (5.315) 
1 4 


Ans: 
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Problem 5.9 Solve the following system of 1st order ODEs 


dX _ 
a 
where 
Aal , seoyel* 
“sf ¥-[9 
Ans: 


dX _ 
a. 
where 
nell He 
“[; 3} ¥0-[, 
Ans: 


Problem 5.11 Solve the following system of lst order ODEs 


aX _ 
oo 
where 
he 1 3 xo 1 
“Ea 5} *o-[ 
Ans: 


1 1 
X=] le +3] | te 
2 1 


Problem 5.12 Solve the following system of Ist order ODEs 


dx = 
dt 
where 
3 -18 2 
2 -9 1 
Ans: 


(5.316) 


(5.317) 


(5.318) 


(5.319) 


(5.320) 


(5.321) 


(5.322) 


(5.323) 


(5.324) 


(5.325) 


(5.326) 


(5.327) 
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Problem 5.13 Solve the following system of Ist order ODEs 
dX 


co“ = AX (5.328) 

dt 
where 

i X(0) = : (5.329) 
A QP Vs 1 
Ans: 
Hl a1 
X= ile (5.330) 


Problem 5.14 For the weather system on Earth, air current may flow parallel to a 
contour of equal pressure (called isobars). This happens when the Coriolis force 
(force due to Earth’s rotation) and pressure force balance each other. In geophysics, 
this is called geotrophic flow (see Figure 5.4). The following 2-D coupling system 
(a special form of Navier-Stokes equation) for velocity along North-South and 
East-West directions (v and w) can be used to model it: 


d°u 
oP cds 
Zz (5.331) 
2 
y= alu-u") 


where u and v are the flow speed in horizontal directions. The other parameters o, 
uu, and u* are the Coriolis constant, kinematic viscosity of air, and uniform flow 
velocity of u far from earth’s surface (all are assumed as constants). 


cP) 
Coriolis force 


Actual current 


Geostrophic current 
+<—____ 


Figure 5.4 Illustration of geotrophic flow 
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(i) Assume the following solution form: 
u=u*+Aexp(Az), v= Bexp(Az) (5.332) 
Find the characteristic equation of A and solve for i. 


(11) Redefine the variables as: 


x, =u-u*, 
= du/ dz, 
janine (5.333) 
X3=V, 
X4 = dv/dz 
(iii) Formulate the problem as a system of first order ODEs and find the matrix A 
aa 
' x 
X'’=AX, X= (5.334) 
x3 
X4 
(iv) Find the solution of the system. 
Ans: 
fo 1 0 0] 
0 0 -= 0 
Ae ‘“ (5.335) 
0 0 0 1 
“0 0 0 
LL J 
Ay =t Saw, Apg SE “a-1 (5.336) 
: 2u ; 2u 
X=¢X, +ce,X, +¢;X, +c4Xq (5.337) 


ee 
X, -|, cos |“ +-B, sin ere =e (5.338) 
2u 2h 
S14 
X, -|P, cos [—1+B, sin Ze (5.339) 
2u 2u 


ere (5.340) 


a 
ey ale 
x, =| Dace |“ 1—D, sin Sle (5.341) 
2u 2u 
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1 0 1 0 
Va/ (2h) Va/(2u) — Ja / (2p) —Ja/ (24) 
FS ay. eR cg | ge al 
Va/(2u) —Ja/ (21) — Ja / (2p) Va! (2) 
(5.342) 
Problem 5.15 Reconsider Example 5.16: 
1 1 -8 
X'=AX+F, A= , F= (5.343) 
1 1 3 
(i) By choosing a; = a in (5.249) (where a is an arbitrary constant), show that the 
solution is 
11 
1 tg 2 
X=c, #€5| |e" +) 5 + t (5.344) 
-1 1 <-a 11 
ss 2 


(11) Show that this result is indeed a particular solution. 


(iii) Now consider the following initial condition, find the solution using either 
(5.252) or (5.344): 


X(0) = Hl (5.345) 


(iv) Show that both solutions given in (5.252) and (5.344) give the same answer. 


3] 4% 11 
5 eed Pee ee a (5.346) 
al7lali|© 72] 11 


Ans: 


CHAPTER SIX 


First Order Partial Differential Equations 
(PDEs) 


6.1 INTRODUCTION 


Nearly all textbooks on partial differential equations will cover second order PDEs, 
like the wave equation, diffusion equation, and potential or Laplace equation. 
However, the discussion of first order PDEs is normally not covered in elementary 
textbooks on PDE. It is considered as a more advanced topic. The method of 
solution is quite different from that for second order and the general solution for 
first order PDEs will include an arbitrary function of some characteristics line. 

Although we discuss in this chapter the first order PDE before discussing the 
second order PDE, historically the second order PDE was investigated before the 
first order. Bernoulli, Euler, D’Alembert and others considered the physical 
problem related to dynamics of rigid bodies, vibrations of membranes, and wave 
phenomena. These are second order PDE. The solution of first order PDEs was 
considered mainly by Lagrange, Clairaut, D’Alembert, Monge, Charpit, Jacobi, and 
others. 

We will first discuss the geometric interpretation of the solution of linear first 
order PDEs. The Lagrange method for solving linear first order PDEs is then 
introduced. For the case of nonlinear first order PDEs, the Lagrange-Charpit 
method and Jacobi’s method will be discussed, and the geometric interpretation of 
the solution of nonlinear first order PDEs in terms of Monge cones will also be 
covered. In the process, the idea of characteristics equations, a term introduced by 
Cauchy in 1819, and curves will be introduced. The concept of characteristics will 
be used again for second order PDEs of hyperbolic types (or wave types) in 
Chapter 7. 


6.2 FIRST ORDER PDE 


A nonlinear first order PDE can symbolically be written as 


Pag 6 (6.1) 
xX 


The existence of the solution of a nonlinear PDE cannot be guaranteed. If the first 
order PDE is linear, we can express it explicitly as 


PCs E+ OG, )) = = RC) (6.2) 
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where z is the unknown and x and y are the variables. A general feature of the 
solution of a first order PDE is that it normally involves arbitrariness in the 
solution. For example, consider the following first order PDE 


2 a a =0 (6.3) 
Ox Oy 
Now introduce a change of variable that 
z=@(u), u=ax+y (6.4) 
where @ is an arbitrary function. Differentiating (6.4) gives 
oz, Ou , Oz 1. OU , 
7 pu) — ag(u), a a ag'(u) oe ag'(u) (6.5) 


Substitution of (6.5) into (6.3) shows that it is identically satisfied by (6.4). Thus, 
the general solution includes an arbitrary function @. In fact, we will see in later 
sections that u is called the characteristics of the PDE. 


6.3 HYPERBOLIC EQUATION 


Physically, the first order PDE appears naturally in the modeling of the wave 
phenomenon. In the next chapter, we will see that the governing equation for a 1-D 
wave can be expressed as: 


fag =6 (6.6) 


Following the idea of factorization that we discussed in Chapter 3 for ODEs, we 
can factorize (6.6) as: 
2 2 
a eer gE ee 
Ot Ox’ «6 Ot Ox Ot 
Since c is a constant, the factorization in (6.7) is commutative. Therefore, the 
solution is the sum of the solutions of the following two first order PDE: 


a a 428, (e ee (6.8) 
Ox 


+c 
Ot Ox Ot 
Referring to the discussion on (6.3) above, we can easily see that the solutions of 
(6.8) are 


Ju (6.7) 


u=@(x+ct), u=y(x-ct) (6.9) 
Thus, the solution of (6.6) can be formed by adding the solutions of (6.8) as: 
u=(x+ct)+y(x-ct) (6.10) 


The validity of this solution can be demonstrated by substituting (6.10) into (6.6). 
Let us consider the physical meaning of (6.8) more closely. For stationary waves, 
the wave undulation does not change with time or we have mathematically 
Ou _ 
Ot 
If a wave is not stationary, the simplest type of wave equation is formed by adding 
an additional term as: 


(6.11) 
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=0 (6.12) 


where c is the wave speed of the propagating disturbance. Note that (6.12) is the 
simplest type of transport equation and it is also known as the kinematic wave 
equation. We have seen that the solution of (6.12) appears as 


u=y(x-ct) (6.13) 
Let us take a Galilean transform such that 
u(t, x) = v(t, &) = v(t, x —ct) (6.14) 


Taking differentiation of (6.14) using the chain rule yields 
Ou _ ov | ov 0§ Ov Ov cu_ ov 


c—, (6.15) 
ot ot O€ oO ot O0& ox oO 
Substitution of (6.15) into (6.12) yields 
Ou cu _ OV _4 (6.16) 


Ot Ox Ot 
In other words, we must have v = wé). That is, if we travel along with the 
propagating wave at speed c, we see an unchanging shape of the waveform. 
Physically, the signals travel along the characteristics of € = x—ct. This forms the 
basis of the method of characteristics. Mathematically, all these wave equations are 
called hyperbolic, which will be explained in detail in Chapter 7. 


6.3.1. Transport with Decay 


For real waves, there must be damping. Mathematically, we can add an addition 
term as 


OBO ant (6.17) 
Ot Ox 
Applying the change of variables given in (6.14), we have 
a t+av=0 (6.18) 
Ot 
This is a separable PDE, and (6.18) can be integrated to get 
Inv =—at+C, (¢) (6.19) 
Thus, the solution is obtained as: 
wit.) = f(e™ (6.20) 


Wie | e_ | ce 


Figure 6.1 Decaying traveling waves of Ist order hyperbolic PDE 
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Expressed in terms of the original variables, we have a decay wave along the 
characteristics as 


u(t,x) = f(x—ct)e™ (6.21) 
This solution is illustrated in Figure 6.1. 


6.3.2. Non-Uniform Transport 


If the wave speed is not uniform, we can model the hyperbolic wave as 


Ou Ou 
cs +c(x) Ti 0 (6.22) 
Let us consider a function h such that 
h(t) = u(t, x(t) (6.23) 
Taking the total differential with respect to ¢ using the chain rule gives 
dh(t) Ou Ou dx 
aa (t,x(t)) + es (t, x(t)) 7 (6.24) 
The speed of the propagating wave is defined 
dx(t) 
Ht c(x(t)) (6.25) 
Substitution of (6.25) into (6.24) and in view of (6.22), we have 
BES =0 (6.26) 
dt 
Therefore, a solution of (6.22) is given by 
A(t) = const. (6.27) 
along a characteristics curve. We can also determine the characteristics as 
cg = (6.28) 
c(x) 
Integrating both sides gives 
Q(x) = (= =t+k (6.29) 
c(x) 
where k is a constant. The characteristics is thus 
&=a(x)-t=k (6.30) 
The final solution is then obtained as 
u(t, x) = v(a@(x)-f) (6.31) 


6.3.3. Nonlinear Transport and Shock Waves 


We have seen in the last section that if the wave speed is a function of position, we 
can express the solution in a characteristics curve. If the wave speed depends on the 
unknown magnitude wu, the first order PDE becomes nonlinear. The simplest case is 
c =u (Whitham, 1974): 
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Ou Ou 

Ot Ox 
This equation is called the Poisson-Riemann equation is found useful in modeling 
the shock wave phenomenon, including traffic flow and flood waves in rivers. For 
the case of traffic flow, u is the traffic density. Since the wave speed equals the 
wave magnitude, a larger wave travels faster than a smaller wave. Thus, a larger 
wave behind a smaller one will eventually catch up and pass the smaller wave. 
Following a similar procedure in the last section, we seek a solution 


=0 (6.32) 


A(t) =u(t, x(t)) (6.33) 
Taking the total differential with me to ¢ using the chain rule gives 
dh(t) Ou dx Ou yt! 
a t +hex t))— =—+u— =0 6.34 
it (t, x(t) + ui) rare (6.34) 
Thus, we again have h = constant ae a characteristics. 
dx 
c= 6.35 
a (6.35) 
We can integrate (6.35) as 
&=x-ut=k (6.36) 
Thus, the general solution of the Poisson-Riemann equation is 
u=f(¢)= f(x—ut) (6.37) 


We are going to see that this solution for shock waves will break down at a certain 
time. Let us consider that 


ou Oo 0€ Ou 
= =f: = f'(E\1-t— 6.38 
ae Foes (6) Ae ISN ay (6.38) 
Note that the derivative Ou/Cx appears on both sides of (6.38). Solving for it gives 
ou __ FG) (6.39) 
ox =14+¢f'(é) 
We can see that 
of >o0, as t>t=- l (6.40) 
Ox £5) 


If the initial data of the wave profile is such that /(é) < 0, we will have a breaking 
wave as shown in Figure 6.2. Physically, the wave breaks down (breaking wave) as u 
becomes a multi-valued solution of x for ¢ > ¢*. This is a major characteristics of a 
shock wave. 


Multi-valued solution 


f'()<0 CH: os, 
Ox 
t<t* x t=t* x t>t* x 


Figure 6.2 Evolution and breaking down of a shock wave 
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Actually, the Poisson-Riemann equation can be interpreted in terms of a generalized 
conservation law: 
| 
uf + uss =0 (6.41) 
ot ox 
where T = 7(t,u,x) is a conserved density and X(tu,x) is the flux through a control 
volume. We can see that if we take a particular case 


T =u, Kae (6.42) 


we have 


AN ca (6.43) 
ot ox 2 Ot Ox 
We recover the Poisson-Riemann equation. Thus, the shock dynamics can be 


interpreted as the mass conservation law in integral form: 
d p22 x x 
“{ Tax = f° _ 
dt 3x, 3 


—dx =-| © dx =-x|? (6.44) 
x Ot x) OX aL 


Graphically, we can interpret this conservation law as shown in Figure 6.3. 


u OO sos u , Equal area rule u 
ox 
- i oa 
b 
ee a 
t=t* # 


t>t* x 


Figure 6.3 Physical law of conservation of mass 


The value of u across the breaking wave can be estimated as 


a x<ct 
u(t, x) = (6.45) 
b x>ct 
where 
a+b 
c= 5 (6.46) 


This is called the Rankine-Hugoniot condition. 

We have seen from this section that a first order PDE relates directly to the 
applications of shock waves, which is important in traffic and flood wave 
modelling in engineering and mechanics. 


6.4 AIRY’S METHOD FOR HYPERBOLIC EQUATION 


In this section, a method presented in Airy (1873) for analyzing a nonhomogeneous 
hyperbolic equation will be discussed. In particular, we consider a first order PDE 
of the following form 
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2 ge =a(x,y) (6.47) 
ox Oy 
Airy proposed the following change of variables 
u=axt+y, v=ext+fv (6.48) 
in which e and fare unknown constants. If af'# e, we can invert (6.48) as 
eee ican Pema (6.49) 
af —e af —e 
Applying the chain rule we have 
Oz _ Oz Ou Oz Ov _ Oz | ot (6.50) 


+ + 

Ox Ou dx Ovox Gu ov 
Oz a I (6.51) 
Oy Cudy Ovey Gu ~ & 


Substitution of (6.50) and (6.51) into (6.47) gives 
OZ u—Vv —eu+av 
(e- af) = =a(x,y) =, 
Ov af-—e af-e 
Integrating (6.52) with respect to v we have 


1 u-Vv —eu+av 
z=g(u)+ } Pe (cea cL (6.53) 
e-af’y af-e af-e 
where u = ax+y, which as we have seen in Section 6.2 is the characteristics. When 
the integration is conducted for v, we can treat u as constant. 
Let us consider a more specific function a: 


) (6.52) 


a=xPy! (6.54) 
where p and q are constants. Substitution of (6.54) into (6.53) leads to 
1 
P q 
2=¢(u) aan! J fu—v)? (eu +av)'dv (6.55) 


We can apply integration by parts to (6.55) as 


1 ae 
FO ear ‘(ceu+avy! 


a 2 
as a [i v)?*!(-eu + av)t dv 


(p+I\(e—af) 


Repeat this process of integration by parts q times, and note the following result 
[ Giu—vyP*tdv =—f (fu-vy?"d(fu-v) 


(fu _ yeu (af = ann xP tat! 


ptqtl ptqtl 


(6.56) 


(6.57) 


we atrive at 
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z= ¢(u)+ I xPtlya + aq pt+2_q-l 
p+l (p+1)(p +2) 
2 = q —])--- 
a qq 1) geist a a qq 1) 2 xPt4d (6.58) 
(p+1)(p+2)(p +3) (p+1)--(p+q) 
+ a‘q!p! yPtatl 
(p+1+q)! 


More importantly, we find that the solution is independent of e and f 


6.5 GEOMETRIC INTERPRETATION OF LINEAR PDE 


The geometric interpretation was discovered by Monge in the 1770s and published 
much later in 1785. Geometrically, we first note that two different three- 
dimensional surfaces can be expressed in terms of two potential F and G as: 


Vesllag pas 
Ox ~~ Oy @ 


IF IF OF 
VF= 2 e.4 g et e. (6.60) 
Ox Oy: 
Figure 6.4 illustrates that VF is the normal of a spherical surface S; whereas VG is 
the normal to a conical surface S2. The vector E can be interpreted as the tangent of 
the intersecting curve J; 


(6.59) 


B=vFxvo=(- 0G OF OG. 
Oy Oz Oz Oy ~ 


0G OF OF 0G OF 0G OF 0G 


+ Jey +( Je. 
Ox Oz Ox Oz ~ Ox Oy Oy Ox 


The tangent of the intersecting curve is 
dé = dxe, + dye, +dze, (6.62) 


(6.61) 


Note that the coefficient of (6.61) can be expressed in terms of a Jacobian as 
OF,G) OF 0G OF 0G 


O(y,Z) Ov Gz Oc Oy 


Since E and dé are parallel, the components of these two vectors must be 
proportional: 


(6.63) 


dx dy dz 
= = 6.64 
E.G) FG) AFG) me 
O(y,z)  O(z,x) A(x, y) 
In a more compact form, we replace these Jacobians by P, Q, and R as 
a dy id (6.65) 


P(x,y.2) Oley.2) ROY.2) 
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We will discuss in more detail in the next section on solving these equations, which 
are also known as characteristics, and the final solution of its corresponding PDE. 
We will now consider the relation between (6.65) and the solution of the first 
order PDE. Using the definition in (6.66), we have the vector E 
E = Pe, +Qe, + Re, (6.66) 


These components are the coefficients of the first order PDE given in (6.2). Note 
that the vector E will be parallel to a curve J’ which is an intersection of two 
surfaces denoting of their normal vectors VG and VF. The direction of the three- 
dimensional curve is shown in Figure 6.1 as dé. Therefore, the solution of any first 
order PDE can be interpreted as an intersection curve of two 3-D surfaces. 

Clearly, (6.65) can be rearranged to give two characteristic equations 


dy _Q(x,y,z) dz _ R(x, y,z) (6.67) 
dx P(x,y,z)’ dx P(x,y,2) 


VF 


Figure 6.4 Geometric interpretation of the solution of 1st order PDE 


This equation actually provides the idea of Cauchy’s method of characteristics for 
wave equations that will be discussed in a later chapter. Solving these first order 
ODEs in (6.67), which are called characteristics equations, gives the following 
characteristic lines: 

Uy (X, YZ) =c (6.68) 

U(X, y,Z) = Cp (6.69) 
Equations (6.68) and (6.69) are the characteristics of the solution of the PDE. 
Physically, if the PDE reflects the wave phenomenon, these characteristics lines are 
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the propagating directions of waves; in the theory of light, these are rays. The 
solution of the original PDE will be discussed in a later section. Taking the total 
differential of (6.68) gives 


du, = Ot i pee (6.70) 
ox ov Oz 
Ifa first order PDE is given in the following form: 
P(x.y.2) +. O(x, 9.2) + RO) =, (6.71) 
Ox ov (or4 


we see that the following subsidiary equation (parallelism between E and dé) of 
(6.71) is 
dx _ dy _ dz dy 
POy.z) OCoy,z) ROy.z) 0 
This is, of course, the same as (6.65). We will see in Section 6.7 that (6.72) is actually 
the subsidiary equation for the characteristics of the first order PDE. 


(6.72) 


6.6 CAUCHY-KOVALEVSKAYA THEOREM 


In 1842, Cauchy published a series of papers concerning the existence of a PDE 
solution. For the case of first order PDE, consider the following PDE: 


Ou, 
Oz 
Assuming that the solution curve is known to pass through the following initial 
point 


=0 (6.73) 


pYisg ae 
Ox oy 


Xp =Xo(S), Vo =Vo(S), 29 = ZS) (6.74) 
with all xo, yo, and zo being analytic functions of a parameter s (1.e., differentiable 
for real variables). Cauchy showed that the solution of the PDE with the initial data 
exists: 

U(X, V,2,C),C.) = 0 (6.75) 
The two constants in (6.75) indicate the fact that the final solution depends on both 
of the characteristics lines given in (6.68) and (6.69). The existence theorem was 
also investigated independently and improved by Sophie Kovalevskaya, who was a 
student of Weierstrass, in 1874. Darboux also considered the same problem in 
1875. The work of Cauchy seems not to be known to Kovalevskaya and Darboux. 
In fact, in the paper by Kovalevskaya she cited both Briot and Bouquet but both of 
them refer to the work by Cauchy though without giving a precise citation of 
Cauchy. The proof by Cauchy and Kovalevskaya was later improved by Goursat in 
1898 by using analytic functions. However, in the literature, the existence theorem 
is normally referred to as the Cauchy-Kovalevskaya existence theorem. 
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6.7 LAGRANGE’S METHOD FOR LINEAR FIRST ORDER PDE 


The solution of first order PDEs was derived by Lagrange in 1772. Let us generalize 
the first order PDE to n-dimensional space (n variables). In particular, consider a 
first order PDE with n variables: 

Oz (ord (ord 


B—+P,—+..+P,—=R (6.76) 
Ox, OX Ox, 
Taking the total differential of z, we obtain: 
a ip hig ae, =dz (6.77) 
Ox, OX Ox, 


If (6.76) and (6.77) are equivalent, we must have the following subsidiary equation 
(parallel between E and dé discussed in Section 6.5) for characteristics 
dx, dx, dx, — dz 


a A (6.78) 
RP, PR 


n 


where A is a constant. The term characteristics was introduced by Cauchy. The 
solution of the characteristics given in (6.78) is also a solution to (6.76). This 
equivalence was known to Euler. 
The solutions of the characteristics defined in (6.78) can be expressed 
symbolically as 
Uy (X15 XQ 5-0-9 Xp_oZ) = A 


Uy (X15Xq5-+49X_sZ) = Ay (6.79) 


Uy, (X1 5 Xo 5--+9Xp_sZ) =A, 


According to Lagrange, the general solution of (6.76) is given as 

P(Uy ,U>,-..,U,) =0 (6.80) 
where ¢ is an arbitrary function. This result was obtained in 1785 by Lagrange. Let 
us consider the proof for the case of two independent variables here. The proof for 
more than two variables follows similarly. In particular, consider the differential 
equation and its subsidiary equation as 


=p _R, dy _ a, dz _, (681) 
x, 7 &y P PR 
The characteristics by solving the second of (6.81) are expressed symbolically as: 
Uy (%15%Xy,Z)=Cy, Uy (Xj, Xy,Z) = Cy (6.82) 
Taking the total differential of (6.82) gives 
0 Ou Ou 
du, =—! de, +§ “1 dx, + az =0 (6.83) 
Ox, Ox, Oz 
0 e) e) 
du, =—? dy, +2 de, +$ 2 de =0 (6.84) 
Ox, OX, Oz 


Using the results of Section 6.5, (6.83) and (6.84) correspond to two first order 
PDEs as 
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pol tp Sig (6.85) 
Ox OX, (6-4 
p22 yp M2 , 7% _¢ (6.86) 
Ox, OX IZ 
These two equations can be put in a matrix form as 
iy, Oe pout. 
Ox, Ox, ||P 
1 2|J41t__ J & (6.87) 
Bie i U2) Ole 
Ox, = OX Oz 


The solution of P; is 


R Ou, Ou, ge Ou, Ou, R O(uy,U>) 
Oz OX, Oz OX, O(Z, X>) 


B= = 6.88 
} Ou, Ou, Ou, OUuy O(uy,U>) mee) 
Ox, OX, OX, OX, O(x,X>7) 
Rearranging (6.88), we obtain 
PB —R 
= 6.89 
Ou, Cu, Ou, Ou, Ou, OU, Ou, OU, ee 
OZ OX, OZ OX, OX, OX, OXy ON 
Similarly, the solution of P2 leads to 
P, —-R 
- (6.90) 
O(Uy,Uz) OC, Up) 
O(m,Z) — O(%,X7) 
Characteristic equations relating R, P|, and P2 are 
P, -R R 
= = (6.91) 
O(uy, Uy) O(uy Uy) O(uy Uy) 
O(%, 2) O(%, Xz) O(Z, X2) 
We now return to the following characteristics 
Uy (X1,X2,Z) = Cy (6.92) 
uy (X1,X>,Z) =Co (6.93) 
Consider an arbitrary function containing wu and uz 
D(u,,u,)=0 (6.94) 
Taking differentiation of (6.94) with respect to x; and x2, we find 
OP Ou, OP Ou, ij, oD Ou; OP Cun _ 4 (6.95) 
Ou, Ox, Ou, OX, Ou, OX, OUy OX, 
Differentiate (6.92) and (6.93) with respect to x1 
i ig, Oi (6.96) 


éx, Ox, @ Ox, Ox, Oy o& a& 
Substitution of (6.96) into the first equation of (6.95) gives 
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2@ (Ay, Oy Oe) OD (By , Oty Oe 
Ou, Ox, OZ OX, OUy OXY OZ OX 
Similarly, the second equation of (6.95) can be rewritten as 
OD Ou, _ om Oz oP Ou, _ Oty OF 5 29 (6.98) 
Ou, OX, OZ OX, = OUn OX = OZ COX 
These two equations can be put in a matrix form as 
Ou, rea Oz Ou, Mn oz || 0@ 
Ox, Oz Ox, Ox, Oz Ox, || Ou, 0 
Ou, rea Oz = Ou , Me oz || 0@ 
Ox, OZ OX, OX, OZ OX, || Ou, 


For a nonzero solution for the derivatives of ®, we require the determinant of the 
matrix being zero: 


Ou, 7 Ou, Oz 


)=0 (6.97) 


(6.99) 


Ou, | OU, O Ou, , Oy Oz, Ou, om Oz 


=0 (6.100 
Ge, Oz Ox x Ox, Oz ox, Ox, Oz Ox, OX, Oo Oxy) 
Expanding the multiplication and simplifying the results gives 
Ou, OUn Ou, OU, de Oz Ou, Oz OU OUy 
Ox; OX, OX, Ox, OX, OZ OX, OZ OX, (6.101) 
r Oz OU, Ou, Ou, OU _0 
Ox, Oz Ox, Oz OX, 
Using the definition of the Jacobian, we can simplify (6.101) to 
O(uy U2) _ OZ O(uy U2) _ O(uy ,U>) = (6.102) 


Axx) Ox, O(Zx,) By A(%,2) 
Substitution of the results in (6.91) yields the following first order PDE 
pp 4h <6 (6.103) 
Ox, Ox, 
This is the PDE given in (6.81) and, thus, the solution of (6.103) is given by (6.94). 
The proof can be extended to the case of 1 variables to show the validity of (6.80). 


6.8 LAGRANGE MULTIPLIER 


In this section, we will solve the subsidiary equation given in (6.78) to get the 
characteristics. It is not always straightforward to solve these characteristics 
equations. Very often, the method of the Lagrange multiplier can be used to 
simplify the problem. In particular, for the case of n variables, we have the 
subsidiary equation as: 
dx, dx, dx, dz L My dx, + by dxy 7 ofl, + My AZ 
F P, ca R MA + bP, Pathe, + My R 


‘1 


(6.104) 
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The last part of (6.104) is the result of the method of the Lagrange multiplier and 
LU, [l2,.--, Hn+1 are the Lagrange multipliers and are arbitrary functions of the n 
variables. 
To prove the last part of (6.104), we first notice that dx; = AP; or 
Mx, = APR, bydx) = My AP),..., 


(6.105) 
Hy AX, ms HAP: My Z = MntAR 
Substitution of (6.105) into the last of (6.104) gives 
hy dx, a Mydx, + ssbb AXy + My yi = AP, + ALP, tiuct AUF, + Af sR 
MP + bP, Pang l, + Mik MA + [bP, + plaly + My R (6 106) 
= Am F, + [bP, Puch Phy + My R) =i 


MF F [bP, + vill + MyiR 
This completes the proof. The role of uw, by,..., Unt is like that of integrating 
factors. If we can find appropriate multipliers such that 


MA + LbP, +. F;, + My R =0 2 (6.107) 
from (6.106), it also implies 
Mydx, + thdx, +...4,dx,, + L,,,dz = 0 (6.108) 


Thus, the solution can in principle be found 
U(X, ,X7,--,X),),Z) =a (6.109) 


This method of the Lagrange multiplier will be illustrated in the following example. 


Example 6.1 Find the solution for the following first order PDE 


jg (6.110) 
ox oy 
Solution: The subsidiary equation of (6.110) is 
a&_ dW _& (6.111) 
y x 0 
The last part of (6.111) implies 
dz =0 (6.112) 
Integration gives 
uy =2 =a, (6.113) 
The first two parts of (6.111) give 
xdx + ydy =0 (6.114) 
Integration of (6.114) results in 
uy =x’ +y" =a, (6.115) 
By (6.80), the solution can be expressed as 
d(x? + y*,z)=0 (6.116) 


This solution can be rewritten as 
z= f(x’ +y’) (6.117) 


First Order PDEs 381 


As expected, the general solution involves an arbitrary function 


Example 6.2 Find the solution for the following first order PDE 
Ci oi 2S (6.118) 
Ox Oy 


Solution: The subsidiary equation of (6.118) is 
dx _ dy dz 


=- (6.119) 
v4y? Qxy e 
Using the Lagrange multiplier method given in (6.104), we have 
dx +d; dx — d. - 
2 “o 2 2 2 = = Gt y) a = (6.120) 
x +2xyty x°-2xyt+y Zz (x+y) (x-y) 
The last equation in (6.120) can be expressed as 
1 1 
d{—)=-d(—) (6.121) 
Zz x+y 
1 1 
d{—)=-d{——) (6.122) 
Zz x-y 
Integration gives the following characteristics 
u a : =q (6.123) 
Z x+y 
Uy ae : =a, (6.124) 
Z ox-y 
Therefore, the general solution can be expressed as 
fete eG (6.125) 


where ¢ is an arbitrary function. 


Example 6.3 Find the solution for the following first order PDE with boundary 
condition 


Oh Os gO 5. (6.126) 
Ox Oy OZ 
u=yz, x=1 (6.127) 


Solution: The subsidiary equation of (6.126) is 
dx dy _dz_ du 
1 1 2 O 
This gives characteristics equations as: 


(6.128) 
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dx=dy, dx= 5, du =0 (6.129) 
It is straightforward to see that the corresponding characteristics lines are: 
Uj =X-Y=Q, Uy=2X-Z=a4), U3 =U=A4, (6.130) 
The solution can be expressed in terms of these characteristics as 
$(uU,,U>,U3) =0 (6.131) 
Alternatively, we can express 3 in terms of the other characteristics as 
u=@(x—-—y,2x—-Z) (6.132) 
Substituting the boundary condition given in (6.127) 
u(x =1) = yz=@0-y,2-2) (6.133) 
Putting x = 1 into (6.130), we get 
l-y=q (6.134) 
2-Z=4a), (6.135) 
yZ= a, (6.136) 
We can solve for y and z in terms of constants a; and az as: 
y=1-a, (6.137) 
z=2-4, (6.138) 
u= yz =(l—a,)(2—ay) =a, (6.139) 


We can now substitute the value of a; and a2 from (6.130) into (6.131) to get the 
general solution 
u=(1—-x+y)(2—2x+z) (6.140) 


As a final check, when x = 1, we have u = yz from (6.140). 


6.9 PFAFFIAN EQUATIONS 


Recall from Chapter 3 that first order ODEs can be expressed in differential form as: 
Max + Ndy =0 (6.141) 


We note that first order PDEs can also be expressed in a similar fashion. This is called 
Pfaffian: 
Pdx + Qdy + Rdz =0 (6.142) 


which is named after Pfaff, who was the supervisor of the renowned mathematician 
Gauss. 


6.10 EAXCT EQUATIONS 


For the three-dimensional case, the first order PDE in Pfaffian form is 
Pdx + Ody + Rdz =0 (6.143) 
This PDE can be interpreted as a dot product of the following two vectors: 
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E=(P,Q,R), d& =(dx, dy, dz) (6.144) 
Therefore, (6.143) is equivalent to 
E-d&=0 (6.145) 
If the vector E can be expressed in terms of a potential function U 
E=VU, (6.146) 
the PDE given in (6.145) can be rewritten as: 
VU-d&=0 (6.147) 


Or equivalently, we can express it as 


eu ae aS dz=0 (6.148) 


ox Iz 
This is clearly a total differential of a function U given by 
U(x, y,Z)=c (6.149) 
where c is a constant. Thus, this is the solution the Pfaffian differential equation. 


Therefore, the existence of a potential U defined in (6.146) is the condition of the 
Pfaffian being exact. An equation of this type is first considered by Clairaut in 1739. 


6.11 INTEGRABILITY OF PFAFFIAN 


If the Pfaffian differential equation is not exact, we can assume the existence of an 
integrating factor sz such that 

UPdx + uOdy + uRdz =0 (6.150) 
Note that this idea is exactly the same as that discussed for first order ODE. In vector 
form, we require that (compare with (6.146)): 


HE =VU (6.151) 
The governing equations for this integrating factor are 
2 
OU _ OP) _ OuQ) (6.152) 
Oyox oy Ox 
2 
CU _ (MO) _ OMAR) eis 
Oydoz Oz oy 


OU _ OUR) _ (uP) 
OzOx Ox Oz 


This is an extension of the condition for an integrating factor for first order ODEs 
discussed in Chapter 3. Expanding (6.152) to (6.154) gives 


(6.154) 


OP OO, ou Ou I 
UG ae Q - as (6.155) 
0Q OR, Ou Ou 6.156 
L( es ey R By Q i (6.156) 
OR OP. ou Ou 
Mt ox az) . Oz . ox oD 
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We can add the results of (6.155) multiplied by R, (6.156) multiplied by P, and 
(6.157) multiplied by QO to yield 
OP Oo 0Q OR OR OP 
2) + P( g )+O( )=0 (6.158) 
Oy Ox Oz Oy Ox oO 
This condition, if expressed in terms of the vector FE, can be written as 
E-(VxE)=0 (6.159) 
This is the condition of the Pfaffian differential equation being integrable. This 
condition (6.159) was derived by Clairaut in 1740 and by D’Alembert in 1744. 
Once (6.158) is satisfied, we can solve the problem using one of the following 


procedures. Note that the following procedure can only apply for the case where 
the Pfaffian is integrable. 


R( 


Method 1: 
Because the Pfaffian is integrable, we must have a function U such that 
dU = Pdx + Ody + Rdz (6.160) 
This implies that 
pu28 gee" g-28 (6.161) 
Ox Oy Oz 
Integrating (6.160) we have 
U(x, ¥,z)=C (6.162) 
where C is a constant. Therefore, the solution of the Pfaffian is a one-parameter curve. 
Method 2: 
First, (6.143) can be rearranged as: 
P 
Peay Sands mc ESE § (6.163) 


Comparison of the terms in (6.163) gives 
Oz P 


— === F#(GYZ 6.164 
a OR 1(%5 ¥5Z) (6.164) 
Oz @) 
— =-==0,(%,),z 6.165 
By R O; (x,y, Z) ( ) 
Thus, we can integrate (6.164) with respect to x by holding y constant. We have 
z= 9(x,¥;C(y)) (6.166) 


where C(y) is an arbitrary function that we add in the process of integration. Back 
substitution of (6.166) into (6.165) gives an ODE for C(y). After its determination, 
(6.166) becomes the complete solution. Note that this procedure is similar to Section 
3.2.4 for integrable ODEs. 


Method 3: 


Step 1: Hold z= constant or set dz = 0; 
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Step 2: integrate with respect to x and y and add an arbitrary function of f(z); 

Step 3: differentiate the result obtained in Step 2 with respect to all variables; and 

Step 4: compare the result in Step 3 with the original PDE to identify f(z) in Step 
2. 


Note that these methods are somewhat similar. These procedures are illustrated in the 


following example. 


Example 6.4 Find the solution for the following PDE in Pfaffian form 


3 yzdx + 2xzdy + xydz = 0 (6.167) 
Solution: The functions P, Q, and R can be identified as 
P=3yz, Q=2xz, R=xy (6.168) 
Taking partial differentiation of these functions gives 
\P IR IR 
a =3z, og =2z, og 2x, e x, e =3y (6.169) 
ov Ox (ord ov ox 
Substitution of (6.169) into the integrability condition (6.158) gives 
oP oO 0Q OR OR OP 
RB) + (E-2 (S-S) 
Oy Ox ov Ox (6.170) 
= xy(3z —2z)+3yz(2x —x)+2xz(y—-3y) =0 
Thus, (6.167) is integrable. 
Method 1: 
Dividing through (6.167) by xyz, we get 
2 1 
Sea dy+—dz=0. (6.171) 
x y Zz 
Integrating (6.171), we arrive at 
3Inx+2Iny+Inz=c (6.172) 
Combining these terms, we have 
In(xry?z) =c (6.173) 
Taking the exponential function on both sides, we find 
xy’z=C (6.174) 
This is the solution of the Pfaffian given in (6.167) 
Method 2: 
It is straightforward to identify that 
OZ iP 32 
—=-—=P(x,y,z)=-— 6.175 
ax R 1 (x, 2) ( ) 
OZ O 
3220 (y2)==— 6.176 
dy R Q(x, y,Z) ( ) 
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Integrating (6.175), we obtain 
Inz+Inx* =C(y) 
Thus, we have 


3 
zx” =C(y) 
Substitution of (6.178) into (6.176) gives 
ac __2C 
dy y 
Integrating both sides, we find C as 
Cc 
Ciy)== 
Ma 


Finally, substitution of (6.180) into (6.178) gives the solution as 
xy'z=C 
As expected, this is the same as (6.174). 


Method 3: 
Step 1: We can set 


dz=0,or z=a, 
By doing so, (6.167) is reduced to 


3ydx + 2xdy =0 
Step 2: Integrate (6.183) as 
7h = 3% 
y x 


This can be integrated exactly as 
2Iny =-3Inx+In f(z) 
We have put the arbitrary function of z in a compact form: 


1 
y =f) 
x 
Step 3: We can now differentiate (6.186) as 
3x7 y"dx +2yx3dy = faz 


Step 4: Comparison of (6.187) with the original PDE (6.167) yields f 


cm 
ra ee ‘a 
f = 
Z Zz. 


In f =-Inz+Cy 
1 
f==c 
Zz 
Finally, we find 
x yz =C 
Again, all different procedures lead to the same answer. 


(6.177) 


(6.178) 


(6.179) 


(6.180) 


(6.181) 


(6.182) 


(6.183) 


(6.184) 


(6.185) 


(6.186) 


(6.187) 


(6.188) 
(6.189) 


(6.190) 


(6.191) 
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Example 6.5 A complex function is defined by two functions u(x,y) and v(x,y) as: 
w=u(x, y)+iv(x, y) = f(z) (6.192) 
where fis an arbitrary function and 
Z=xXt+iy (6.193) 
Show that 


Sey = (6.194) 


Solution: This problem actually provides a different proof of the Cauchy-Riemann 
equations, by solving first order PDEs. The main focus is to show that we must 
have z = xtiy, as the characteristics for this case. Consider the following 
derivatives: 


Ow dw oz dw 
= f"(zZ)= 6.195 
ox dz a fe) dz ( 
OM aN ee if'(z) nila (6.196) 
Oy dz oy dz 
Comparison of (6.195) and (6.196) gives 
figs (6.197) 
ox i Oy 
The last part of (6.197) actually provides a first order PDE: 
a a CLEC LAE, (6.198) 
Ox i0vy Ox Oy 
Using the Lagrange method, we have the subsidiary equation as: 
eo! ae (6.199) 
1 i 0 
The characteristics equation can be solved as 
idx —dy =0 (6.200) 
By multiplying i, we have 
dx + idy =0 (6.201) 
After integration we get 
uy =Xt+iy=c, (6.202) 
The other characteristics are clearly 
dw=0, or u,=W=C, (6.203) 
o(w,xt+iy)=0 (6.204) 
Therefore, we can write it as 
w= f(x+iy) (6.205) 


This completes the first proof. 
Secondly, we can rearrange (6.198) as 
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Ow _ Ow _ ,Olu+i) _ Outiv) 


(6.206) 
ox oy Ox Oy 
Expanding (6.206), we obtain 
A CL (6.207) 
Ox Ox Oy Oy 
Equating the real and imaginary parts of (6.207), we have 
Ou _ or Ou _ Ov (6.208) 


oy ox Ox oy 


This proves the Cauchy-Riemann equations. 


6.12 LAGRANGE-CHARPIT METHOD FOR NONLINEAR PDE 


We are going to discuss a general technique called the Lagrange-Charpit method, 
Lagrange method, or Charpit method. There is a heated dispute on who discovered 
this technique. There is not much historical coverage of this method in any 
textbook. Some writers called this the Lagrange method as Charpit never published 
this method before he passed away and the original manuscript was reportedly 
“lost.” The idea for this method was published in 1779 by Lagrange. It is clear that 
you should have heard about the name of Lagrange quite often (either from this 
book or from elsewhere) while Charpit is not well known and we don’t even know 
when we was born. A vivid example of this view is given by Kline (1972): 


Lacroix said in 1798 that Charpit had submitted a paper in 1784 (which was not 
published) in which he reduced first order partial differential equations to system of 
ordinary differential equations. Jacobi found Lacroix’s statement striking and expressed 
the wish that Charpit’s work be published. But this was never done and we do not know 
whether Lacroix’s statement is correct. Actualy Lagrange had done the full job and 
Charpit could have added nothing (p.535, Volume 2, Kline, 1972). 


We do not agree with Kline on this, and Kline (1972) did not have a chance of 
reading the historical paper by Grattan-Guinness and Engelsman (1982), which will 
be summarized in a later paragraph. 

Some authors called it the Charpit method (e.g., Sneddon, 1957) as there are 
records that Charpit presented this method in front of French Royal Academy of 
Sciences on June 30, 1784 shortly before Charpit passed away on December 28, 
1784, although there was no published record of his work before Saltykow (1930, 
1937) formally published it in 1930 for Charpit. However, the paper by Saltykow 
was not cited in textbooks, such as the one by Sneddon. 

We follow the view that Lagrange laid down the fundamental ideas of this 
technique in 1779 and Charpit provided the actual details in 1784 and we prefer to 
call it the Lagrange-Charpit method (e.g., Delgado, 1997). Forsyth (1888) claimed 
that this method is partially due to Lagrange and partially due to Charpit, although 
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Forsyth called it Charpit’s method. But again no detailed citation was given by 
Forsyth. 

As remarked in the last paragraph, the history of this method is not covered in 
any textbook even though different names have been adopted by different authors. 
The best coverage was given by Grattan-Guinness and Engelsman (1982), and we 
reported its history briefly here. It was reported that Charpit had helped the course 
given by Monge, who also gave him private lectures in solid geometry. Charpit was 
clearly influenced by Lagrange. Paul Charpit was a “young” mathematician when 
he presented this method to the French Academy of Sciences on June 30 1874. 
Monge, Bossut, Condorcet, Cousin, Laplace, Vandermore, and de Borda were 
presented. At that time, Lagrange was still in Berlin. It was supposed that Laplace 
and Condorcet were nominated as reporters for Charpit’s presentation, but such 
report was never found. Presumably Laplace kept the original manuscript of 
Charpit until he passed it to Lagrange in June 1793. In September 1793, Lagrange 
sent it to Arbogast, who made a copy for himself and sent the original to Lacroix, 
who made another copy of the manuscript. Lacroix mentioned Charpit’s method in 
his book and publications in 1798, 1802, and 1814. When Jacobi asked in 1841 for 
the publication of Charpit’s paper, it was reported lost for the first time. 
Apparently, Jacobi misunderstood Lacroix and claimed that it was lost. And since 
then, most authors simply quoted from Jacobi that Charpit’s manuscript was lost. 
As remarked by Grattan-Guinness and Engelsman (1982), the extant manuscript of 
Charpit actually survived and has been archived at the Biblioteca Medicea- 
Laurenziana, Florence (Arbogast’s copy) and at the Archives of the Academie des 
Sciences, Paris (both Arbogast’s and Lacroix’s copy). Thus, Jacobi’s claim of a 
“Jost” manuscript is not correct. In 1928, H. Villat found Lacroix’s copy at the 
Archives of the Academie des Sciences and made a photocopy and sent it to N. 
Saltykow in Belgrade, who published it in Saltykow (1930, 1937). Although the 
paper of Saltykow (1930, 1937) was published in French, it is clear from the 
mathematical equations that the paper of Charpit did contain all the major steps of 
the present known Lagrange-Charpit method. Therefore, Kline’s (1972) assertion is 
inaccurate. 

Let us consider the following nonlinear first order PDE 


F(x, y,2Z, ee, =0 (6.209) 
Ox Oy 
Next, we define 
oa, ga (6.210) 
Ox oy 
The nonlinear PDE given in (6.209) can now be written as 
F(x, y,Z, p,q) =0 (6.211) 
We seek a solution in the form: 
U(x, y,2Z, p,q) =a (6.212) 
Mathematically, we can treat (6.211) and (6.212) as two equations for p and gq: 
P(X,¥,2,4), 9(x,,Z, a) (6.213) 


This is the most fundamental idea of this method. Once p and qg are solved, we can 
back substitute them into the total differential such that it is integrable: 
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dz = Fact dy = pes y,z,ade + g(x y,2,a)dy (6.214) 
IX 


Then, we can integrate (6.214) to give a solution for the original PDE given in 
(6.209). 

First, using the technique for linear PDEs, for (6.214) we can identify the 
vector E as 


E=(P,Q,R)=(p.9,-)) (6.215) 
Then, the integrability of the Pfaffian equation (6.158) becomes 
OP oO 0Q OR OR OP 
RSL) 4 pL) (A _& 
Oy Ox Oz Oy Ox a 
og _ PP _ PP 4 _y 
OZ oz Oy Ox 


Next, we will find the first derivatives involved in (6.216). To find them, we take 
the total differential of F and U with respect to x gives 


dF _ OF | OF op , OF oq _ 

dx Ox Op ox dq Ox 

dU _0U , OU ep , ou oq Z 

dx Ox Op Ox dq Ox 
These equations can be put in matrix form as 


F, a Px = ‘ae (6.219) 
UU; |e, ve 


(6.216) 


(6.217) 


(6.218) 


or, 
[A]{p} =-{f} (6.220) 
The inverse of the matrix A is 
U, —-F 
ne (6.221) 
det|4|]-U, F, 
The determinant of A is 
det|A| = —__ FLU, -F,U, #0 (6.222) 
O(P.q) 
Therefore, we have 
= UF -FU, 
cg ia (6.223) 
qx OF,U) “Us +P, 
O(p.q) 
Thus, we have 
__ FU) AF WU) | _ FU) FU) co 


» Ax 
A(x,q) Op.) Ox, p) p,q) 
Similarly, we can differentiate F and U with respect to y and z to obtain 
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eee OF,U) , FU) pe O(F,U) OF ,U) 
00.0) Apa)? “A APD” — gays) 
_ FU) , FU) | 
“  @(z,p) Op.9) 
Now, we can substitute these results into (6.216) to give 
OF,U) , FU), OFU) , FU) _ 4 (6.226) 


O2Z,p) — O2,q) y,.4) Op) 
Equivalently, (6.226) can be expressed explicitly as 
pH F era, Ee FE 2 ee S622) 
We can re-shuttle this equation by recalling that U actually is our unknown 
oU oU oU oU oU 
Fi—+F, —+(qF, + pF,)—-—(F. + pF.) ——- (Ff, +9F,)— =90 (6.228 
Poy a Gy + Ghat Pl) at PEA Ey ta (6.228) 
This is a first order PDE with five variables and the Lagrange method discussed in 
Section 6.7 can be used to write down the the subsidiary equation 
dx dy _ dz _ dp _ dq 
-, Fae qk, + pF, {P+ pr) =F, +9F,) 
This is the main result of the Lagrange-Charpit method. From this equation, we can 
derive 


(6.229) 


U(x,y,2, p,q) =a (6.230) 
as long as 
ey) #0 (6.231) 
OP.) 
The characteristics equation can be set to dt where ¢ is a parameter: 
ce = a dt (6.232) 


Ly 45 Gia, eae) ha) 


Therefore, the nonlinear PDE is transformed to a system of five coupled ODEs: 


dx dy dz 

=F, =F, =gF_+ pF. 6.233 
ae ° a 8 ee? a ( ) 
ep ee Le F.) (6.234) 
dt xt PL), dt y q 4 , 


We now illustrate how to apply the Lagrange-Charpit method. 


Example 6.6 Solve the following nonlinear PDE 


Grr y=2 (6.235) 


Solution: This PDE can be recast as 
F = p’x+q’y-z=0 (6.236) 
Differentiation of (6.236) gives 
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2 2 
Fa2pxs, F,=2q, Fe=p. Kya, Fal 
The subsidiary equation becomes 
dx — dy | dz _ dp _ dq 
2px 2qy 2p?x+2q’y pl-p) g-q) 
Using the method of the Lagrange multiplier, we have 


pdx +2pxdp— _ g°dy + 2qyvdq 


2pxt+2px(p-p*) 2q°y+2qy(q-4") 
This can be simplified to 


d(p°x) _ d(q°y) 


px qy 
Integration of both sides gives 
px=aq'y 
On the other hand, we can rewrite (6.236) 
px=z-q'y 


Equating (6.241) and (6.242) gives 


V2 
a Zz 
: vl — 


Substitution of (6.243) into (6.241) gives 


1/2 
_| za 
eee ;| 
Finally, from (6.214) we have 


/2 2 
eo | geal dy 
x(1+a) y(+a) 
Rearranging (6.245) leads to 
(+a) 1/2 a\'2 ie 
dz= dx + 2 dy 
Zz x y 


Finally, integration of (6.246) gives the solution as 
{d+a)z}"" =(ax)"? +y! +5 


(6.237) 


(6.238) 


(6.239) 


(6.240) 


(6.241) 


(6.242) 


(6.243) 


(6.244) 


(6.245) 


(6.246) 


(6.247) 


Example 6.6 shows that the analysis may not be straightforward if we do not 
recognize the appropriate multiplier for the subsidiary equation. Let us consider 
some special cases of nonlinear first order PDEs that can be solved easily by using 


the Lagrange-Charpit method. 
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6.12.1 Typel (only p and q) 


Consider the case that, the PDE only involves p and q as 


F (p,q) =0 (6.248) 
The subsidiary equation can be simplified to 
dx dy | dz _dp_ dq (6.249) 


ff Ly. Gh or, & @ 


For such cases, we have 


dp=0, p=a (6.250) 
Substitution of (6.250) into (6.248) gives 
f(a,q)=09 (6.251) 
If we can solve for g, we have 
q=Q(a) (6.252) 
This is consistent with the last part of (6.249) 
dq=0 (6.253) 


In addition, (6.248) is a first order PDE, and we expect only one unknown constant 
instead of two. This is precisely what we get in (6.252). Now we can integrate to 
get z as 


sag sees) (6.254) 
Ox 
Similarly, we can integrate (6.252) to give 
OZ 
q= a O(a), z=O(a)y+ fy(x) (6.255) 
Comparison of (6.254) and (6.255) gives 
fy(xy=ax+C, fi(v)=Q(ay+C (6.256) 
Finally, the function z is 
z=ax+QO(a)y+C (6.257) 


Example 6.7 Solve the following nonlinear PDE 


(Fy) =1 (6.258) 
Ox Oy 
Solution: Note that this is Type I, and it can be rewritten in terms of p and q as: 
pq=l (6.259) 
Employing (6.249) of the Lagrange-Charpit method, from dp = 0 we have 
p=a (6.260) 
Substitution of (6.260) into (6.259) gives 
q=—=0la) (6.261) 


Integrating both (6.260) and (6.261), we have 
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z=ax+~4+C 
a 


Alternatively, we can rewrite it as 
a’x+y-az=C 
where C; is an arbitrary constant. 


6.12.2 Type II (only z, p and q) 


For type II, the functional form of the PDE is 


F(Z, p,q) =0 
The subsidiary equation of the Lagrange-Charpit method is simplified to 
dx — dy _ dz _ dp dq 


Py £y Gi tpl, pF, at; 
The last part of (6.265) gives 


dp _ dq 
P 
Integrating this, we obtain 
p=aq 
where a is a constant. Substitution of (6.267) into (6.264) gives 
F(z,aq,q)=0 
Suppose that we can solve for g such that 
q = Q(z,a) 
Finally, p becomes 
p=aQ(z,a) 


With known p and q, we can integrate (6.214). 


Example 6.8 Solve the following nonlinear PDE 
(Sp -(&) =2 
Ox Oy 
Solution: Note that this is Type II, and it can be rewritten as: 
F(z, p,q)= p’ -q -2=0 
Thus, we can use (6.267) as 
p=aq 
With (6.273), the PDE given in (6.271) becomes 
ag? —q? =z 
Solving for g we have 


Substitution of (6.275) into (6.273) gives 


(6.262) 


(6.263) 


(6.264) 


(6.265) 


(6.266) 


(6.267) 
(6.268) 
(6.269) 


(6.270) 


(6.271) 


(6.272) 


(6.273) 


(6.274) 


(6.275) 
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p=a>— (6.276) 
ae 
With these values of p and q, (6.214) becomes 
dz = a(——)'dx + ("ay (6.277) 
a1 a’ -1 
Rewriting (6.277) gives 
1 a 1 


wee 7 wet 7 Td (6.278) 
Z (a~ —-1) (a~ -1) 


Integrating both sides gives 


1/2 _ ax ¥ 
2z 7a t72 nt (6.279) 
(a“ -1) (a~ -l) 
Finally, we have the final solution as 
2 
1 ax y 
7= ot ep? +c| (6.280) 


6.12.3 Type III (Separable) 


If the nonlinear PDE can be separated such that x and p only appear on the left 
whereas y and q only appear on the right, we have the Type III situation: 


F(x, p) =G(y,q) (6.281) 
For such case, the subsidiary equation of the Lagrange-Charpit method becomes 
dx dy | dz _ dp _ dq (6.282) 
. “Gy PF -gG,. Ey. 
The first and fourth equations of (6.282) can be grouped to form an ODE as 
F. 
ap +— =0 (6.283) 
a F, 


which can be solved for p as a function of x because F is only a function of x and p 
(see (6.281)). In Pfaffian form, it is 


F dp + F.dx =0 (6.284) 
This is a total differential of F = constant. Clearly, the solution is 
F(x, p)=a (6.285) 
where a is a constant. Thus, (6.281) also leads to 
G(y,q)=a (6.286) 


Thus, in principle, we can find p and q from (6.285) and (6.286) and eventually 
integrate (6.214). 

Alternatively, we can also group the second and last equation of (6.282) to 
give 
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G, 

aq My 

dy G, 

Then, following the same logic we can find the solution for g since (6.287) is a total 


differential of G = constant. Thus, we can solve both p and q from the following 
equations 


iG (6.287) 


G(y.q)=a, F(x,p)=a (6.288) 
The next example illustrates this case. 


Example 6.9 Solve the following nonlinear PDE 


(Z)y0422) = (Bx (6.289) 
Ox Oy 
Solution: Note that this is Type II because it can be written as 
2 2 
1 
py 8 (6.290) 
x y 


Because the given function G is much simpler than F’, we can revise the procedure 
slightly as 


G,dq+G,dy =0 (6.291) 
Using (6.290), we have 
1 
G,=—, G,=-4 (6.292) 
y y 
Thus, we have 
ual 7 ay (6.293) 
q y 
The solution is 
q=ay (6.294) 
where a is a constant. Substitution of (6.294) into (6.290) gives 
Lanse 
F(x, p)= p” = =4 (6.295) 
x 
Solving for p we have 
V2 
a’“x 
Substitution of (6.296) and (6.294) into (6.214) gives 
V2 
dz = pdx + qdy = at +aydy (6.297) 
xX 


Integration gives 


z=Javl+x +5 yr +b (6.298) 
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6.12.4 Type IV (Clairaut Type) 


The type IV first order PDE can be considered as a generalization of the Clairaut 


equation of first order ODEs discussed in Chapter 3: 
Oz (ord Oz OZ 

zZ=x—+y—4+f(H, 
Ox ~ Oy © Ox Oy 


Thus, the functional form F can be cast as 


) (6.299) 


F=xp+yqt f(p,q)-z=0 (6.300) 
Thus, we have 
Hal =p, FS FyA8elys 245 F oy (6.301) 
The subsidiary equation of the Lagrange-Charpit method becomes 
a __® & 2% (6.302) 


x+f, ytf, wtptad,+pf, 9 0 
Clearly, similar to the Clairaut equation discussed in Chapter 3, we have 


p=a, qz=b (6.303) 
Thus, the solution is (6.196) with the values of p and qg given in (6.200) as: 
z=ax+by+ f(a,b) (6.304) 


Example 6.10 Solve the following nonlinear PDE 


Oz  OZ.2 Oz Oz, 
os ay? {z aes »s 2 (6.305) 


Solution: Note that this is Type IV or Clairaut type 


Z—xp — yq = —— (6.306) 
(ptay 
Thus, from the last two equations of (6.302) we have 
p=a, qz=b (6.307) 
The general solution is 
z=ax+by+——— (6.308) 
(a+b) 


6.12.5 Singular Solution 


Similar to the discussion in Section 3.2.12, the singular solution can also be found 
accordingly. Consider again the following first order nonlinear PDE 


F(x, y,Z, p,q) =9 (6.309) 
where p and g have been defined in (6.210). Let the general solution of it be 
V (x, y,Z, a,b) =0 (6.310) 


Since the solution for z is a function of x and y, we can differentiate (6.310) as 
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OV OV cz OV OV 
+ = +—p 
Ox Oz Ox Ox 
OV Vv oz _ ov peg 
ay Oz Oy Oy ae 
Similar to the c-discriminate method discussed in Section 3.2.12, we allow a and b 
to be functions of x and y. Then, the differentiation of (6.310) leads to 
OV OV OV oa OV Ob 
ae pr + = 
Ox =o Ca Ox = Ob Ox 
ov ce mid Oa ; oV ob _9 (6.314) 
ay a " @a oy Ob oy 
Substitution of (6.311) and (6.312) into (6.313) and (6.314) gives 
a da, OV ab _ 
Ca Ox Ob Ox 
OV Oa md Ob _ 
‘Oa a ‘Ob ov 
These two equations can be put in a matrix form as 
Oa ob |(aVv 


ex Ox |) 6a {of (6.317) 


=0 (6.311) 


=0 (6.312) 


(6.313) 


(6.315) 


(6.316) 


Oa 0b|\aV 0 

dy Oy || ab 
In general, we can now use (6.310), (6.313), and (6.314) to eliminate a and b. 
There are three possible scenarios: 


(i) For singular solutions, we require 


=. 26. PSO; (6.318) 
aa ob 


Note that (6.313) and (6.314) are automatically satisfied by the first equation of 
(6.318). 


(11) To get back the general solution, we require 
Ga _ ab _ aa _ Ob _ 


(6.319) 
ox Ox Oy ov 
(iii) For the third scenario, we have 
la #0, ua #0 (6.320) 
Oa Ob 
From (6.317), we immediately see that the following Jacobian is zero: 
Ne) (6.321) 


Oxy) 
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Thus, the variables a and b are not independent. We must have a = a(b) or 
inversely we have 

b=w(a) (6.322) 
where w is an arbitrary function. Differentiating (6.310) with respect to a and 
rewriting (6.310) in terms of (6.322), we have the following system of equations: 


cia Le (6.323) 
Ca Ob Oa 
V(x, y,z, a, w(a)) =0 (6.324) 


Elimination of a from (6.323) and (6.324), we again obtain the general solution. 


Example 6.11 Find the general solution as well as the singular solution of the 
following Clairaut equation: 


> ee get Oz Oz _ (6.325) 
Ox ~ Oy Ox Oy 
Solution: Expressing this in terms of p and g, we have 
zZ=xp+yq+ pq (6.326) 
The subsidiary equation of the Lagrange-Charpit method for the Type IV case is 
dx — dy _ dz _ dp _ dq (6.327) 
x+f, ytf, wt+pxtaf,+pf, 9 0 
From the last two parts of (6.327), we have 
p=a, q=b (6.328) 


The general solution becomes 
V(x, y,Z,a,b) = z—-ax—by—ab=0 (6.329) 


Applying (6.318) gives 


Le (6.330) 
Oa 
OV 
—=-y-a=0 6.331 
ae ( ) 


Using (6.330) and (6.331) to eliminate a and b from (6.329), we have the singular 


solution as 
Z=-xy (6.332) 


6.13 GEOMETRIC INTERPRETATION OF NONLINEAR PDE 


The geometric interpretation of a nonlinear PDE was given by Monge. Let us 
consider the linear form of PDE first 


Ly Ry Oe ST Te (6.333) 
Ox oy 
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Note that dé is parallel to E, and the normal vector to the increasing curve E = (P, 
Q, R) is the directional vector of the intersecting surfaces can be expressed: 


N =(p,q,-)) (6.334) 


Z 


Monge cones of 
possible normals 


Figure 6.5 Geometric interpretation of the solution of nonlinear Ist order PDE 
The normal direction of the vector E is 


a ee (6.335) 


Dp + x +1 

More importantly, we see that the direction is an unique function of p and gq. 
However, for a nonlinear PDE, there is no one-to-one correspondence between p 
and q. To illustrate this, we consider a special case that nonlinearity in p can be 
expressed as polynomials: 


F(x,,2Z, DsQ) =4,P" +4, _p" +..ta,pt+bq—c=0 (6.336) 
Clearly, we have more than one pair of p and gq that satisfies F = 0. In other words, 
we do not have a unique direction n for the intersecting curves. The direction of the 
curve is now non-unique and Monge interpreted the possible direction of the curve 
forming by a cone, called the Monge cone. The idea is showed in Figure 6.5. 


6.14 JACOBI’S METHOD 


We saw in the last section that the Lagrange-Charpit method is for two variables x 
and y. This technique, however, cannot be extended to first order PDEs of more 
than two independent variables. The technique discussed in this section was 
proposed by Jacobi in 1836. Jacobi’s method can be applied to 3 or more variables. 
Therefore, it is more general than the Lagrange-Charpit method. However, for 
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Jacobi’s method to be applicable, the unknown z cannot appear in the nonlinear 


PDE explicitly. 
Let us consider the case of a first order PDE with three variables: 
F(X, %5%3, Pi» Po» P3) =9 (6.337) 
where z is the unknown and its derivatives are defined by 
Oz oz Oz 
= : = : = (6.338 
Py a, P2 ax, P3 a, ) 


The main idea of Jacobi’s method is to find two PDE involving two unknown 
constants: 

F(%%25%35 Pts Pas P3) = a (6.339) 

Fy (%,X2,%3, P1> Po» P3) = % (6.340) 
In this way, we have now three equations to solve for pi, p2, and p3. This idea is 
clearly a natural extension of the Lagrange-Charpit method. 

The total differential of the unknown z is 

dz = p,dx, + prdx> + p3dx3 (6.341) 

The integrable conditions of this equation are 
Op, _ a7z Op, 


= (6.342) 
Ox, —OXOX, OX 
Py _ 02 _ (6.343) 
Ox, —Ox,0X3 OX; 
Sp, __O°2_ _ py (6.344) 


Ox; 7 Ox30Xy ‘ OX 
Note that p1, p2, and p3 are functions of the variables only. We can now take the 
total differential of F and F with respective to x1 : 
OF OF op, Pe OF Op, . OF Op; 
Ox, Op, Ox, Op, Ox, Op, Ox, 
OF, OF, ; OF, Op, | OF; Op; =(0 
Ox, Op, Ox, Op, Ox, Op, OX, 
The subtraction of the result of multiplying (6.346) by AF/dp, from the result of 
multiplying (6.345) by d\/Op; yields 
OFLA) | OFF) Or, OFF) Os _ 


=0 (6.345) 


(6.346) 


(6.347) 
O(%1, 71) OC P2,P1) Ox O(p3, py) Ox 
where the Jacobian is 
OFF) OF OF, OF OF, (6.348) 
O(%P,) Ox, OP, Op, Ox, 
Similarly, we can differentiate F and F' with respect to x2 and x3 to get 
OFLA) , OFA) On, OFA) @s _ 5 eam 


O(X2,P2) OCP, P2) OX. OC P3, Po) Oxy 
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OFA) | OFA) a), OFA) py _ 


(6.350) 
O(%3,P3) O(P). P3) Ox3 OP, P3) OX3 
Adding (6.347), (6.349) and (6.350) leads to 
OFA), OFA) | OFA) _ sad 
O(%, P) O(Xy, P2) O(X3, P3) 
The result (6.351) is a consequence of applying the following identities: 
OF.) Ooo, OFLA) doy a AFA) , FAD _ 9 66 35 


O(P25P\) OX — O(Pj, Pr) OX, XOX, O(P2,P,) OCP» P2) 
Note that the last equation is the result of reversing the order of the two variables of 
a Jacobian leading to the negative value of the same Jacobian (see definition in 
(6.348)). Similarly, we also have 


OFA) Ops , AFF) a __& AFA) , AFA), _ 9 6353) 


O(P3,P,) Ox, OCP), P3) OX; OX30X, O(P3,P,) OCP), P3) 
OFF) Ps, OFM) OP, __ & OFA) , OFM) y 96.354) 
O(P3, Pz) OX, O(Pz,P3) Ox3 — OX30Xy O(P3,P2) (Pr, P3) 
Rewriting (6.351) explicitly, we get 
OF OF, OF OF OF OF, OF OF; _ OF OF, OF OF 
Ox, Op, Op, Ox, OX, OP, Op, OX Xs Ops Op; OX 
This equation can be rewritten symbolically: 
[F,F,]=0 (6.356) 
This is actually the compatibility of the two PDEs F = 0 and F’ = 0 (see Problem 4 
of Section 2.9, Sneddon, 1956). Similarly, we can also obtain: 
[F,F,]=0 (6.357) 
[F.Fy1=0 (6.358) 
Note that F’ = 0 is given while F| = 0 is not. Thus, we can view (6.356) as a first 
order PDE for F}. Its solution can be found readily using the Lagrange-Charpit 
method discussed in Section 6.12. The subsidiary equation of (6.355) using the 
Lagrange-Charpit method is 
dp, dx _ dp» _ dx, _ aps _ & 
OF OF OF OF OF OF 
Ox, Op, OX, Op, Ox; Op; 


=0 (6.355) 


(6.359) 


From the characteristics equations of (6.359), we can find F\ = a; and F2 = ap. 
However, after we obtain these solutions, we need to check whether they are 
compatible using 


3 
[Ale (2 AS) (6.360) 
“i OX, Op, Op, OX, 
By now, we have three equations for pi, p2, and ps: 
F=0, F.=qa, F,=a, (6.361) 


This completes Jacobi’s method, which will be illustrated in the following example. 
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Example 6.12 Solve the following nonlinear PDE using Jacobi’s method 
gan Py 2G (6.362) 
Ox, OX OX OX; 
Solution: Expressing this in terms of p; (i = 1,2,3), we have 
2 DX X3 +3 p50, + D3P3 =0 (6.363) 
The subsidiary equation is 
dx, dp Ax, dp, _ a; dp3 


= 7 5 = =— = (6.364) 
—2xX3  2piX3  -3x3-2pop; 9 py 2X + O23 
The first two parts of (6.364) give 
ay _ aD (6.365) 
“ Pi 
The solution of (6.365) is 
Fo= py = % (6.366) 
The fourth part of the characteristics equation (6.364) gives 
Fy = py =a (6.367) 
Their compatibility can be checked as 
[F,,F,]=p,:0-x,-0+0-1-0-0+0-0-0-0=0 (6.368) 
Thus, we can use (6.363), (6. 366), and (6.367) to find 
a 2a,x, +3a,x5 
RA» BAe B=-—— > (6.369) 
x, a, 
The solution can now be integrated from the total differential 
> Saye 
dz =“ dx, + aydxy —(A8 = ya, (6.370) 
xX, ay 
Integration of (6.370) gives the final solution as 
z=a,Inx, +a)x, aie + 5X3) +a, (6.371) 
a) 
Example 6.13 Solve the following nonlinear PDE using Jacobi’s method 
(x, +23)( OF gy pa (6.372) 
OX, OX, Ox, 


Solution: We can rewrite (6.372) as 
(Xp +.23)(P2 + ps)” +zp, =0 (6.373) 
In (6.373), the unknown z appears explicitly, and we cannot apply Jacobi’s method 
directly. Let us assume the following relation 
u(x}, X7,%3,2) =0 (6.374) 
Note that (2.374) is clearly a solution of (6.373) because it relates the unknown z 
with the variables. We further assume that z is a new variable: 
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oe, 
Oz OX, Ox, P 
ZX 6.375 
4 Dp 1 ax, ax, Ou P, ( ) 
OxX4 
Similarly, we have 
P, P, 
=-—, =— 6.376 
P2 P, P3 P, ( ) 
The derivatives of the new unknown u are 
po Be oS eS (6377) 
Ox, Ox, Ox OX, 


With the new unknown wu and its derivatives defined in (6.377), we can rewrite the 
PDE symbolically as 


F(%,%7%3,%4 FPF), P,,P)=0 (6.378) 
The original PDE now becomes 
(x) +43 +B) x4 RP, =0 (6.379) 
Jacobi’s method can now be applied to give a subsidiary equation: 
dx, — dF _ dx — aR dx 
mB 9 2m +eR+R) (R+R) 22+eVA+A) 6 soy 
aP, dx, dP, 


= - 
(Rt+PRy uh AR 
These characteristics equations can be solved to give 


Firh=a, h=h-Rh=a, y=xyh, =a, (6.381) 
It is straightforward to show that 
[f./))=[F,.3)=[%3,%]=0 (6.382) 
We can now find the derivatives from (6.379) and (6.381) to give 
Raa, R=3, 2R=m4/", R=R-a (6.383) 


X4 X4 + x3 


Thus, we can find w as: 


Poa tate ae [4 (ar, +dx3) (6.384) 
X4 2 X4 +X 


The solution can obtain by integration as 


u=a)X, +a; Inx,+ te (x5 —x3)+.Ja,a;(x, +x3) +44 (6.385) 


Finally, the solution of (6.373) can be found by making the following identifications: 


1 
r=) meee “hea, ew, “ed (6.386) 


a, a, a, 


The solution is 


In z+ A,x, +A, (X_ —x3)+.fA, (x, +3) +4, =0 (6.387) 
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6.15 HODOGRAPH TRANSFORMATION 


We will see in Chapter 7 that for fluid mechanics problems, Molenbroek in 1890 
and Chaplygin in 1902 applied Legendre transform to rewrite the unknowns as x 
and y, and the variables as the velocity components. This technique is, in general, 
called a hodograph transformation. We will show in this section that such 
transformation can convert a system of nonlinear first order PDEs to a linear one if 
the variables do not appear explicitly in the differential equations. 

In particular, we consider the following system of PDEs: 


Ou Ou Ov Ov 
A (u,v) — + B, (u,v) — +C, (u, v) — + D, (u,v) — = 0 6.388 
(uw 1) ar (ul v) ax (u v) at , (ut vy) ox ( ) 
Ou Ou Ov Ov 
A, (u,v) — + B, (u,v) — + C, (u,v) — + D, (u,v) — = 0 6.389 
5 (u,v) ar (U,V) ax (u,v) ar (u,v) ox ( ) 
We can use the hodograph transformation as 
x=x(u,v), t=t(u,v) (6.390) 
where 
O04 Y) 9 (6.391) 
(x, y) 
We can use the transformation as 
Ox Ot Ox ot 
ou av ou av ov au ov ou 
= ? > > = 6.392 
Ct = O(x, t) Ox O(x, ft) Ot O(x, ft) Ox  O(x, ft) ( ) 
O(u, v) O(u,v) O(u,v) Ou, v) 


Substitution of (6.392) into (6.388) and (6.389) gives the following linear system 
of first order PDEs as 


A ee —B, ai -C, ny +D, Ga =0 (6.393) 
Ov Ov Ou Ou 


A, aan ea —C, (u,v) a + D, (u,v) A =0 (6.394) 
Ov Ov Ou Ou 
Since the unknowns are x and f¢, (6.393) and (6.394) are linear PDE. 


6.16 SUMMARY AND FURTHER READING 


Most textbooks on differential equations do not cover the solution technique for first 
order PDEs. However, we have illustrated that first order PDEs actually appears 
naturally as transport, kinematic wave, or shock wave equations in engineering and 
mechanics. Airy’s method of solving nonhomogeneous hyperbolic equations is 
introduced. Geometric interpretation of the solution of linear first order PDE by 
Monge is discussed. For linear PDEs, we discuss the method of Lagrange, including 
the technique of the Lagrange multiplier, Pfaffian equations, exact equations, and 
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integrability of Pfaffian. For nonlinear PDE, we cover Lagrange-Charpit method and 
Jacobi’s method. Various special cases of the Lagrange-Charpit method are discussed 
and illustrated. Geometric interpretation of nonlinear PDEs is also discussed. 

For further reading, we recommend the books by Lopez (2000) and Snedden 
(1957). 


6.17 PROBLEMS 


Problem 6.1 Solve the following PDE 


Oy xy’ (6.395) 
Ox Oy 
Ans: z=yw(ax+y)+ : {6a°x" y* +4a>xy+a'x4 


12a” 


Problem 6.2 Show also that the solution of Problem 6.1 can be expressed as 


2= part y)+—5 -Aaxy’ ~y*} (6.396) 
12a 


Problem 6.3 Solve the following PDE 


—-a— (6.397) 
ox = =oOy 
Ans: z=y(axt+ y)+ tant 
(a@- fa) 
Problem 6.4 Find the solution of 
fp gh (6.398) 
Ox Oy Cz 
Ans: u =f(~, 
Zz y 
Problem 6.5 Find the solution of 
ga =x (6.399) 
Ox oy 
i oe y 
Ans: z° =2x+ f(=) 
x 
Problem 6.6 Find the solution of 
Oz 6/4 
x+3y)—- y— =0 6.400 
(x+3y) a ay ( ) 
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Ans: z= fly toy") 


Problem 6.7 Find the solution of 


Oz Oz 
y—-x— =0 
ox oy 
Ans: z= f(x’ +y’) 
Problem 6.8 Find the solution of 
as 
os y 


Ans: z= y—x+ f(x? -y’) 


Problem 6.9 Find the solution of 


3) 2 Oz (o4 2 
x4 + 2x } =0 
( ye ey Zz 


1 1 1 1 
Ans: ¢(—+ s—t )=0 
Z x+y Z x-y 


Problem 6.10 Find the solution of 


Ans: ¢(xy,3xyz— y’) =0 


Problem 6.11 Find the solution of 


Mer ME =F yoty) (x-y Ox +2y+z2) 


Ans: ¢[xy,(x+ y)(x+y+z)]=0 


Problem 6.12 Find the solution of 
7 Oz 
Ox 


we =x(z-2y) 


2 
1 
Ans: z=—~ 4+ f(x? +y’) 
yoy 


Problem 6.13 Find the solution of 


(6.401) 


(6.402) 


(6.403) 


(6.404) 


(6.405) 


(6.406) 
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2 2, OZ 2, 9, OZ ve) 
x(x" + 3° )—— yBx* + y7)— = 2z(y* — x") (6.407) 
Ox Oy 
xy x+y 
Ans: g(~ 7) =0 
Z Zz 


Problem 6.14 Find the solution of the following PDE 


Bile) 4a Syn) (6.408) 
Ox Oy 


passing through the curve 
z=0, x+y? =2x (6.409) 


Ans: x? + y? =2x+z° —4z 


Problem 6.15 Find the solution of the following PDE 


Gy)“ 26-3) = ew) (6.410) 
Ox Oy 
passing through the curve 
x=1, y=0 (6.411) 


Ans: x? +y’?+z=l+xz+y 


Problem 6.16 Find the solution of 


y= “) =0 (6.412) 


Ans: z? = 2ax+a’y’ +b 


Problem 6.17 Find the solution of 
Oz Oz &. 
+— =(—_\(_) 


(6.413) 
Ox Oy Ox oy 
Ans: z= axt—— +b 
Problem 6.18 Find the solution of 
ea + aS = HD (6.414) 


Ans: 22 =2(a+1(x+2)+b 
a 


Problem 6.19 Find the solution of 
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02.9 (CZ.9 2.9 _ 2 ,O%.9, 2 2 
oe GS +x°y «Ge (x +y") (6.415) 


Ans: z= 5 ba yo ay aay ad 


Problem 6.20 Find the solution of 


Oz, 2 OZ Oz. OZ 
2x: x —2 =0 6.416 
Zz GC? WG? OG (6.416) 
Ans: z = ay +(x’ —a) 
Problem 6.21 Two PDEs are given as: 
Oz 2 6/4 or4 2 
F kets = 0) 6.417 
Ge Go G8 (6.417) 
va care cl ye =0 (6.418) 
Ox, = OX, 


(1) Check the compatibility of these two PDEs. 
(ii) Use Jacobi’s method to find F2 = a. 
(iii) Finally, find the solution for z. 


‘s as 1 
Ans: (ii) F2 = pi = a; (iil) z = a(x, —Inx, +b 
X3 


Problem 6.22 Solve the following PDE by Jacobi’s method 


Oz Oz 
(—) x, +(—)"x, =z (6.419) 
Ox, Ox, 


1/2 1/2 1/2 
Ans: c =2(ax,) +2(bx,) +2{(a+)z} 


Problem 6.23 Find the solution of the following PDE 
Oz 9: Oz 2 
—y-(—y =0 6.420 
( ox, ‘am x, (6.420) 
Ans: z=+tax+ay+b 
Problem 6.24 Find the solution of the following PDE 


Oz Oz 
vy (—P -x¥ (= 
Ox 


a =0 (6.421) 


1 1 
Ans: z =—a!?x? + ae +b 
2 2 


Problem 6.25 Find the solution of the following PDE 
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Oz. Oz 
zZ— DS =0 


Ans: z = a(a'x - ay - b)? 


Problem 6.26 Find the solution of the following PDE 


a a 
2x -yS sy? -x? 
Ox ~ Oy 


Ans: 6(xy,z? +x? + y”)=0 


Problem 6.27 Find the solution of the following PDE 


xy a+ yz VF 20 ») 


Ans: ¢(x + y+z,xyz) =0 


Problem 6.28 Find the solution of the following PDE 


2,-OZ.2 OZ.» 2 2 
—) +(—) J=x°+ 
sg Ne) Gr! x+y 


[2 
Z =xVx" +a yyy at a call la 
2 “4b =a 


Problem 6.29 Find the solution of the following PDE 


Ans: 


Oz Oz 2 
t =! 
(xz—-y) Ae (yz—-x) ay Zz 


P(x y)1—z),(x+ y)+z)) =0 


)+b 


(6.422) 


(6.423) 


(6.424) 


(6.425) 


(6.426) 


CHAPTER SEVEN 


Higher Order Partial Differential 
Equations (PDEs) 


7.1. INTRODUCTION 


We have considered the theory for first order PDEs in the last chapter. It seems 
logical to discuss first order PDEs before we discuss higher order in the present 
chapter. However, historically, investigation of partial differential equations starts 
with second order, as many physical problems, like wave propagation, heat 
diffusion, and incompressible and irrotational flow, have to be modelled by second 
order PDEs. They include the work of Bernoulli, Euler, D’Alembert, Laplace and 
many others. Because of its importance in physical applications, most of the 
available results for PDEs are developed for second order. A separation chapter 
(Chapter 9) will be devoted solely to the discussion of second order PDE, including 
wave, diffusion, and potential equations. We will consider the classification of 
second order PDEs in detail here, leading to the hyperbolic type, parabolic type, 
and elliptic type. We will show that the canonical forms of second order PDEs with 
constant coefficients can always convert to nonhomogeneous Klein-Gordon 
equations, nonhomogeneous diffusion equation and nonhomogeneous Helmholtz 
equations. The solution techniques for these equations are covered in some detail. 

As reviewed by Selvadurai (2000a,b), nearly all existing textbooks on PDEs 
are restricted to the coverage of second order PDEs. For example, such PDE 
textbooks include some of the most popular textbooks on PDEs: Airy (1873), 
Bateman (1944), Carrier and Pearson (1976), Gustafson (1999), Evans et al. 
(2000), Gu (1989), John (1981), Heinbockel (2003), Myint-U (1987), Myint-U and 
Debnath (1987), Sneddon (1957), Petrovsky (1991), Sommerfeld (1949), Tricomi 
(1923), Drabek and Holubova (2007), Trim (1990), Farlow (1982), Zachmanoglou 
and Thoe (1986), Zill (1993), Zauderer (1989), and Zill and Cullen (2005). In 
reality, many phenomena need to be modeled by PDE of order higher than two, 
including the biharmonic equation, Onsager equation, Benjamin-Bonna-Mahony 
equation, Boussinesq equation, KdV equation, regularized long wave equation, and 
Hirota equation. 

In view of the importance of the biharmonic equation in elasticity and in fluid 
flows, Selvadurai (2000b) considered the solution of the biharmonic equation in 
detail, whereas higher order PDEs with constant coefficients were considered in 
Chapters 31 and 32 of Ayres (1952). 

Motivated by this shortcoming in most books, we present the biharmonic 
equation in detail, including a solution of the biharmonic equation expressed in the 
form of an integral equation similar to that of Poisson for potential theory. 
However, Green’s function method for the biharmonic equation is deferred to 
Chapter 8 while the variational method for the biharmonic equation is covered in 
Chapter 14. 
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Another main difference of the present chapter from most of the pre-existing 
textbooks on PDEs is our discussion of the factorization technique for solving 
homogeneous higher order PDEs, and symbolic methods for solving 
nonhomogeneous PDEs of higher order. For second order PDEs with non-constant 
coefficients, we discuss Monge’s method, and the Monge-Ampere method. These 
methods were not covered in most textbooks on PDEs, except Sneddon (1957) and 
Ayres (1952). 


7.2 CLASSIFICATION OF SECOND ORDER PDE 


There is something special about the second order PDE. For the case of two 
variables, all linear differential equations can be classified into three types of 
differential equations. Let us consider the following most general form of second 
order linear PDE: 
2 2 2 B 
SE dla gladly jag sae em) (7.1) 
ox OxOy dy" Ox oy 


We apply a general change of variables as 


u(x, y) > u(S,77) (7.2) 
Inversely, the new variables can be expressed in x and y as 
S=S(x,y), 7=7(%,y) (7.3) 
The mapping is arbitrary and the only requirement is having a nonzero Jacobian: 
Sx Sy 
jal? ee OT 2G (7.4) 
Me My; ox Oy dy & 
The first derivatives of u with respect to x and y are 
Ou _ Ou 0€ ig Ou On (7.5) 
ox 0& Ox On Ox 
Ou Ou og , cu on (7.6) 


Oy 0¢ oy On oy 
The second derivatives can be obtained by further differentiating (7.5) with respect 
to x as 
au 8 .du 0§ , Gu dn, _ 0 (Oy OF , Ou ae d du, dn , du an 
Ox? ax OE Ox On ax Ox OE Ox OE a? OG On Ox On ax’ 
It is important to note that the differentiation of the bracket term on the right hand 
side of (7.7) is conducted using the chain rule similar to that for (7.5). Finally, we 
get 

2 2 2 2 2 2 

Ou _ Ou Ey, 5 Ou Eon, Ou Om. , Ou HE, Ou Oy 


ae ae? & d&0n ox Ox Gn? Ox OE ax? ON ax? 


Similarly, the other two second derivatives can be obtained as 


(7.7) 


(7.8) 
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Qu du dé 0g 0§ On , 8n AF) a *u On ON ou Ore ou 0°n 
Oxoy = Ox Oy oe Ox Oy "oe oy én? ox Oy O& Oxdy On OxOdy 
(7.9) 


Gu _ Fu (>, 5 Ou dé On , ou Gy , ou PE Ft O19) 
ay ag? ay dein Oy Oy dP Oy OE Gy? On Oy 
Finally, substitution of (7.5), (7.6), and (7.8) to (7.10) into (7.1) gives 


qau ~ Oy 07u Ou Ou 


sre +C +D—+E—+Fu=G (7.11) 
0g aon =n — ESO 
= gf Cbs nos OS. OSS 
A= +B £C 7.12 
AC =) ax dy G& (7.12) 
Fy Teel I cc igch cA SLT WE Vgc Toc (7.13) 
ax ax ax Oy ax Oy dy Oy 
Ons 09 OF on,» 
c A=) +B a ——. (7.14) 
2 2 2 
Dan? pe ep (7.15) 
ox Ox0y éy* Ox oy 
2 2 2 
peal p21 eS pS op (7.16) 
ox Oxdy dy’ Ox oy 
F = F(x(é,7), (67) Gay) 
G =-G(x(E,7), ¥(E,7) (7.18) 


The mathematical form of (7.11) looks more complicated than (7.1). One may ask 
why we want to apply a more complicated change of variables to result in the 
system given in (7.11) to (7.18). However, the general mapping given in (7.11) 
allows us to search for a simpler form of second order PDE. 
A major property of this transformation is that 
=a 
B’ -44AC = a (7.19) 
J 

where J is the Jacobian of the mapping function defined in (7.4). The validity of 
(7.19) is proved as Problems 7.2 to 7.4 at the end of this chapter. That is, the sign 
of B?-4AC in (7.18) will remain the same after any valid coordinate transformation 
with nonzero J. If it is positive, it is always positive. If it is negative, it is always 
negative. If it is zero, it is always zero. Its sign is thus an invariant. In fact, this 
property allows us to classify the PDE using the sign of (7.19). This idea is first 
explored by Laplace and later refined by Bois-Reymond. We will next consider the 
idea of characteristics and its relation to the sign of (7.19). 
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7.2.1 Physical Meaning of Characteristics 


In particular, observing the mathematical similarity of (7.12) and (7.14), we look 
for the possibility of setting 

A=0, C=0 (7.20) 
The mathematical structure for both (7.12) and (7.14) is the same. Thus, they will 
be considered together here. In particular, we are looking for a solution that 
satisfies the following first order PDE: 


HE en Sey 0 (7.21) 
Ox Ox Oy oy 
There are two solutions for (7.21). Let us denote its solutions as 
Z=O(x%V)=Q, 2=H(%,V)=Cy (7.22) 


These curves are illustrated in Figure 7.1. 


2w=e W(x, y)=C, 


Figure 7.1 Solutions are propagating along two curves, which are called characteristics 


Taking the total differential of (7.22) gives 


dé = 06 dx + Os dy=0 (7.23) 
ox ov 
ip cet eo 8” phy (7.24) 
Ox ov 
Thus, the slopes of these can be found as 
jo ee (7.25) 


dx 0€ / Oy 
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_dy __ onl ox 
dx 0n/ oy 


These slopes A; and Az are depicted in Figure 7.1. Along these curves, the original 
PDE becomes an ODE, and these curves are known as characteristics curves. If 
these curves exist, the solutions are propagating along these characteristics. Note, 
however, whether these characteristics exist or not, depends on the given 
coefficients A, B, and C given in the original PDE given in (7.1). On the other 
hand, we can see that (7.21) is actually a quadratic equation for the ratio of the 
partial derivative of € or 77 with respect to x to that with respect to y. In particular, 
we have 


(7.26) 


OF ,05.6 Oe 06 
A | tC= ; 
ey BC! )+C=0 (7.27) 
On ,On,2 on ,On 7 
See, + BT )+C=0 (7.28) 


The two solutions of them are the same. Without loss of generality, we can pick the 
root of dy/dx for the two characteristics as: 


dy 0€/éx _ B+ B? -4AC 


7.29 

a dx 0&/éy 2A ve) 

4g, <= dnl _ B-VB" ~4AC (7.30) 
dx 0n/ oy 2A 


Clearly, the slopes of these characteristics are functions of A, B, and C. 


7.2.2 Sommerfeld’s Interpretation of Characteristics 


A closer look at the solutions given in (7.29) and (7.30) reveals a more in-depth 
physical meaning of the characteristics. The following interpretation was given by 
Monge in 1770 and summarized in the book by Sommerfeld (1949). Let us recast 
the second order PDE in (7.1) as 


2 2 2 
go en! 4c* ge oy (7.31) 
ax? Oxy ay? ox Oy 
Next, we can rewrite (7.31) as: 
Ar+Bs+Ct=@ (7.32) 
where 
_ Ou Ou Ou (7.33) 


oT S = ot = 
ax? éxéy ay” 
By adopting the notation that we used for the first order ODE introduced in the last 
chapter, we have 


prs g=— (7.34) 


Taking the total differential of both p and q, we get 
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d = (at = 2 aes Oy = rahe + sd (7.35) 
4 Ox’ ax” Oxdy - : 
2 2 
fete en hy (7.36) 
Oy Oxdy dy’ 


We now group (7.32), (7.35), and (7.36) in matrix form as 
A B Clr @ 
dx dy 0 |4s-=4dp (7.37) 
0 dx dy|\i|t dq 
We rewrite this symbolically as 
[4] {4} ={9} (7.38) 
For a certain problem, if@, dp, and dq are given on some curve, we can find 7, s, 


and ¢ if and only if the determinant of matrix A is nonzero. The determinant is 
obtained from (7.37) as: 


A= det|A| = A(dy)” — Bdxdy + C(dxy (7.39) 

Now, we observe that the slopes given in (7.29) and (7.30) satisfy the following 
equation 

A(dy) — Bdxdy + C(dx)? =0 (7.40) 

which is precisely the determinant of the system given in (7.37). In other words, we 

cannot solve for r, s, and ¢ if ®, dp, and dq are given on the characteristics. This 


situation is illustrated in Figure 7.2. Given data cannot be prescribed on any line 
that parallels characteristics, otherwise the problem cannot be solved. The root of 


(7.40) is 
dy _B¥VB?-4AC 
dx 2A 


which is precisely (7.29) and (7.30). Thus, A = 0 implies characteristics exists. 
When A # 0, 7, s, and ¢ are nonzero, we can find their derivatives as: 


(7.41) 


3 3 3 
p28" gaps a ee te (7.42) 
“Ox “  OxOy Oy 
Thus, differentiation of (7.32), (7.35), and (7.36) with respect to x gives 
Ar, + Bs, + Ct, = ®, (7.43) 
dr =r,dx+s,dy (7.44) 
ds =s,.dx+t,dy (7.45) 
Note that we have used the following identities in arriving at these results 
3 
22 ge, s,=t,= ee (7.46) 
ox Ox * @xéy 


Again, we can put (7.43) to (7.45) into matrix form as 
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A B C\(r.) [@ 


dx dy 0 |\s.7=4 dr (7.47) 
0 d& adayillt, ds 


Note that the matrix of coefficients is the same as that for (7.37). It means that we 
can solve for higher derivatives of u. This process of taking higher derivatives 
leads to the same coefficient matrix again and again. Therefore, if A # 0, all higher 
derivatives of u can be found. In other words, u can be expanded in Taylor series 
expansion at any points. Thus, the function u must be analytic, smooth, and 
continuous. Therefore, there is no jump or no wave-like signal from the solution of 
u if A#0. On the contrary, if A = 0 (and this results from assuming the validity of 
(7.20)), the solution is not continuous or representing a wave signal. Note that the 
arrival of the wave signal is an abrupt and discontinuous phenomenon. 
Consequently, we see that characteristics represent wave types of solutions. For 
differential equations with propagating solutions (or hyperbolic type), 
mathematicians use the term “characteristics” whereas physicists use the term 
“waves” for the same physical phenomenon. In fact, there is a Le Roux-Delassus 
Theorem in 1895 stating that any singular surface of a solution of a linear 
differential equation must be characteristics (Hadamard, 1923). 


\ 90x, y) = const 


==. 


Pp, gare known on I” 


Figure 7.2 Discontinuous solutions or wave solution along characteristics. Curve J with initial 
derivatives data cannot be parallel to characteristics 


Any second order PDE can now be classified according to how many roots exist for 
the characteristics, which only depends on the value of A, B, and C. Let us recall 
our terminology for second order hyperbola, parabolic, and hyperbolic curves. In 
particular, we have 


Ax’ + Bxy + Cy” + Dx+ Ey+F =0 (7.48) 


This equation for a curve is classified as a hyperbola, parabola, or ellipse according 
to: 
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>0 hyperbola 
B’-4AC =4=0 parabola (7.49) 
<0 ~ ellipse 


These curves are also known as conic sections. As illustrated in Figure 7.3, they all 
result from cutting a section of a cone. 

Recognizing the similarity of the mathematical structure between (7.1) and 
(7.48), Bois-Reymond in 1839 proposed the following classification of second 
order PDEs: 

B? -4AC <0: elliptic 
Classification {B?—4AC =0: parabolic (7.50) 


B’ —4AC > 0: hyperbolic 


ellipse 


parabola 


Figure 7.3 Hyperbola, parabola and ellipse as conic sections 


Note that the hyperbolic type of second order PDE was first proposed by Laplace. 
It was subsequently extended to the classification summarized in (7.50) by Bois- 
Reymond. Note that the hyperbolic case corresponds to the existence of two 
characteristics or the existence of wave-type solutions. The parabolic case 
corresponds to the existence of one characteristics or physically corresponds to a 
diffusion type of phenomenon. Finally, the elliptic type corresponds to no 
characteristics or physically corresponds to an equilibrium type of phenomenon. 
More discussions on the physical meaning of this classification will be given later. 
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7.2.3 Hyperbolic PDE 


Recalling that for the hyperbolic type, we have the validity of (7.20), thus (7.11) is 
reduced to 


2 
Gop iG (7.51) 


060n 06 on 


This can be rewritten as 


2 
Oe ge ees (7.52) 
0g0n 0& ON 
which is the first canonical form of the hyperbolic type of PDE, and where 
DD 2°28 F G 
d=s, @==, ==, g== 7.53 
B B f B e B a) 


For the hyperbolic type of PDE, there exists another canonical form. To see this, 
we can introduce a change of variables as 

€=st+t, n=s-t (7.54) 
This can be inverted to give 


1 1 
veer eG) (7.55) 


The first derivative of u becomes 
Ou Ou os | Ou ot 1 du Ou 


+ (7.56) 
0& Osdé Otd0E& 2 Os Ot 
Ou _ Ou Os ou ot 1 0u e4 (7.57) 
0n OsOn Ot On 2 0s Ot 
Subsequently, we have the second derivative as 
@u Af @ u_ du as , 0 udu a] _1 ou Su a sy 
dg8n 2\| 6s ds at OE Ot as At AE] 4 ds* a? 
Substitution of (7.58) into (7.52) leads to 
Ou Ou Ou Ou 
+ +e —+ fu= 7.59 
as? ar 1 as 1 at fi 81 ( ) 
where 
d,=2(d+é@), e=2(d-e), fr=4f, g,=4¢ (7.60) 


Equation (7.58) is called the second canonical form of hyperbolic type of PDE, 
which is mathematically equivalent to (7.52). 
Let us recall the one-dimensional wave equation as: 


2 2 
oe 27 Se (7.61) 
Ot Ox 
Introduce the following change of variables 
gee (7.62) 
c 


and we get 
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2 2 
Pa =e = uf (7.63) 
ox Os 
Thus, the second order derivative terms in (7.59) agree with those in the one- 
dimensional wave equation. In Section 7.3, we will discuss further simplification of 
the canonical form of the hyperbolic type PDE. 

Note that for the cases of constant coefficients we can show that the 
characteristics are straight lines. In particular, for a special case of constant A, B, 
and C, we find the characteristics as 

B=\B? 3440 B+\B? -4AC 


= =C,, n=y-————_— x=C, (7.64 
g=y aA x=C), n=y oF x=C, (7.64) 


For such case, we can show that the coefficients in (7.52) can be found as 


= 2AE-D[B- VB? -44C] 


2[2AC(4A’ +1)- B?] 


_ 2AE — D[B+\VB?-4AC] 


A2AC(44? +1)—B?] * (7.65) 
: AF 
2AC(AA +1)=B" 
AG 


5 
2ACTAL 41)=B° 


Example 7.1 Classify the following second order PDE. If it is hyperbolic, find the 
two characteristics and the canonical form of the PDE. 
Uy, —4uy, tu, =0 (7.66) 


Solution: Comparing (7.66) to the standard form (7.1), we have 
A=1, B=0, C=-4, D=1, E=0, F=0 (7.67) 


VB? -44AC =4>0 (7.68) 


Therefore, by the Bois-Reymond classification (7.66) is classified as hyperbolic 
type of PDE. The corresponding characteristics are 


dy 0£/@x _ B-VB?-4AC _ 


Thus, we find 


irs dé / dy 2A ™ (7.69) 
b dy On/éx _B+NB*-4AC _, (7.10) 
dx 0n/ oy 2A 
Integration of these two ODEs for y gives two equations 
y=-2x4+C, (7.71) 
y=2x+C, (7.72) 


In view of (7.22), we find the two characteristics as 
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€=yt+2x=C, (7.73) 
n=y-2x=C, (7.74) 
Thus, we have 
Oy oy Oy m., 075 
ox ov ox ov 
Substitution of (7.75) into (7.13) to (7.18) yields 
B=-16, D=2, E=-2, A=C=F=0 (7.76) 


Finally, the canonical form becomes 
Ou __1,0u_ Ou 
0é0n 8 0n 06 


) (7.77) 


7.2.4 Parabolic PDE 


For the parabolic type, we have only one characteristics. Let us set 


A=0, C#0 (7.78) 
Actually, for this case we must have 
B=0 (7.79) 
To see this, we consider 
A= A( oy apo 2 «(aE +E) ac (7.80) 
Ox ay" 
In obtaining the last of (7.80), we have ie the following identity 
B=2JAC (7.81) 
which is a natural consequence of the parabolic condition of 
B’ -4AC =0 (7.82) 


Note that (7.82) resulted from (7.29) and (7.30) if there is only one root for A. Now 
let us consider 


B= 7408 OM , p26 ON , én 08) 4 08 an 
ox Ox dx Gy ax Oy Soy By 


-2( aE + fo%| Vat ve) 0 


This first bracket is zero from (7.80), and thus the second bracket must not be zero. 
Thus, we see 


2 

c= Ae 2) 24 pom ON, (Oty =| fg, E21) 40 7.84) 
ox Oy oy Ox oy 

This is exactly the second bracket in (7.83) and therefore is not zero. This, of 

course, agrees with our expectation _in (7.78) because there is only one 

characteristics. Recall from (7.80) that 4 = 0 is equivalent to 


(7.83) 
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4-2 ag/ax JC  B _B (7.85) 
dx Od&lay JA 2AJVA 2A 
Thus, for constant coefficients the characteristics is 
ee ae ee) (7.86) 


2A 
To find the canonical form, we need to have the other variable n. There is not 
much discussion on this in the literature. In fact, we can choose any n, which is not 
parallel to €. Because of this, the canonical form for the parabolic case is not 


unique. 
Anyhow, the canonical form of the parabolic type is given symbolically as 
2 
a std LaLa ver (7.87) 
on 0g =—On 
where 
D E F G 
das eG =S; == == 7.88 
1G 1G fi G §1 G ( ) 
If A, B, and C are constants and thus (7.86) is valid, we have 
B 
G=Y-~- XQ, WH=xX=C, (7.89) 


2A 
The second of (7.89) is chosen arbitrarily (any line not parallel to € = C)) as long as 
the Jacobian of the mapping is not zero. For this case, we have J = —1. For this 
particular choice, we have 
2AE—DB D F G 

d= we > beer, ‘eer 8-7 (7.90) 
The most common parabolic type of second order PDE is a heat or diffusion 
equation: 


2 
OP ay of (7.91) 
ot ox 
Again, the constant a can be easily absorbed into x to yield 7 (compare (7.62)). We 
will further simplify the canonical form in Section 7.3. 


7.2.5 Elliptic PDE 


For the elliptic case, we have 

B’ —4AC < 0: elliptic (7.92) 
This implies that the characteristics are complex (i.e., no real solution). Let us write 
the characteristics as 


6, =é+in (7.93) 
$2 =5 in (7.94) 
Using (7.93) as the characteristics, we have 
MoS tED 2 4 pole +i OS +177) +c(CEtiMy _0 (7.95) 
ox ox Oy Oy 
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Note that 
Ag ti).a 05.2 4:05 ON Ona 
( re fat =) te ro = (7.96) 
OF +in).2 05.2 4.05 On a 2 
4.3% 7.97 
( ay at By 3 ay oy ( ay (7.97) 
OG +177) OS +i) _ OF OF | ON OF , ON OG) _ ON ON (7.98) 
ax dy dx Oy ae Oy Oy Gx’ oe Gy 
Substitution of (7.96) to (7.98) into (7.95) gives 
(A—C)+iB =0 (7.99) 
This implies 
A=C#0, B=0 (7.100) 


Similarly, we can also use (7.94) into (7.95). It is straightforward to show that this 
also leads to (7.99) (see Problem 7.1). 
The canonical form becomes 


2, 2 
cae cay Lae Lae ee (7.101) 
aé* On* 0§ On 
where 
D _E F G 
d==, @=2, faa, g=% 7.102 
re F aay ( ) 


We will now work out the details of (7.12) to (7.18) for the case of complex 
characteristics to find the explicit forms for (7.102). In particular, we first let the 
characteristics given in (7.93) be 


6 =&+in (7.103) 
Then, substitution of (7.103) into the first equation of (7.100) gives 
AEE — hz) + BE Sy ~My) + CE; —17,) = 0 (7.104) 
Substitution of (7.103) into the second equation of (7.100) gives 
2AEN, + BUEN, +.) +2CE,7, =0 (7.105) 
Differentiation of (7.103) gives 
£,=6, 00. €,=6, 70), (7.106) 
By employing the results given in (7.106), we find 
or =o, +2161, 1, (7.107) 
oF =& +26, — 0; (7.108) 


CySx =(6y ti, MC, +i7,) = 6,5, — MyM, +i7,6, +6,7,) (7.109) 
Adding (7.104) and i times (7.105) and using (7.107) to (7.109), we obtain 
AG? + BEG, + CC; =0 (7.110) 


which is of course equal to (7.95). More importantly, the solution of (7.110) is 
simply the solution of a quadratic equation, and is given as 


424 Theory of Differential Equations in Engineering and Mechanics 


6. &,+in, _-B-iN4AC-B? (7.111) 
6, &+in, 2A , 
Rearranging (7.111) gives 


—~B-iV4AC—B? 


g, tin, = (PACS 56, +n,) 
(7.112) 
= =B Ty 2 ° -B ce 4 
{Za vuae B ere Tid #| 


Comparing the real and imaginary parts of (7.112) results in 


—B ty J 2 =8 Sy | 2 
= ,+ 4AC-B >, n= 4AC-B° - (7.113 
Sx Fe ) 2A | 4 Fe 4 2A ( ) 
The second equation of (7.113) can be used to solve for &, and its result can be 
substituted into the first equation of (7.113) to give 
2An,, + Bn, 2Cn, + Bn, 


éy= » Sx 
*  Ja4c—B? V4AC —B? 


These equations are called Beltrami equations. Clearly, we can eliminate € from 
(7.113) to get a second order PDE for n. However, the resulting equation is not 
easy to solve. If A, B, C, D, E, and F are constants, we can actually assume a linear 
dependence of € and 7 in terms of x and y (Kevorkian, 1990) 

6=ax+PBy, n=yxt+dy (7.115) 
Comparison of the mathematical form of (7.64) and (7.86) with (7.115) shows that 
these characteristics are all straight lines for the case of constant coefficients. Using 
(7.115), we find 


(7.114) 


é, = B =—(2An, + Bn,) =-2Ay - BS (7.116) 
Comparing (7.116) and the first of (7.114), we obtain 
A= ss B= 2 (7.117) 


V44C -B J44C — B? 


Alternatively, we can also find 
€, =a =2An, +Bn, = By +246 (7.118) 


For simplicity, we choose 


0=0, vy=l (7.119) 
Thus, we have 
€=Bx-2Ay, n=x (7.120) 
With this particular change of variables, we have 
£=8, 6, =A, gosh. 9=0 (7.121) 
Consequently, we have 
A= AE? + BEE, + CE, = B°A-2BBA+44°C=A (7.122) 


C= Any +By,n,+Cyy, =A (7.123) 


Higher Order PDEs 425 


D = AE,, + BE, +CE, + DE, + EE, = DB-2AE = DB=EEE 4 aah 
7 ; 4AC-B? 
E = Any, + Bt + Cty, + Dr, + En, =D (7.125) 
F=F (7.126) 
Finally, we obtain 
DB-2AE D F 
Uge + Up, + uz +—u, +—u=0 (7.127) 
AV4AC-B? © A” A 
Therefore, comparison of (7.127) with (7.101) gives 
DB-2AE _ D F 
d= , e=—, f=-, gu (7.128) 
AN4AC - B? A A A 


A number of special cases of (7.101) will be considered. If the last four terms on 
the right hand side of (7.101) are zero, we have the Laplace equation. If d, e, and f 
are zeros and g is a function of € and 77, we arrive at Poisson’s equation. If only fis 
nonzero, we have the Helmholtz equation. Thus, the Laplace, Poisson, and 
Helmholtz equations are all elliptic type. Elliptic PDE are mainly for equilibrium 
type problems. 


7.3. CANONICAL FORMS OF SECOND ORDER PDE 

The canonical forms given in (7.51), (7.86), and (7.101) are the normal canonical 
form given in the literature. In this section, we will show that they can further be 
simplified. The following further reduction of canonical forms is adopted from the 
idea mentioned in a footnote on p.105 of Hadamard (1923). 


7.3.1 Hyperbolic PDE 


Recall the canonical form given in (7.59): 


Ou 
aa ae +d, a +e Ay + fiu=g) (7.129) 


Now introduce a change of variables: 


u(s,t) =e™*Pry(s,t) (7.130) 
Differentiation of (7.130) gives 
u, =e" (ay +y,) (7.131) 
u, =e"'(By +) (7.132) 
u,, =e Py, +2ay,+a°y) (7.133) 
Uy =e (yr, + 2By, + BY) (7.134) 


Substitution of (7.131) to (7.134) into (7.129) leads to 
Vo —Wy + Qa+d,)y, +(-2B +e)y, +(e? —B? +d,at+efB+ fy = ger 
(7.135) 
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Since a and B are arbitrary constants at our deposal, we can remove the two first 
order derivative terms by setting 


a=-d,/2, B=e,/2 (7.136) 
Thus, the governing equation for y becomes 
2 42 
e—d ae 
Vos Yu t — + fly = ge 08 (7.137) 


The idea of assuming the mathematical form of (7.130) to remove the first order 
derivative terms has been mentioned on page 105 of Hadamard (1923). Therefore, 
all hyperbolic second order PDEs with constant coefficients can be transformed to 
a nonhomogeneous Klein-Gordon equation, which is first derived by Klein in 1927 
and Gordon in 1926 for the relativistic motion of charged particles in the 
electromagnetic field. This major and important conclusion has not been mentioned 
explicitly in any textbook on differential equations. 


7.3.2 Parabolic PDE 


Let us start with (7.87) that 


2 
cody tee f= 8 (7.138) 
7 7] 

Consider a change of variables as 

u(n,€) =e "y(n, €) (7.139) 
Differentiation of (7.139) gives 

u, =e" Pe —aye %7* Ps (7.140) 
ug =e Py, + Bye Oth (7.141) 
Unn = gry. — aye + g?ye er (7.142) 


Substitution of (7.140) to (7.142) into (7.138) gives 
Won + (20+ €, Wr +d Wz + (a? + Bd, 1,4 frw=ge"* (7.143) 


Since @ and § are arbitrary constants at our deposal, we can remove the second and 
fourth terms on the left of (7.143): 


1 ee -4f 
a=-e, B=— 7.144 
x Ad, ( ) 


Thus, we can use the following transformation 
a, (4-4) 


u(n,éy=e > “4 w(n,é) (7.145) 


such that the we have converted the mathematical problem into the solution of the 
following differential equation 


2 
— +d, ea = gem Bi (7.146) 
1 
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This is a nonhomogeneous diffusion equation. Therefore, we can see that all 
parabolic type PDEs with constant coefficients can be converted to a diffusion 
equation with a nonhomogeneous term. 


7.3.3 Elliptic PDE 
In Section 7.2.5, the canonical form for the elliptic type PDE is obtained as: 
Ou Oru Ou _ Ou 
; as +e—+ fu+g=0 (7.147) 
0&? On 0g = =On 
Adopting the same transformation used in the last two sections, we have 
u(7,6) =e OTP) y(n, 6) (7.148) 
Differentiation of (7.148) gives 
= E 1 
Sel ig 5 Bv) (7.149) 
“tae 1 
1, Se its iad (7.150) 
1 
i= eo (ant BEV 2 (Vion -ay, + gow) (7.151) 
(a £ 1 
Ugg = € OTP ce — BY + BW) (7.152) 


Substitution of (7.148) to (7.152) into (7.147) gives 
Wee +Won +yw.(d-B)+y,(e€-@) 


1 1 - _ (7.153) 
HZ (a? + B°)— (Bd + ae) + fly = ger” 
We can remove the first derivative terms by setting 
a=e, B=d (7.154) 
Consequently, we obtain 
2,52 
Veg +V mn * [, 2) = gelor* Pe? (7.155) 


This is a nonhomogeneous Helmholtz equation. 

We conclude here that, for linear second order PDEs of two independent 
variables with constant coefficients, hyperbolic, parabolic and elliptic type PDEs 
can be converted to solving the canonical forms of the nonhomogeneous Klein- 
Gordon equation, nonhomogeneous diffusion equation, and nonhomogeneous 
Helmholtz equation, respectively. To our best knowledge, this observation for 
canonical forms of hyperbolic, parabolic, and elliptic type PDEs has not been 
reported in any book on differential equations or mathematical physics. 
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7.4 SOLUTIONS OF CANONICAL FORMS OF SECOND ORDER PDE 


As we have seen, there are three types of second order PDE. They are hyperbolic, 
parabolic, and elliptic. In the last section, we have demonstrated that all second 
order PDEs with constant coefficients eventually can be converted into three types 
of differential equations, namely the Klein-Gordon equation, diffusion equation and 
Helmholtz equation, all with nonhomogeneous terms. Clearly, the importance of 
these PDEs has not been noticed previously. We will briefly show here that all of 
these PDEs can be solved by using a technique called separation of variables. More 
discussion on the separation of variables will be given in Chapter 9. 


7.4.1 Nonhomogeneous Klein-Gordon Equation 


The Klein-Gordon equation is a dispersive wave equation, which was proposed 
independently by Oskar Klein in 1927 and Walter Gordon 1926 for modelling 
motion of a spinless charged particle in the electromagnetic field. In mechanics, the 
Klein-Gordon equation appears naturally in modelling the wave motion of a 
vibrating rope resting on a Winkler type foundation. Traditionally, the Klein- 
Gordon equation without a nonhomogeneous term can be expressed as: 


1 
Wor = Vin tOW (7.156) 
c 
Let us assume the following separation of variables 
y = X(x)T(t) (7.157) 
Substitution of (7.157) into (7.156) leads to 
X= XP +e XT (7.158) 
c 
By dividing through by X7, this equation can be simplified to 
ree ee ee (7.159) 
x  ¢ T 


Since X is only a function of x whereas 7 is only a function of ¢, the only possibility 
is that the left hand side and the right hand side are both constant. Note that we 
have assumed a negative value for this constant in the last term of (7.159). 

The two ODEs resulting from (7.159) are 


X"+V?X =0 (7.160) 
T+c7?(a@? +A4*)T =0 (7.161) 
The solutions of these equations are 
X = Asinax+Bcosax (7.162) 
T =Csin at + Dcos at (7.163) 


where 


o=cNae +7? (7.164) 


Thus, the general solution becomes 
y =(AsinAx+ BcosAx)(C sin wt + Dcos at) (7.165) 
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The value of A must be determined by the boundary condition. The following 
example illustrates the procedure in obtaining the eigenvalue 1. 


Example 7.2 Solve the following Klein-Gordon equation with appropriate 
boundary conditions: 


v=, tew, O<x<L, t>0 (7.166) 
oe 
y(0,t)=w(L,t)=0, t>0 (7.167) 
y(x,0)=0, y,(x,0)=v, O<x<L (7.168) 
Solution: The boundary condition can be expressed as 
y(0,t) = X(0)T(t) =0, t>0 (7.169) 
y(L,t)=X(L)T(t)=0, t>0 (7.170) 
whereas the first initial condition is 
y(x,0) = X(x)T(0)=0, O0<x<L (7.171) 
Thus, we must have 
X(0)=X(L)=0, t>0 (7.172) 
T(0)=0, O0<x<L (7.173) 


Note that the boundary conditions are homogeneous and thus correspond to an 
eigenvalue problem of differential equations for X. Substitution of (7.162) into the 
first part of (7.172) leads to 


X(0)=B=0, t>0 (7.174) 
Substitution of (7.162) into the second part of (7.172) leads to 
X(L)= AsinAL=0, t>0 (7.175) 


Since A cannot be zero (otherwise the solution is identically zero), we must have 
the eigenvalue equation as 

AL=na, n=1,2,3.... (7.176) 
Although there are infinite eigenvalues, the boundary condition cannot be satisfied 
by arbitrary values of the separation constant. This is precisely the property of the 
eigenvalue problem. Thus, the eigenvalues and their eigenfunctions are 


A, ==, n=1,2,3,... (7.177) 
L 
X, (x) = Asin A, x = Asin (7.178) 


Note that if we choose a positive separation constant in (7.159), the fundamental 

solution for X is sinh and cosh. Consequently, the second part of (1.172) cannot be 

satisfied. Thus, hyperbolic sine and cosine cannot be used as the basis of 

eigenfunction expansion. This is the reason why we cannot take positive value in 

(7.159) for the separation constant 17. 

For the initial condition, substitution of (7.163) into the first of (7.173) gives 

T(0)=D=0 (7.179) 

The fundamental solution becomes 
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y(x,t) = ys sin J, xsing,t (7.180) 


where 


o, =cfo? +A (7.181) 


Differentiation of (7.180) gives 


y, (x,t) = Ye Q, Sin A,,x cosa, t (7.182) 


nn 
n=l 


Substitution of (7.182) into (7.168) leads to 


io.0) 


Vi(x,0) =v) = )¢,@, sin A,x (7.183) 
n=l 
Multiplying both sides ane a sine function and integrating from 0 to ZL, we have 
max . nax 
of sin dx = Z o,[ sin sin =a ae (7.184) 


Note the following results, we have 


L 
| i a (7.185) 
0 L mm 
L =): # 

} bi i ee aids (7.186) 

L L =L/2, m=n 

Thus, substitution of (7.185) and (7.186) into (7.184) gives 
2 
c, =——2 arg y+] (7.187) 
a) 


n 


Finally, we get the solution as 
(x,t) = reas ral 1)"*! 4.1]sin Z,xsing, t (7.188) 


where 


O.=e)a ee, ih, ==, n=1,2,3,.. (7.189) 


Once the solution for the homogeneous Klein-Gordon equation is obtained, the 
nonhomogeneous case can be solved either by the method of undetermined 
coefficients or by the method of variation of parameters discussed in earlier 
chapters. Alternatively, for the nonhomogeneous Klein-Gordon equation, we can 
also solve the problem by Green’s function method, which will be discussed in 
Chapter 8. Let us consider the following nonhomogeneous Klein-Gordon problem: 


Vor = Vy te y +f (x0), O<x<1,. 420 (7.190) 
Cc 


VO1)=20, WL)=2,0, t>0 (7.191) 
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VAx0)= fox), V0) =f), O<x<L (7.192) 
We will not go through the details of analysis, but the solution can be found using 
Green’s function method as (Section 4.1.3-4 of Polyanin, 2002) 


6 rl L 
VAD=S | WOGxg.0ds+] HOG Nd 


= f, if He aG et vedede (7.193) 
+f Ol (uEt-Mgaodr-e fg, Oy (4ét-De,dr 
where 
G@,£)= resins sin(A,é) ~— (7.194) 
A, = a (7.195) 


Green’s function is denoted as G(x,éf) for the Helmholtz equation in a finite 
domain. It was derived by Sommerfeld in 1912. For another domain and another 
boundary or initial conditions, the reader can refer to the handbook by Polyanin 
(2001). 


7.4.2 Nonhomogeneous Diffusion Equation 


Let consider the following nonhomogeneous diffusion equation 


u, =0°u,,+f(%t), OSxSL, t>0 (7.196) 

subject to the following initial and boundary conditions: 
u(0,t)=0, u(L,t)=0, t>0 (7.197) 
u(x,0)= (x), O<x<L (7.198) 


First, let us consider the eigenvalue problem of the homogeneous form of (7.196) 


(i.e., f = 0): 


Ou, =u, OSxSL, t>0 (7.199) 
u(0,t)=0, u(Z,t)=0, t>0 (7.200) 
u(x,0)=@(x), O<x<L (7.201) 
Assuming a separation of variables, we have: 
u= X(x)T(t) (7.202) 
Substitution of (7.202) into (7.199) leads to 
a? X"T = XT' (7.203) 
By dividing through by X7, this equation can be simplified as 
ACS Tg (7.204) 
Xx oT 


Since X is only a function of x, whereas T is only a function of ¢, the only 
possibility is that the left hand side and the right hand side are both constant. Note 


432 Theory of Differential Equations in Engineering and Mechanics 


again that we have assumed a negative value of this constant in the last term of 
(7.159). 

In the following discussion, we will only focus on the discussion of X. In 
particular, we have from (7.204) 


X"+A7°X =0 (7.205) 
The solutions of these equations are 
X = Asindx+Bcosax (7.206) 


The value of 7 must be determined by the boundary condition. Substitution of 
(7.206) into (7.200) leads to 


X(0)= X(L) =0 (7.207) 
Substitution of (7.206) into the first part of (7.207) gives 
B=-0 (7.208) 
The second part of (7.207) leads to 
sindL =0 (7.209) 
That is, we require 
AL=nn (7.210) 
Thus, there are infinite discrete eigenvalues given by 
A, =nall, n=1,2,3,.. (7.211) 
The eigenfunction that corresponds to this eigenvalue is 
X,, (x) =sin(4,x), N=1,2, 3.66 (7.212) 


Thus, we have found eigenvalues and its eigenfunction expansion. 
We now return to the nonhomogeneous problem given in (7.190) to (7.192). 
In particular, we assume the following eigenfunction expansions: 


u(x,t) = es (1)sin(A,x) (7.213) 

n=1 
ft= > f,Osin(A,x) (7.214) 

n=l 
d(x) = Dc, sin(Z,x) (7.215) 

n=l 

where T,, is to be determined and 
fu) = 2] fen)sin (na x/L) ds (7.216) 
2 ft 

é,= =|. (x) sin (na x/ L)dx (7.217) 


As we will discuss in Chapter 10, all functions can be expanded in an infinite series 
of the eigenfunctions (in this case sine function). More detailed discussions on 
eigenfunction expansion are given in Chapter 10. 
Substitution of (7.213) and (7.14) into (7.196) gives 
dT, 
paler +0727, - f,]sin(nx/L)=0 (7.218) 
n=l 
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In view of the governing equation for X given in (7.205), we can rewrite (7.218) as 


dT, 
—24+@7kAT, = f, (7.219) 
dt 
Substitution of (7.213) and (7.215) into (7.198) gives 
T, (0) =, (7.220) 
Multiplying (7.219) by the oe factor, we have 
“the a Aer t= fe a Aer (7.221) 
Integration of (7.221) gives 
arkAzt tO atkaes 
Tet =F 0)4 } fer Pa ds (7.222) 
0 


Using the initial condition (7.220), we obtain 
Oy) t ey, 
T, =c,e% ‘a +( fe7 "ds (7.223) 
0 


Substitution of (7.223) into (7.213) gives the final solution for the 
nonhomogeneous diffusion equation given in (7.196) with boundary conditions 
(7.197) and (7.198) as 


u(x,t) = ee = et Gao df f,e* kan (s- "as sin) (7.224) 


n=l 


where 4, is given in (7.210). 


7.4.3 Nonhomogeneous Helmholtz Equation 


The Helmholtz equation is also known as the reduced wave equation. To see why 
this is so, let us consider a two-dimensional wave equation as 
1 
Poet Py = ain (7.225) 


Physically, this PDE models the vibrations of a rectangular membrane with no 
bending. Let us now consider a harmonic wave type of solution in the form: 
Y =y(x, ye (7.226) 
where ais the circular frequency of the wave. Substitution of (7.226) into (7.225) 
gives 
a 
Woy + yy + == =0 (7.227) 
c 
This can be rewritten as the standard form of the Helmholtz equation: 
Vax tWyy thy =0 (7.228) 
This is the reason why it is referred to as the reduced wave equation. To solve 
(7.228), we can introduce the following separation of variables 
y= X(~)Y(y) (7.229) 
Substitution of (7.229) into (7.228) yields 
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x" ”" 
$47 325 (7.230) 
xX Y 
Similar to the argument used in the previous sections, we can rewrite (7.230) as 
x” 2 ¥ a re ee 
=A’, —=-1, A+ =k 7.231 
: pac, AP + As (7.231) 
Thus, we have two ODEs as: 
X"+A7X =0 (7.232) 
Y"+AY =0 (7.233) 
The general solutions of these equations are: 
X = AsindA,x+ Bcosa,x (7.234) 
Y = AsinA,y+ Boos Ay (7.235) 


Clearly, this set of solutions resembles vibrations of a membrane with a fixed 
boundary on the circumference of a rectangular membrane. However, if there is no 
physical meaning attached to a PDE, we may set the related differential equations 
as 


x" ¢ Fg 2_ 92 _ 22 
=a) ee. —-AZ =k 7.236 
pach aay Ak (7.236) 
In this case, we may have the solutions as 
X = AsindAx+ BoosaAx (7.237) 
Y = AsinhdA,y+ Bcosha,y (7.238) 


This solution is valid provided that the third equation of (7.236) is satisfied. 
However, (7.238) cannot model vibration-type solutions for a membrane. Whether 
the solution set given in (7.235) or (7.238) is valid, it depends on the condition to 
be imposed on the boundary. 


Example 7.3 Consider the waveguide problem of a rectangular hollow tube of a 
conductor. The problem can be mathematically prescribed as a three-dimensional 
Helmholtz equation with appropriate boundary conditions: 


Woe Woy + Wag +k? = 0, O0<x<a, 0<y<b (7.239) 
y(0,y)=0, w(a,y)=0 (7.240) 
y(x,0)=0, w(x,b)=0 (7.241) 


with k being positive. The boundary value problem is illustrated in Figure 7.4. The 
wave modes are normally referred as TM (transverse magnetic) or TE (transverse 
electric). 


Solution: Using separation of variables, we can assume 


V(X, ¥.2) = X(X)Y(y)Z(z) (7.242) 
Substitution of (7.242) into (7.239) leads to 
on ) a, AY: (7.243) 
x Y Z 


Thus, the general solution for X is 
X = Asind,x+ Bcosa,x 


Figure 7.4 Problem of rectangular waveguide 


The first condition of (7.240) leads to 
B=0, Aaz=nz 
Thus, the eigenvalue becomes 


Similarly, we can find the eigenvalue for A2 as: 
mn 


ay 
The solution for yw becomes 


y/(x, y,2) = sin“) sin( a D678 
a 


and 
nw 


( 
a 
As = Put = ke — Kenn 


NIT MIT 
— [2 +(—/y 
a b 


el a ld 


We can rewrite (7.249) as 


where 
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(7.244) 


(7.245) 


(7.246) 


(7.247) 


(7.248) 


(7.249) 


(7.250) 


(7.251) 


This is called the cut-off frequency of the waveguide and it is clearly discrete. 
Physically, for the case of electromagnetic waves, the value of k is given by: 


ke? =o ple 


(7.252) 


where 1 and ¢ are called the permeability and permittivity of the free space, and 
is the circular frequency of the electromagnetic waves. For small values of wave 
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frequency, it clear from (7.250) that Ann is purely imaginary. For this small 
frequency, (7.248) shows that the solution decays exponentially with z. That is, 
waves with low frequency attenuate quickly along the waveguides and therefore are 
cutoff. The phenomenon is caused by the fact that the rectangular tube is of finite 
size and only waves of certain discrete frequencies can propagate along it. More 
discussion of electromagnetic waves is presented in Volume 2 of this book series. 


Let us now consider the following nonhomogeneous Helmholtz equation with 
prescribed boundary condition: 


Wx +Wyy, +Ay = =f1% y) (7.253) 
YO.) =A), Way) =f (7.254) 
W(x,0) = x(x). W(x,5) = fax) (7.255) 


This problem can be solved by using Green’s function method, which will be 
considered in Chapter 8. However, we will simply quote the following solution 
from Section 7.3.2-15 of Polyanin and Zaitsev (2002): 


aepb 
veav=f | fGmoC, y.g.mdndé 
oe ag 
6g og 
a 0G 7 0G 
+] BOIS Coy. Ma0d SY MES O28 Mpaods 


b b 
+] AMIS r.EMeodn-[ HOMO E Mead (7.256) 


where 


Ghee 4 >» sin(p,,x) sin(q,, y) sin(p,,€) sin(q,,77) (7.257) 


n=l m=1 Pr + In —A 
2 nN 2 MIT 
ee ed 7.258 
Pu => Im == (7.258) 


For other domain or boundary conditions, we refer to Polyanin and Zaitsev (2002). 
We now consider the Helmholtz equation in polar form. In cylindrical 
coordinates, the Helmholtz equation can be expressed as 
2 2 2 
Ori POU OOM Oe oi (7.259) 
or ror roo a 
Using the separation of variables, we have 
u = R(r)O(@)Z(z) (7.260) 
Substitution of (7.260) into (7.259) gives 
2 2 2 
1 @R aR, L1d@ 1dZ 1 _y (7.261) 
Rede’ rdr Pr Ode? Z ad’ 
This leads to the following ODEs as: 
2 
ta ong (7.262) 
O de” 


Higher Order PDEs 437 


wz .¢ a6 
<9 =a? -k 7.263 
7 ae (7.263) 
2, 2 
Ee Ae TE P26 (7.264) 


R dr’ r dr r 
Note that (7.264) is a Bessel equation. The solution of ©, Z, and R are 


O = Acosm@+ Bsinmdé (7.265) 
Z = Ccosh[Va? —k? z]+ Dsinh[Va? —k?z] (7.266) 
= EJ, (ar)+ FY,,(ar) (7.267) 
In view of the physical requirement of 8, we must have the periodicity of © 
O(6 + 27) = O(0) (7.268) 


Consequently, m must be an integer. 
For the special case that a = 0, we have u being independent of z, and (7.259) 
becomes 


2 2 
OD Oe 6 (7.269) 
ar? rér r? ae 
Introduction of a separation of variables gives 
u = H(r)T(@) (7.270) 
Substitution of (7.270) into (7.269) gives 
1.@H idt, 11d'T 4, 


—+- = +k? =0 (7.271) 
Hd? +d Tr do 


This leads to 
2 
IO ei = (7.272) 
T do 
2 2 
NEE ee eG (7.273) 
dr? sr dr r? 
Again, we have the solution of T being 
T = Acosm@+ Bsinmdé (7.274) 


where m is an integer. Equation (7.273) is of Euler type and using the standard 
change of variable of Euler ge ee 


J, (kr) + DY,, (kr) (7.275) 
In spherical none the rast equation can se expressed as 
1 0 1 O..  u 1 5 
ru) + sin0 + +ku=0 7.276 
sae Pane 60° 8G” SnD Bp a = 
Again using the separation of variables, we have 
u = R(r)P(@)®(g) (7.277) 
Substitution of (7.277) into (7.276) gives 
11 f 1 1d. t 1d’. 25 
oe ee ae +k* =0(7.278 
Ree = 'pmoaett oD sin? ® dg? vee 


The governing equations become 
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2 
BOF sine (7.279) 
@ do 
2 
- d ene“ 540 sr =0 (7.280) 
sin@0 dé dé sin* 0 
(Me ye dt 
rR)|+k =0 7.281 
ae gi th P -S (7.281) 
where J is a separation constant. The solution for ® is 
® = Acosm@g+ Bsinme (7.282) 
The solution of P can be recognized by using the following change of variables: 
x =cos0 (7.283) 
Using this change of variables, (7.280) can be transformed to 
d 2 dP m 
l-x + (A P=0 7.284 
ra ee ( 2? ( ) 
This can be shown as equivalent to 
2 2 
(=x?) 9x s4y)-“ 1 =0 (7.285) 
dx dx lx 
where we have set 
A=I(1+1) (7.286) 
Equation (7.285) is the associated Legendre equation and the solution is 
P(@) = CP" (cos 0) + DQ)" (cos @) (7.287) 
Finally, to find the solution of R in (7.281) we first rewrite it as 
2 
a a = +(1 a \R=0 (7.288) 
dp pdp p 
where 
p=kr (7.289) 
We can introduce another change of variables as 
pols (7.290) 
Je 
Differentiation of (7.290) gives 
dR 1 Sy 1 ds (7.291) 
dp 2%ppe Jpdp 
2 2 
1 
‘ ‘ == _ _ é “ c (7.292) 
dp 4p? p?’dp Jpdp 
Substitution of these results into (7.288) gives 
2 
Se ie (7.293) 
dp’ pdp p 


This is precisely the Bessel equation and consequently the solution for R is 
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1 


Substitution of (7.282), (7.287) and (7.294) into (7.277) gives the final solution in 
spherical polar coordinates. More discussion of this spherical Helmholtz equation 
will be given in Chapter 9 (Section 9.4). In particular, a new function called the 
spherical Bessel function can be defined to simplify the solution form given in 
(7.294). 


7.5 ADJOINT OF SECOND ORDER PDE 


In an earlier chapter, we have discussed the adjoint differential equation of an 
ODE, and demonstrated that the adjoint equation is actually the governing equation 
for the integrating factor of the original ODE. In this section, we will extend this 
idea of the adjoint equation to second order PDEs. The result discussed in this 
section was first derived by De Bois-Reymond in 1889 and was also derived by 
Darboux in 1915. Recall from our earlier discussion that solving the adjoint 
problem of an ODE may be as difficult as solving the original ODE. Nevertheless, 
it is an important concept in solving higher order partial differential equations, but 
this is normally not covered in most textbooks on differential equations. We will, 
however, see the importance of adjoint PDEs in Green’s function in Chapter 8. Let 
us consider a general form of linear second order PDE as 
2 2 2 
Lu) = Ae* + po gC ep oe ret (7.295) 
ox OxOy dy" Ox 
We now consider the product of another function v, which is the solution of the 
adjoint problem, with function L(u) defined in (7.295) as: 
Ou Ou Oru Ou Ou 
eras ad +Cy—,+Dv mt ae + Fuv (7.296) 
x 


OxOy oy Ox 
We first note the following identities: 
2 2 
Sie gg (7.297) 


ax? ax? ax Ox Ox 
O’u—s« (By) O. Gu, 8 OBvy. A. du, €. OBY 
u = By 


Oxdy OxOy ox ay? oy Ox )= oy Ox” Ox i oy ) 
(7.298) 
2 2 

en <nu e ) i ey Os (7.299) 

oy oy Oy Oy Oy 

Ou 0 é 

Dv a u Ay (—Dyv) = (Duv) (7.300) 
Be w( Ey) = < (Ew) (7.301) 


Using the identity in (7.301), we note that (7.298) can be written as: 


440 Theory of Differential Equations in Engineering and Mechanics 


2 2 
you ye (By) 1 a ay hn ay eo) 
OxOy Oxdy 2|0x oy Oy” oy Ox Ox 


(7.302) 
Substitution of (7.297) to (7.302) into (7.296), we get 
Fn aay (er (7.303) 
Ox Oy 
where 
2 2 2 
M)=24 +2 Bye Cv oODv OEV | ry (7.304) 
ox Ox0y oy" Ox Oy 
ad eee ae” (7.305) 
Ox ox 2 ov oy 
Gp. he ae ea (7.306) 
ov 2 ox ox 
The functions X and Y given in (7.305) and (7.306) can further be rewritten as 
Fe La a | PLA ge cai Ln OM ¢ 
ox Ox 2 Oy oy ox 2 oy 
epg ce a eee Ss 
2 Ox Ox oy Oy oy 2x 
Equation (7.303) can be recast as 
vL(u) —uM (v) was (7.309) 
Ox Oy 
This identity is sometimes referred to as the Lagrange identity. The adjoint of L(w) 
is M(v). 
To find the condition of the self-adjoint PDE, we first expand M(v) as: 
2 2 2 
MGj=4 pe a, 
ax? Oxdy éy" Ox 
A 10B 1 
ye ec + g Dav eae ws E) (7.310) 
Ox Ox 2 Oy Oy Oy 20x 
il (C4 el pio? ol 
Ox Ox 2 Oy Oy Oy 20x 
Therefore, the self-adjoint condition for linear second order PDEs are: 
ca 5 (7.311) 
ox 2 oy 
OC LOB _ (7.312) 
Oy 2x 


Many physical based second order PDEs are self-adjoint. Actually, the self-adjoint 
system occupies an essential role in the development of physical sciences and 
mechanics. The identity given in (7.309) can be used to form a generalized form of 
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Green’s theorem of any linear second order PDE. However, such consideration will 
be deferred to the Chapter 8 on Green’s function method. 

This formulation can be easily generalized to the case of m variables as 
(Hadamard, 1923): 


ml 


L(u) = Lae eu 5. TCU (7.313) 
i,k=1 
with 
An = Agi (7.314) 
The associated adjoint PDE becomes 
m o 
M(v)= A, By)+Cu= 7.315 
=> ga 4rd 5, (By) (7.315) 
The corresponding Lagrange identity is 
IX, Ox. WX 
eee ec be Os (7.316) 
Ox, = OX, OX, 
where 
X,(u,v) = 4 a aa male 5 OA (7.317) 
ik ik XK, 


To see the validity of i Sato we can set m = 2 to recover equations 
(7.303) to (7.308) as a special case. Readers are advised to check this by 
themselves. 


7.6 SELF-ADJOINT CONDITION FOR SYSTEM OF SECOND ORDER 
PDE 


In this section, we are going to extend the idea of finding adjoint PDEs to the case 

of a system of coupled second order PDEs. To the best of our knowledge, this topic 

has not been covered in any textbooks on differential equations. Let us consider a 

system of PDEs as: 

Ly) + Ly(v) =0 (7.318) 

L4,(u)+ Ly (v) =0 (7.319) 

where the differential operator L can be defined as analogous to the second order 
PDE considered in the last section as: 

2 2 2 
L,(u)= 4,2 +28, nT og + Dy MLR u (7320) 
Ox Oxy oy Ox oy 


with i, 7 = 1,2. ous the result = the last section, we have the adjoint of (7.320) as 


a Oy eo tena 
M,(v)= VB) +3 Cy) + OD) +5 OE, + Fy 
(7.321) 
Multiplying (7.318) by uw and multiplying (7.319) by v, and adding the results of 
these, we obtain 
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uL, ;(u) +uL,7(v) + VL5,(u) + VLo5(v) = 0 (7.322) 
To consider the left hand side of (7.322), we first note the following identities, 
which can be obtained as special cases of (7.309) in the last section: 
OX) (u,u) \ oY, (u,u) 


uL, ,(u) =uM 7.323 
Ly (u) =uM, | (u) ce iy ( ) 
diheiasy — (7.324) 
6. oy 
PTs Pa ce le (7.325) 
ox ov 
i= oe (7.326) 
Ox oy 
where 
XG@w=Dy Bical aSaiy 2 (7.327) 
ox oy 
B 
¥ (wu) =(E,, a - Sy? (7.328) 
oy Ox 
Ov Ou Ov Ou 0A, OB 
X, (u,v) = A(u ve Balt. vay + (Di om 2 uv (7.329) 
B 
Y(t) = Bry (uv) 4 Cu v4 (E-Bay (7.330) 
a ax ay oy ay x 


Gu). (p  Sdar_ Bn 
oy oy Ox Oy 


B 
rw 6 a ee ey OC _ Bry, (7.332) 
Ox Ox Oy Oy oy Ox 


aia to a ee uv (7.331) 
Ox Ox 


0A, _ OBy vw? 


X,(v,v) =(D,, -— 7.333 
4(V,v) =(Dyp oe Oy ( ) 
ae OB 
¥,(v,v) = (Ep) - 22-2? 7.334 
4(V,V) = (E59 By ox v ( ) 
Adding (7.323) to (7.326) gives 
uL, ,(u) —uM, ,(u) + ul,(v)—vM,,(u) + vLy,(u) —uM,,(y) 
+L) (Vv) — VM 99 (v) = OX\(uju) , OF (uu) | OX,(u,v) | OF (u,v) (7.335) 
ox oy Ox oy 
; OX, (u,v) a OY3(u,v) re OX 4(v, Vv) i" OY, (v,Vv) 
ox oy Ox oy 


Equation (7.335) can be rewritten as 
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ul L,,(u)-—M,,(u) + Ly.(v)-M,(¥)] 


[Ly ) —M (u) + LY) — Mag (0)] = + 2 aii 
x oy 
where 
P=X,(u,u)+ X>(u,v)+ X3(u,v) + X4(v,V) (7.337) 
O=Y,(u,u)+ Y, (u,v) + Y3(u,v) + Y,(v,v) (7.338) 


Equation (7.336) can be considered as a generalization of the Lagrange identity for 
the case of a coupled system of PDEs. The system of PDEs is called self-adjoint if 


cl le (7.339) 


Therefore, if the left hand side of (7.336) is zero, the system of PDEs is self- 
adjoint. Thus, we will examine the condition that the left hand side of (7.336) is 
Zero. 

Similar to the procedure in arriving at the adjoint operator given in (7.310), 
we can easily verify that 


M0) = 1,0) +0—-(—— + =D, 0 Ey) 
ow “So (7.340) 
0A, OB; oC, OB, 
pe 7 y 
Ox Ox oy oy oy ox 
Substitution of (7.340) into the left hand side of (7.336) yields 
LHS =@ = 2u Ou (i 7 OB, Dy)—2u Ou rae 4,2 E) 
Ox Ox Oy Oy Ox oy 
afi OA, , Bu pia! OB, , a1 -E,) 
Ox Ox Oy Oy Ox Oy 
ee a eer av eda ‘. OB, Dy) —2u Ov (eae " 0C>, Ey) 
Ox Ox oy Oy Ox oy 
A 1B 1B 
wu ca he 21 _D,,)—vu Q ‘a 21 Fogo! Ey) 
x Ox Oy Oy Ox Oy 
Ou 0A, OB Ou OB, OC, 
+vL,,(u) — vl, (u) —2v A (242 D,)-2v—( ree 2 Ei») 
x Ox oy Oy Ox oy 
1A, 1B 1B 
vu au 5 12 _ D)—vu eG e 12 oe Ey) 
Ox Ox Oy Oy Ox oy 
A B. 
Ox Ox oy Oy Ox oy 
2 7 0Ap5 " OB Dy) v2 O joes + 0Cy7 Be} 
x Ox Ov Oy Ox ov 
(7.341) 


We will now consider the conditions that (7.341) becomes zero. First, we see that if 
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Aj, =A, By =By, Cy =Cy (7.342) 
we have 


ov ov 
uly7(v) —uL (Vv) =u Ae (D,y — Dy) +u ay OP E4,)+uv(Fyy —F,) (7.343) 


VLy)(u) — vl, (u) = vO, Dy) ve Ei Ey,)-uwv(Fin — Fy) (7.344) 


In addition, we find that many terms in (7.341) vanish if 


Oni oni =D an oii =F (7.345) 
a Oy ati 

Ady OB By | Ly _ (7.346) 
ax oy 2° ex oy “ 


Substitution of (7.342) to (7.346) into (7.341) gives 
ov Ov Ou Ou 
@ =u—(D,2 — Dy) tu— (En — Ey) — V— (Dyn — Da) —V — (En — En) 
Ox oy ox oy 


Qu Ov OAy, is OB, Dy) Qu Ov (Coa as OC, E>) 
Ox Ox oy oy Ox oy 
ay Ou 0A, Fs OB,2 Di) ay Ou (palo 4 OC}> Ey) 
Ox Ox oy oy Ox oy 
(7.347) 
Finally, we observe that ® in (7.347) is identically zero if 
OB, OC 
1 41, 21) = B+ By (7.348) 
0A, OB 
én, To Di + Dy (7.349) 
The conditions of (7.318) and (7.319) — self-adjoint can be summarized as 
A; =A, By =Byu, Cy =Ci (7.350) 
(A aa (Dy +D,) (7.351) 
— eS (Dit Ds : 
ox oy oo 
(Si lee j= a8, (7.352) 
ene Ber =—(E..+E., : 
ox Oy rs 


Note that self-adjoint conditions which are similar to (7.350) to (7.352) were 
obtained by Kuzmin (1963) when he considered the variational principle for 
electrodynamics fields. However, there is apparently a mistake in Kuzmin (1963), 
and there is no derivation given in Kuzmin (1963). Nevertheless, such analysis is 
related to Green’s identity and the formulation of functionals. 
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7.7 MIXED TYPE PDE 


In some situations, the behavior of the solution of a PDE may depend on the 
dependent variables of the problem. The most well-known mixed type PDE is 
probably the Tricomi equation. 


2 2 
ae, (7.353) 
Ox oy 
According to the procedure of classification given in Section 7.2.2, we have 
B? -4AC =4x (7.354) 


Therefore, we have (7.353) being hyperbolic with x > 0, parabolic for x = 0, and 
elliptic for x < 0. This is called a mixed type of second order PDE. The origin of 
this equation comes from transonic flow in 2-D gas dynamics. This equation 
occupies an important place in the development of supersonic flows from transonic 
flow in the area of rocket science in the beginning of the twentieth century. We will 
sketch the origin of this equation briefly in the following section. We will first 
review two-dimensional steady gas flow in the next section. 


7.7.1 Two-Dimensional Steady Gas Flows 


For two-dimensional steady potential flows of a gas, the Euler equation of motion 
for a compressible fluid is 
ov 
Ot 
where g is the gravitational constant, p is the pressure, and v is the velocity field. 
This is a special case of Navier-Stokes equation for fluid mechanics. For steady 
flow with negligible gravity effect, we have 


1 
+(v-V)v=-—Vp+g (7.355) 
p 


1 
(v-V)vy =-—Vp (7.356) 
fe) 
To express (7.356) in incremental form, we have 
vdv = s@ (7.357) 
fe) 
On the other hand, sound wave speed in gas is defined as: 
e= [2 (7.358) 
p 
Thus, density can be related to pressure as 
pc? =p (7.359) 
We note that (7.359) can be expressed in incremental form as: 
dp=cdp (7.360) 
Rearranging (7.357) gives 
wu =—pVv (7.361) 
dv 


Substitution of (7.360) into (7.361) results in 
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dp Vv 
—=-9—> 7.362 
ee (7.362) 
The total differential of the mass flux density along a streamline is 
d(pv) = pdv+vdp (7.363) 
It is equivalent to 
d(pv) dp v v 
=ptv—= = p(l 7.364 
dv , dv “a road A a : 


Note that for supersonic flow, we have the flow speed faster than the sound speed 

c, we have d(pv)/dv < 0 or mass flux density as a decreasing function of velocity; 

whereas, for the subsonic flow the mass flux density is an increasing function of v. 
From continuity, we have 


O(pv,) + Apvy) a 


0 (7.365) 
Ox Ox 
By introducing a velocity potential ¢, we have 
dg= La dy =v,dx +v,dy (7.366) 
Ox oy 


7.7.2 Hodograph Transformation 


The formulation expressed in velocity is not easy to solve, Molenbroek in 1890 and 
Chaplygin in 1902 applied the Legendre transform to rewrite the unknowns of the 
gas flow equation (7.366) as x and y, and the variables as the velocity components. 
More discussion of the application of the Legendre transform can be found in 
Appendix C of Chau (2013). The velocity plane is known as a hodograph plane 
versus the physical plane of x-y. This approach is known as the hodograph method 
which has been discussed in Section 6.15 for the context of first order PDEs. In 
particular, we observe that (7.366) can be written as 


dg = d(xv,)— xdv, + d(yv,)— ydv, (7.367) 
Thus, we can introduce a new function ® such that 
D=-$+xv,+y, (7.368) 
The total differential of (7.368) gives 
d® =—d¢+ xdv, + ydv, +v,dx+v,dy (7.369) 
Substitution of (7.366) into (7.369) gives 
d®@ = xdv,, + ydv,, (7.370) 


Therefore, we now have the unknown function being given in terms of the velocity 
as 


D = D(v,,V,) (7.371) 
The total differential of (7.371) is 
dD = ce dv,.+ ae dv, =xdv, + ydv, (7.372) 
Ov, ov, 


Consequently, we have 
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gue. (7373) 
Ov~,. ov, 
The variables of v, and v, can be rewritten in polar form as: 
v, =veosé, v, =vsind (7.374) 
Clearly, we have 
ve 
vev4v, tng=~ (7.375) 
: Vy 


With these new variables, applying the chain rule to (7.373) we have 


va OP _ OP ov oP 00 = cos 92? sin? 0@ (7.376) 
Ov, Ov Ov, 06 Ov, ov v 00 
_ OD _0® ov of 00 ~ singe + 099 0®@ (7.377) 
dv, Ov dv, 06 Ov, ov v 00 
By virtue of (7.376) and (7.377), the last two terms in (7.368) can be found as 
xv, + yv,, =(cosé af a Oe dass gn” + eos? OO cine 
: ov 00 Ov v 06 
(7.378) 
O@ 
= yp— 
ov 
Substitution of (7.378) into (7.368) gives 
6-645 02 (7.379) 
Ov 


7.7.3 Chaplygin’s Equation 


We now observe that the continuity equation can be expressed in terms of 
Jacobians as 


A(pv,,v) APY,.x)  A(pv,) 7 O(pv,) _ jj (7.380) 
Oxy) Ox, y) Ox oy 
Note that 
Opv, Opv, 
OPV») _| & ey |_ Avy) (7.381) 
O(x, y) oy oy Ox 
Ox oy 
Opv, Opv, 
alpry.®) [ee ay |__ (ry) wee 
O(x, y) ox ox Oy 
a& ay 


We can multiply (7.380) by 0(x,y)/ O(v,@) and in view of (7.374) to get 
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A(pv,,¥) A(x, y) OAPVy»*) (x,y) _ A(pveosO,y) O(pvsinO,x) _ 


0 (7.383) 
O(x,y) 00,8) Ax, y) AV, 8) O(v, 8) (v, 8) 
The first term on the right hand side of (7.383) can be evaluated as 
Opvcos@ <dpvcosé 
VES). ov 06 SO eg 4 apn (7.384) 
O(v, 8) oy oy ov 00 ov 
ov 00 
Differentiation of (7.377) gives 
2 2 
® _ ine? P_cosd0P | cos OP (7.385) 
év ww wv 00 v dbev 
2 P 2 
OF pp in! id ne oe coe 2 (7.386) 
00 ov Ov0od v 0@ v ee 
Similarly, the second term on the right of (7.383) can be evaluated as 
Opvsin@ dOpvsind 
ehovsin A) =| oe sO i ee (7.387) 
0(v, 6) Ox Ox ov 00 ov 
ov 00 
Differentiation of (7.376) gives 
2 : , 2 
OF aa? @D sndo® sndo@ (7.388) 
ov aw vy 06 v dG 
2 , 2 
Ox ana’ aecsa ® cosP0OP sind do@ (7.389) 
00 ov 0vod v 06 v 00" 
Finally, combining (7.384) to (7.389) with (7.383) leads to 
A(pv) dD 10°@ oD 
+ pV =0 7.390 
dv év ” a6? i av" 
Substitution of (7.364) into (7.390) yields Chaplygin’s equation: 
3 2 2 
o@D v Oo@D OP _9 (7.391) 


+ +Vv 

00? 1-v'/e? a Ov 

This is a linear PDE, and is a good approximation of transonic flow (1.e., 0.8 < 

Mach number < 1.0). For this range, the speed v is about 965 km/hour to about 

1236 km/hour. Thus, for steady state the nonlinear PDE of the Euler equation of 

motion and the continuity equation are now converted to a linear PDE through the 

use of the hodograph transformation. Once the solution of ® is solved in terms of v 

and @ (7.373) and (7.374) can be used to give the final solutions as: 

x=x(v,0,®), y=y(v,0,®) (7.392) 

Note that the validity of the Chaplygin equation given in (7.391) relies on the 
requirement of non-zero Jacobian of the following change of variables: 

u| (x,y) 1,00 d0, v OD 5 


0(v, 4) newer id 06 "jae pe or y] (7.393) 
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It is clear from (7.393) that the Jacobian can become zero if v > c (or in the 
supersonic region). Once the Jacobian is zero, continuous flow throughout the 
region is impossible and shock waves must occur in the supersonic region. This is 
the reason that the supersonic airplane Concord is always accompanied by an 
unpleasant shock wave. In addition, the sound speed is, in general, a function of 
velocity or c = c(v). 


7.7.4 Tricomi’s Equation 


We now look at an approximation of the Chaplygin equation given in (7.391) when 
the flow is transonic (i.e., v > c). In particular, the first derivative term is much 
smaller than the other two terms in (7.391): 


O@ y oD 


y— * ——___ (7.394) 
Ov 1-v/e ev’ 
Thus, we have the approximation for (7.391) as 
2 2 2 
Oo@D v OD _ (7.395) 


+ —_—_—_—_— 
00? 1-v/e* av 
At the transition from subsonic to supersonic, we passed through the transition or 


the so-called transonic flow. We have v ~c = c*, then 
2 2 2 2 
Vv 7 Cx - Cx = Cx 
l-v/c? (itv/e)d-v/e) 2d-v/e) 2ax(l—-v/ cx) 
The last part of (7.396) is the consequence of the following approximation near the 
transition from subsonic to supersonic: 


kre re eas (7.397) 
GC Cx 


(7.396) 


To prove (7.397) near the transition point v = c = c*, the value of c-c«can first be 
expanded using the first term in Taylor series expansion as: 


C—Ce Syne) (7.398) 
dv 


v=ce 
This expansion can be rewritten as: 


e-v=(c—v)fl (Dyno (7.399) 


Since the sound speed is a function of density or c = c(p), we can express it using 
the chain rule as: 
dc_dedp_ pdc 
dv dp dv cdp 
The last part of (7.400) is a result of the following special form of (7.362) near the 
transition point: 


(7.400) 


ap __P (7.401) 
dv Cc 


Substitution of (7.400) into (7.399) gives 
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p sp = d(pc) 
cdp é dp 


c-v=(c-v)[l+ = Ox(C+ —V) (7.402) 


where 


2 

ge gD FT ae (7.403) 
c do dp 2 Op” 

In obtaining the second part of (7.403), we have used (7.360). The last part of 

(7.403) can be found by considering the following relation for sound speed (see 

(7.358)): 


Ax 


a) 
pe= pi (7.404) 
op 
The definition of the sound wave can be written in terms of volume per unit mass V 
instead of density p. More specifically, its increment is related dp to 


i dv 
dp =d(—) =-— 7.405 
pO (7.405) 
Substitution of (7.405) into (7.404) gives 
7.406 
HF) 4 — 
Op Op 


Differentiation of (7.406) with respect to p gives 
d(pc d(pc)_1 cc  @V._1 av 
(pc) _ 02 (pc) _ ( = prc >) (7.407) 
dp dp 2 (-2 37 Op 2 op 
Op 


This particular form applies to the case of constant entropy. Using (7.407), the 
second part of (7.403) can be obtained without difficulty. Combining (7.396), 
(7.395), and (7.403) gives 


aD e aD 


— + - (7.408) 
00? 2ax(1-v/ cx) dv? 
Introduction of the following change of variables 
n= (2ax)!3(-—*) (7.409) 
Ce 
gives 
2 2 2/3 
6 ’ _9o 2 = (7.410) 
ov on Ce 
Substitution of (7.410) into (7.408) gives the Tricomi equation: 
eo ao 
—~ -7— =0 (7.411 
én? 00" 


Therefore, the Tricomi equation is an approximation of the Chaplygin equation 
when the speed is very close to the supersonic speed. The general solution of the 
Tricomi equation is considered next. 
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7.7.5 Solution of Tricomi’s Equation 


Using the classification for second order ODEs, we have for (7.411) 

B’ -4AC =4n >0 (7.412) 
for supersonic flow (i.e., v> c*). Therefore, the Tricomi equation is of hyperbolic 
type. The characteristics are 


dO 0€/0n_ al ~4AC ae 


(7.413) 
dn 0€/ 00 
dO 0E/00_ B- come a (7.414) 
dn  0¢/én 
Integrating both of these, we get the two characteristics as 
§= 0-29" =C, (7.415) 
6 =0459% =C, (7.416) 


Instead of reducing it to the canonical form, we observe that the Tricomi equation 
remains unchanged if we make the following substitution: 


0 >a’, ne >ar (7.417) 
This suggests that we can make the following ee - variables: 
D=6"f(6), §=1-—Z (7.418) 
38 
Differentiation of (7.418) with respect to 6 gives 
od 
oF = 2467" ¢(é) +07" f" (2) 55 “= (7.419) 
Application of the chain rule to (7.418) gives 
GE 87 2 
pele el ee 7.420 
ap On eB (l-¢) ( ) 
Substitution of (7.420) into (7.419) gives 
oe) — ' 
59 720 UE) +0-H/'O) (7.421) 
Differentiation of (7.421) one more time gives 
oo @D " 
ape nO PRE -DFE) +O SBOP) +4—<EP f"E)} (7.422) 


We now turn to the first term in (7.411). In particular, differentiation of (7.418) 
with respect to n gives 


O@ _ poke pin OS _ pak an? 
an 0 IOs -0 ve — ai he) (7.423) 


Application of the chain rule to (7.43) gives 
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2 
05-47 -_30-2 (7.424) 
On = 30 1 
Differentiating (7.424) one more time and using (7.418), we find 
ao eBoy ; 
ae aS E)+4d-SF (S)} (7.425) 
7] 


Substitution of (7.422) and (7.425) into (7.411) gives 
5U-8)f"6) +12 -2k-E5 -WIs'O-Kk-IO =0 (7.426) 


Recall the hypergeometric equation from (4.560) of Chapter 4 that 
6-2) f"@) +[y-(a+ B+DE1/'(O)-aB fe) =0 (7.427) 
Thus, comparison of (7.426) and (7.427) gives 


1 5 
=-k, B=—-k, y=—-2k 7.428 
a p=, oo (7.428) 


Therefore, according to Chapter 4 the solution of (7.426) can be expressed in terms 
of hypergeometric functions, and using (7.418), the final solution of the Tricomi 
equation given in (7.411) is 


3 
D = 07" AF( ks k,2 2k alli 
feet “0 ; (7.429) 
An? 2k+— i... 3 7. 4n 
+B(I-—5) © F(k+—,k+—,2k+—;1 
: 90” : 6 3 6 99?) 


Note from Chapter 4 that we can use the bilinear transformation given in (4.610) to 
convert the solution given in (7.429) into five other different forms. In particular, 
we can use (4.599) to transform the solution to the form (see Table 4.4 second row 
for singular point near € = 1) 
y=C1-6)"h, +C,0-6)"P, (7.430) 
We further pick solution Numbers | and 5 in Table 4.7 to get 
1 2 47 
2°3° 96? 
3 
+B TT F( pee. je 
0 6 3°3°9¢? 
This result is obtained by noting that 


~- 4 
a-é)"** =0-8)'" = Go ga (7.432) 
Following Landau and Lifshitz (1987), we give one more different form of (7.429) 
by using the following transformation: 


1 
& =—— (7.433) 
l-u 
which is for the solution near the singular point of § — » (see Chapter 4). For this 
case, we can pick solution Numbers 3 and 7 from Table 4.8 as 


@ = 0"* {AF (-k,-k + 


(7.431) 


yy 
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2 
@ = {AF(-k,-k+ 2 a5 
3-2 4n3 
0 1 5 3.90? ae 
+B ag OE oe 
n 2 6 2 4n 
This is obtained by noting the following: 
7 90 4 4,7" 
g-I* =) =e) ot (7.435) 
p= 4? pan, 4,k2 8 4 
ECT CS) Tar Ge) (7.436) 


More importantly, we see from (7.431) that ® can be expanded in integral powers 
of 7 near n = 0, and (7.434) shows that ® can be expanded in integral powers of 0 
near 9 = 0. That is, the lines n = 0 and 0 = 0 are not singular lines because power 
expansion of infinite order exists. Equation (7.429) shows that the characteristics 
are singular lines if 2k+1/6 is not an integer, and the factor (90?—-4n3)"""" in 
(7.429) has branch points; whilst if 2k+1/6 is an integer, the solution is degenerate 
for 2k+1/6 = 0 and as shown in Chapter 4 that a second independent solution has 
logarithmic singularity (see cases 2 and 3 of Section 4.5). For more discussion of 
the solution near the lines n = 0 and 0 = 0, the readers are referred to Section 118 
of Landau and Lifshitz (1987). Physically, this means that a continuous finite 
solution for the solution ceases to exist on the characteristics when the gas is 
moving supersonically. This singular solution suggests that a shock wave with 
jumps exists in a supersonic gas flow. Therefore, the study of the Tricomi equation 
is of fundamental importance in investigating supersonic flows around an object. 
Equivalently, to rephrase the same phenomenon in a reverse order, the Tricomi 
equation is of utmost importance in studying supersonic flight of aircraft (e.g., 
Mach number larger than 1). This is consistent with the so-called Le Roux- 
Delassus theorem that any singular surface of a solution of a linear differential 
equation must be characteristics. Thus, the jump character of the characteristics 
indicates shock waves. 

Actually, the solution of the Tricomi equation can also be expressed in 
Fourier expansion as: 


D=u(nye”? (7.437) 
Substitution of (7.437) into (7.411) leads to 
u"+nv-u=0 (7.438) 


This is an Airy equation and can be solved in terms of Bessel functions. By 
following the transformation proposed in Problem 4.52 in Chapter 4, we have 


iv 2 2 
D =./ne* “Adis vip?) +BY, os vip?) (7.439) 


As a final note, Landau and Lifshitz (1987) was first translated to English from 
Russian in 1958. Landau obtained a Nobel Prize in physics in 1962 for his work on 
liquid helium, whereas Lifshitz was a student of Landau and recipient of Landau 
Prize, USSR State Prize, Lomonosov Prize, and Lenin Prize. Fluid Mechanics by 
Landau and Lifshitz (1987) is volume six of the ten-volume book series Course of 
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Theoretical Physics by Landau and Lifshitz. This famous book series has been 
translated into six different languages (some individual volumes to more than 10 
languages). 


7.8 RIEMANN’S INTEGRAL FOR HYPERBOLIC PDE 


For the hyperbolic equation, the following problem is known as the Goursat 
problem: 


2 
a F(x, a (7.440) 
Oxdy Ox Oy 
where 0 <x, y < 1, and the boundary conditions are 
uO, y)=(y), uaxD=y(x), eM=y(0) (7.441) 


This problem is also known as the Darboux problem. There is a related problem 
called the Cauchy problem that also involves the first derivative of u: 


7 Ou dx Ou dy 442 

u(x(t), y(t) = a(t), au dt oy Bt) (7.442) 
Note that the boundary conditions are given in terms of a parameter ¢. We will not 
consider the so-called Cauchy problem here in detail. Note also that there are many 
different usages of the term “Cauchy problem” in the literature. In general, the term 
Cauchy problem is reserved for hyperbolic PDEs (or wave type solutions), instead 
of elliptic PDEs (or equilibrium type solutions). In particular, the so-called Cauchy 
problem is normally defined for an initial value problem not for a boundary value 
problem. 

In this section, we discuss a method derived by Riemann in 1860 for two- 
dimensional space. This work by Riemann was not known until du Bois-Reymond 
noticed its importance in 1864 and was subsequently publicized by Darboux (this is 
the reason that the Goursat problem was sometimes referred as Darboux problem). 
We consider a special case of Goursat problem given in (7.440): 

2 

Og ge ea Fah (7.443) 
Oxy ox oy 

This problem is clearly a linear form of the Goursat problem given in (7.440). The 
triangular domain of the problem is given in Figure 7.5. Boundary values of u and v 
and their first derivatives are given on the curve S, which is not a characteristics of 
the hyperbolic problem. The characteristics are assumed to be given as x = constant 
and y = constant. Riemann’s method was originally proposed for gas dynamics and 
only a particular form of (7.443) was considered. The general form given in (7.443) 
was actually considered by Hadamard. 

The associated adjoint problem is 

av 8 é 
G(v) (Av) (By) +Cv=0 (7.444) 
OxOy Ox oy 
where v is not only a function of x, y but also a function of the position of a (1.e., xo, 


yo) 


F(u)= 


V=V(X,¥; Xo, Vo) (7.445) 
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Then, Riemann considered the function 


BX) 


Figure 7.5 Domain of Riemann method 


2 
vF(u) =v oe + Av ONS Cvu (7.446) 
Oxdy Ox oy 
a’v Ou Ou 
uG(v) =u —u—(Av)-u—(By) + Cuv = 0 (7.447) 
Oxdy x oy 
Subtracting (7.447) from (7.446), we find 
2 2 
vF(u)—uG(v) =v om re ia + Zam) +< (Bu) (7.448) 
Oxdy ” Beay Ox 
The right hand side of (7. = can ee rearranged as 
Ce ya ee is) 
ae” oy Ox 2 5 "oy Oy 2° Ox Ox 
i a (7.449) 
=v aM » Shin +5 (Bw) 
OxOy Boy "Oe 
where 
P= Bg On ie (7.450) 
2 ay wy 
1, Ou Ov 
=—(v—-u—)+B 7.451 
g 2 0 Ox " ax _ ( ) 


Equating (7.448) and (7.449) and integrating over the domain shown in Figure 7.5, 
we have 
oP od 


[[orw- uG(v) dedy = IJ 


The line integral going anti-clockwise as ae in Figure 7.5 is taken as positive. 
Substituting (7.444) and (7.443) into (7.452), we get 


{| vfdedy = ie (Pdy — Qdx) y., (Pdy — Qdx) +] (Pdy—Qdx) (7.453) 


my lay = p. (Pdy—Qdx) (7.452) 


456 Theory of Differential Equations in Engineering and Mechanics 


Since the boundary values of a@f have been given, we only need to consider the line 
integrals along aa and fa as: 


) _(Pdy- Odx) = - | oar | . 5 “ (uv) +u(Bv— Syd 


‘ ; 3 (7.454) 
Vv 
= > (uv), > (uv), I. u( By - Ae 
re) ov 
(Pdy -Qdx)=-| Pdy=-| [=—(uv)+u(Av-—)]dy 
ie lL. I. 2 oy (7.455) 


Z (ur) (Ww), ftv Sas 


where the subscripts a, £, a indicate the evaluation of the function at these points. 
Note that the direction cosines between the unit normal n and the characteristic 
lines aa and fa are both negative one. Now the boundary condition of the adjoint 
problem given in (7.444) can be chosen such that 


G(v) ov 6 (ai= “(Bye Cese (7.456) 
OxOy Ox oy 
need on af (7.457) 
Ox 
ov 
Av-—=0 on aa (7.458) 
oy 


Note that the last term in (7.454) and (7.455) becomes zero by such a choice for 
function v. The adjoint problem is introduced that (7.452) is valid. As we will see 
from the next chapter (7.452) is actually a generalized form of Green’s theorem 
(the original Green’s theorem is only for Laplace equation as in (8.37) given in 
Chapter 8). Equivalently, we have 
2 
Cia 8 tap aed (7.459) 
OxOy Ox oy 


J 8G.xoag 
V(x, ¥93 Xo. Vo) =e” on y= (7.460) 
fF 4G0.gae 
V(X9.V3Xo>Vo) =e a1) on X=Xp (7.461) 
V(a) = V(X, V93 Xo». Vo) =1 (7.462) 
This function v is thus called Riemann’s function. Substitution of (7.454) and 
(7.455) into (7.453) gives 


u(a) = 5g r 5(w) " -{f vfdxdy +) (Pdy — Odx) (7.463) 


This is the solution obtained by Riemann for the hyperbolic problem as long as the 
boundary data were not given on characteristics (i.e., S is not a characteristics). 
Note that Riemann’s function is not symmetric with respect to x, y and xo, yo: 


V(X, V3 X95 Vo) # V(X» Vos XV) (7.464) 
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The equality in (7.464) only holds for the self-adjoint problem, but the original 
problem is not symmetric for general functions 4 and B. 

This technique of using the Riemann function to solve the original hyperbolic 
PDE is called the Riemann method. The hyperbolic equation does not admit 
isolated singularities, and every singularity is continued along the characteristics. 
The key idea of Riemann’s method is to reduce the BVP along the curved S to the 
BVP along two characteristics. 


7.9 BIHARMONIC EQUATION 


The biharmonic equation was found important in engineering applications. The 
biharmonic equation was obtained in the theory of thin plate bending, two- 
dimensional elastic stress analysis using Airy’s stress function, two-dimensional 
highly viscous flow, and three-dimensional stress analysis of elastic solids. The 
analogy between plate bending and low Reynolds number viscous flow was noticed 
by Rayleigh in 1893, Sommerfeld in 1904, and Lamb in 1906. The development of 
theories for solving the biharmonic equation indeed closely relates to the analyses 
of Airy’s stress function and plate bending. In 1892, Russian engineer Krylov 
visited Paris and posed the problem of the biharmonic equation to Hermite, and he 
related this problem to his son-in-law, Picard, another famous French 
mathematician. Consequently, the problem of solving the biharmonic equation in a 
rectangular domain was posed in Prix Vaillant in 1907 with prize money of 4000 
francs. The initial judge panel included Poincare, Picard, and Painlevé and they 
wrote reports on the 12 submissions. The reports were then submitted to an 
authority commission including, Jordan, Appell, Humbert, Levy, Darboux, and 
Boussinesq. There were 4 winners who shared the prize money, and they are J. 
Hadamard, A. Korn, G Lauricella and T. Boggio. As a side note, W. Ritz (a student 
of David Hilbert) also submitted his paper but was “reported” missing and was not 
eventually awarded. It turns out that Ritz’s analysis using a variational formulation 
related to plate bending was most influential for later development (see Chapter 
14). It eventually becomes the focus of Krylov, who is the initiator of such analysis. 
Nevertheless, the biharmonic equation is clearly one of the most important PDEs 
higher than second order. 

There are many famous mathematicians contributing to the development of 
theories for solving the biharmonic equation, including famous names like Airy, 
Maxwell, Clebsch, Ritz, Kirchhoff, Papkovich, Poisson, Sophie Germain, Love, 
Koialovich, Krylov, Navier, Boussinesq, Rayleigh, Sommerfeld, Galerkin, Dougall, 
Filon, Muskhelishvili, Levy, Michell, Pickett, and Timoshenko. Readers are 
referred to the excellent and comprehensive review article by Meleshko (2003) on 
the biharmonic equation. 

The biharmonic equation can be expressed as a repeated application of the 
Laplacian operator as: 


(VeV\(VFV)u =V?V7u =0 (7.465) 
In 3-D Cartesian coordinates, it can be expressed as 
Ce P Ou Ou au 


Viu= + + + + 
Ge ay? Gz" ax” ay?” 


)=0 (7.466) 
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More explicitly, it can be expressed as 
4 4 4 2 2 2 
ee A ee (7.467) 
ox” Oy” az Ox“ Oy Ox" Oz oz“ Oy 
In 2-D Cartesian coordinates, it is reduced to 
OO wl OOM Oe OR 


ViViu =Viu=(—+— (> + — ue = + (7.468) 
ne art aya aya aay? ay4 
In cylindrical polar coordinates, we have 
x=rcos@, y=rsinO, z=z (7.469) 


The corresponding biharmonic equation becomes 
eo. to oe te. de oe 
Viu= + + + +—~)u=0 (7.470 
rae Pak a at re Fae a ie 


r = fe +y? (7.471) 


In spherical polar coordinates, we have 
x=Rsingcos@, y=Rsingsind, z=Rcos@e (7.472) 


where 


The biharmonic equation becomes 
ao 204 1 @ cotg 1 a 
a ar ee are Lar re 7) 
OR’ ROR R° dp R’ 0p R*sin’ g 00 
a 204 1 @ cotg 0 1 oe 
at ee a8 af ¥ 8 2 
OR’ ROR R° dp R Op R*sin’ 9 00 


R= Je? 4 ye (7.474) 


Viu = ( 
(7.473) 


ju =0 


where 


7.9.1 Plane Elastic Stress Analysis for Solids 


For two-dimensional solids under plane stress or plane strain conditions, the stress 
analysis can be formulated in Airy’s stress function gas: 


V49 =-2 bv (7.475) 
k+1 
where V is the potential of body force and x is defined as 
= cla plane stress 
l+v’ (7.476) 
=3-4, plane strain 


The proof of (7.475) can be found in Chau (2013) and will not be reported here. 
This is a nonhomogeneous biharmonic equation. In Cartesian coordinates, the 
stress components are defined in terms of @ 
2 2 2 
é 
gga a7, Ps O y=: (7.477) 
“Ox Ovox 
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In polar coordinates, the stress components are defined in terms of @ 


ay 0 109 log 1a@ 
=— 4+V,o04=-—(-—), o,7=-—+5-54+V (7478 
ee Or? aoe or r 0d - ror 7? ag? ( ) 
If body force is negligible, we recover the biharmonic equation 
V‘o =0 (7.479) 


The main difficulty encountered in solving Airy’s stress function appears in 
satisfying the boundary condition. 


7.9.2 Three-Dimensional Elastic Stress Analysis for Solids 


For three-dimensional stress analysis in elastic solids, there are various 
formulations that can be used. They are classified into stress formulation and 
displacement formulation. The stress formulations include a Beltrami stress 
function, Maxwell stress function, and Morera stress function, while the 
displacement formulation includes Helmholtz decomposition, Lame strain potential 
for incompressible solids, the Galerkin vector, Love displacement potential for a 
cylindrical body, and Papkovitch-Neuber displacement potential (Chau, 2013). 
Among the displacement formulation, Helmholtz decomposition leads to one scalar 
biharmonic and one vector biharmonic equation as: 

Vi¢=0, Viw =0 (7.480) 
where the divergence of the vector y is zero (i.e., V-w = 0). The displacement 
vector can be found as 

u=Vo+Vxy (7.481) 
Alternatively, displacement of elastic solids can also be expressed in terms of a 
single vector called the Galerkin vector G. The governing equation for it is also a 
vector biharmonic equation 


V'G=0 (7.482) 
For this case, the displacement vector is 
u=2(1-v)V’7G-VV°G (7.483) 


For problems with axial symmetry, only one component of the Galerkin vector is 
needed, and it is known as Love’s displacement potential. In particular, we have 


V‘G, =0 (7.484) 
The displacement vector becomes 
Di 2 2 
u=| (1—-2v)V? ae G,e, aes e, Lee ey (7.485) 
az? Oréz r 000z 


Thus, it can be seen that the biharmonic equation is closely related to three- 
dimensional stress analysis. The proofs for these formulations can be found in 
Chapter 4 of Chau (2013). 
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7.9.3 Bending of Thin Plates 


The development of the theory for thin plate bending has a long history. Many 
great mathematicians were involved in its development, like Lagrange, Sophie 
Germain, Navier, Poisson, Kirchhoff, and Levy. The deflection of a thin plate is 
governed by the following nonhomogeneous biharmonic equation: 


V2V2w = Viw= ee (7.486) 


where the 2-D Laplacian is defined in (7.468) and p is the distributed loading on 
the plate. The plate constant is defined as 
3 
Be = 
12(1-v*) 
where h is the thickness of the plate, E is the Young’s modulus, and v is Poisson’s 
ratio. In polar form, plate bending deflection is given by 
2 2 2 2 
1 1 1 1 
viwa(& ce 3 = a eae 3 o w=2 
ore ror r° 00° or ror r 00 D 
Bending of a thin plate finds applications in ship, aircraft, and building design. The 
proof of (7.488) can be found in the standard reference book by Timoshenko and 
Woinowsky-Krieger (1959). 


(7.487) 


(7.488) 


7.9.4 Two-Dimensional Viscous Flow with Low Reynolds Number 


When flow velocity is small in a highly viscous incompressible flow, Lamb (1932) 
showed that the stream function satisfies the biharmonic equation if the viscous 
effect is much larger than the inertia effect (see p. 607 of Lamb, 1932). This is also 
known as Stokes flow in the literature. In particular, the stream function yw satisfies 
the biharmonic equation: 


Viw =0 (7.489) 
where y is defined in terms of the velocity components uw and v as 
ed a (7.490) 
Ox oy 


We sketch briefly the proof of (7.489) here. First, let us recall from (2.121) the 
material derivative as 


D u| 
u_ ou | | )—uxVxu (7.491) 
Dt Ot 2 
This can be rewritten as: 
D 1 
oe ial -2ux@ (7.492) 
Dt oat 2 


where @ is the vorticity of the flow and defined as: 


w= 5Vxu (7.493) 
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Physically, it is the angular velocity of the fluid element that would rotate if it were 
suddenly solidified. If the motion of the fluid is driven by an external force F and a 
pressure difference, the equation of motion becomes 

D 1 

<4 = F-—Vp (7.494) 


where p and p are the fluid pressure and fluid density respectively. Combining 
(7.492) and (7.494) gives 
Ou 
Ot 
Assuming the existence of an external force potential, we can rewrite F as 
F=-VWV (7.496) 
Next, we assume the flow is incompressible (p = constant), or 


1 1 
+5V (ul 2ux@—-F+—Vp=0 (7.495) 
p 


1 
BH axa + Vo ful +¥ +4 py =0 (7.497) 
Ot 2 a) 


If the viscosity in fluid is nonzero, the fluid pressure is no longer normal to the 
surface. Thus, we can add viscosity effect as 


1 i 
BM yx a+ Vill +7 +4 p)+ 2 x@=0 (7.498) 
Ot 2 a) 


where v is the kinematic viscosity and is defined as the coefficient of viscosity u 
divided by the density p as 


v= (7.499) 


DIS 


Next, we recall a vector identity that 
V-u=V(V-u)-2Vx@ (7.500) 
The proof of this identity will be left as an exercise for readers (see Problem 7.12). 


Incompressibility also implies the trace of velocity is zero, or the first term on the 
right of (7.500) is zero. Substitution of (7.500) into (7.498) gives 


OH oy xa + Valu +V +p) =0 (7.501) 
Ot 2 fa) 
Now consider a special case that 
u=(u,v,0), @=(0,0,¢) (7.502) 
Then, (7.501) gives two equations 
aa Qvo+ OO int (7.503) 
Ot Ox 
OY oh ae (7.504) 
Ot oy 


where © can be identified easily from (7.498). It is not given here as we will 
eliminate it. Elimination of 2 from (7.503) and (7.504) gives 


as i ye =vViC (7.505) 
ot Ox oy 


In obtaining (7.505) we have used the definition of @ given in (7.493) to get 
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1 ov 
=—(—-— 7.506 
c= 5 ( rs =) ( ) 
Finally, we now introduce a stream function defined as 
oe ae” (7.507) 
oy Ox 
We note that this stream function automatically satisfies the incompressibility: 
2 
Go. (7.508) 
ox Oy aay Oyox 
Substitution of (7.507) into (7.506) gives 
1 
a= rag (7.509) 
Finally, combining (7.505), (7.507), and (7.509) gives 
é Ow 0 Owod._, 4 
—Viwt+ Viw=vVv 7.510 
a ay By ae YY sea 


This is a nonlinear PDE and not easy to solve. However, for slow moving fluid 
(such that y is small) with high viscosity v, the terms on the left hand side of 
(7.510) are negligible compared to that on the right, and we finally obtain a 
biharmonic equation 


Viy =0 (7.511) 
1 


7.9.5 Uniqueness of the Solution of Biharmonic Equation 


In this section, we will consider the uniqueness of the solution of the following 
two-dimensional biharmonic equation with boundary conditions 


Ofu Ofu ou 


Vie (7.512) 
af aay? OF 
Ou 
ul. =g(9), ie h(s) (7.513) 
T 


where s is the tangential coordinate along the boundary J of the two-dimensional 
domain. The following uniqueness proof was presented by Hua (2009, 2012). 
Suppose that uw; and wz are two distinct solutions of the systems (7.512) and 
(7.513). We define a new function: 


V=u,-U, (7.514) 
Taking the biharmonic operator to (7.514) gives 
Viv=V*u, —V'u, =0 (7.515) 
The corresponding boundary conditions for v can be found as 
V|-= 4/4), =e) -g(s) =0 (7.516) 
BY ee) =h(s)—-h(s)=0 (7.517) 
On|; On|, On|, 
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Now, we recall Green’s theorem (or called Green’s second identity) for the 
Laplacian operator: 


Wro-wv'yda= [wv o2-p as (7.518) 
Q rT On On 


The proof of this theorem will be given in the next chapter (see (8.37)). Let us 
make the following identification for y and @: 


p=v, w=V’v (7.519) 
With this identification, (7.518) becomes 
{fy (V2WV2y—-vW2V2n)dQ = {I (Vey eas (7.520) 
Q r On On 


The second term on the left hand side is identically zero, and both terms on the 
right hand side are zeros in view of (7.516) and (7.517). Thus, we have 


[Jv 420 (7.521) 
Therefore, we must have 
V’v=0 (7.522) 
with 
Ov 
y.=0, —| =0 (7.523) 
On|, 


From the extremal principle to be derived in Section 9.7.7 of Chapter 9 for the 
Laplace equation, the maximum and minimum of v can only appear on the 
boundary. Thus, we must have v be identically zero. Then, (7.514) leads to 

Uy =U, (7.524) 


This contradicts with our assumption that they are distinct solutions. Thus, the 
solution of biharmonic equation with boundary condition (7.497) must be unique. 

Note the special case of g = h = 0, the same result was derived by Fugdele 
(1981) (see Footnote | on p. 450 of Fugdele, 1981). 


7.9.6 Biharmonic Functions and Almansi Theorems 


For the two-dimensional polar form, the general solution can be expressed as 
(Chau, 2013; Meleshko, 2003): 


g(r, ¢) = Agr? Inrg+ By lnrg@+Corlnrdcosd+ Dor lnrdsing 
+(A+ Br)In() +Cp?+D+(E+ Fr’)¢+rd(Gcoosd+H sing) 


= : 52 
+ air + Bu! + Cr In) Ecos 9+ Fy sing) nae) 


00 
+) [Aur ae By tr Cur” + Dnr oR, cos ng +Fy sin np) 
n=2 


For two-dimensional Cartesian coordinates, the general solution of the biharmonic 
equation can be expressed in two analytic functions @ and ¥ (Chau, 2013): 
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p(x, y) = Re[ZG(z) + 7(2)] (7.526) 
where the complex variable z is defined as 
z=x+iy (7.527) 


Alternatively, Chau (2013) mentioned two Almansi theorems in generating 
biharmonic functions in terms of harmonic functions. As discussed by Chau (2013) 
in Sections 4.5—4.7 in his book, there are infinite numbers of harmonic functions 
(i.e., solution of Laplace equation), and, thus, there are also infinite biharmonic 
functions. According to Meleshko (2003), Boussinesq actually derived the so- 
called Almansi theorem independently in 1885 before Almansi did in 1896. 
However, the Boussinesq name was not associated with these theorems. The proofs 
of Almansi theorems were given in Fung (1965). In this section, we will provide a 
simpler version of proofs given by Hua (2009, 2012). 

The first Almansi theorem states that the following function is biharmonic 
U = XU +Uy (7.528) 
where uw; and uz are both harmonic. To prove this theorem, let us assume that wu is 
biharmonic and wz is harmonic, and we want to show the following 


Vu, =0 (7.529) 
Since wz is assumed as harmonic, from (7.528) we must have 
V?(u—xu,) =0 (7.530) 
Equivalently, it can be expressed as 
Ful Gu =i +o 4 Fy 9 Om (7.531) 
ox a? ay’ Ox 


In obtaining the last part of (7.531), we have employed the fact that uw; is harmonic. 
First, we observe that a solution of (7.531) is 


uray) =[ SVulE. yids (7.532) 


The validity of this solution can be proved by using Leibniz’s rule of differentiation 
under the integral sign. Let us take the Laplacian of (7.531); we get 

by ygeiy? M2 Ps 6 (7.533) 
2 Ox = =Ox 
In getting the last part of (7.533), we observe that u is biharmonic. This leads to the 
fact that 


Vu, =v(y) (7.534) 
Clearly, we can define another solution of w; as: 
Ou, 
Vu = =-Wy) (7.535) 
oy 
Then, we can select wu as: 
My = +uy (7.536) 


Then, it is obvious that 
Vu, =V"u; +V7u, =-v(y)+v(y) =0 (7.537) 
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This completes the proof. 
The second Almansi theorem states that the following function is biharmonic 


u=(r° -7 Ju, +uy (7.538) 
where uw; and wz are both harmonic. For u is biharmonic and wz is harmonic, we 


want to prove that uw; is also harmonic. To prove this, we first recall the identity 
(Eq. (1.43) of Chau, 2013): 


V*(wo) = Vv +UVY)(Vo)+Wo 
= Vy +yV9+2 90 OY oy , Op oy (7539) 


Ox Ox ey Oy 
Taking the Laplacian of (7.538) and applying (7.539), we have 


Vu = V7 [(r? = 1 uy tu] = V(r? — 19 ay] 
Ou, Or? at Or? 


2 2)—52 27,2 2 
=(r° -—1H )V*u, +u,V° * —7H )+2 7.540 
(ro =19 Vu tuHV (rH) Vs ae ay ay ' ) 
=4u, + wae a i is 
oy 
In obtaining (7.540), we have a the following identity: 
Vr? =4 (7.541) 
Taking the Laplacian of (7.540), we get 
V?V7u =4V"u, 4 aves mE Viv “ y)=0 (7.542) 
y 
It is straightforward to show that 
du aud 
V2 (x—) = 2—F +x (Vu 7.543 
(= 23 (0m) (7.543) 
eu au a 
V?(y—)=2—4 a (7.544) 
Substitution of (7.543) and (7.544) into (7.542) 
12V*u, +4(x a4 y e Vu, =0 (7.545) 
ox ~ oy 
Thus, we finally get 
Vu, =0 (7.546) 


This completes the proof. 


7.9.7 Solution of Circular Domain 


For the two-dimensional unit circular domain, the biharmonic equation becomes 
e 1a i # yf 18, 1 & 
Or rar PAP a ror rae 
subjected to the following boundary conditions: 


Viu =( —)u = (7.547) 
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u|,_, = 8), = =h(6) (7.548) 
r=l 


Recall from the second Almansi theorem that the biharmonic function can be 
expressed in terms of harmonic functions wu; and u2 as 


u=(r? —1)u, +, (7.549) 
Applying boundary conditions (7.548) to (7.549) gives 
Ou 
uy|,_, = 8(9), =[2u, +—*},_, =A(6) (7.550) 
OF |, or 


For a unit circular domain, the harmonic function subject to the boundary condition 
given in the first equation of (7.550), there is a well-known Poisson integral 
formula for the Laplace equation (details of this formula will be discussed in 
Section 9.7.6 of Chapter 9): 


2 
u,(r,0) = =| Oe) ay (7.551) 
a1+r? —2rcos(@-y) 
The most important step is to show that the following function is harmonic 
V? (2m, jp Oy a9? (yp Oy 6 (7.552) 
or or 
To show the validity of (7.552), we have 
2 2 3 
a ar ny ie pp Oe (7.553) 
or or or or 
2 
1 aL _ 1% Uy (7.554) 
ror or r or @r 
10 Ow. 10 Au 
m2 rv a >) (7.555) 
00 r or 00 
Combining (7.553) to (7.555), we get re Laplacian of the second term in (7.552) as 
2 3 2 
Voom a3o po 2 _1 OM ey che (7.556) 
or a? arr Or: =r Or ae’ 
Next, we find that 
3 2 ae 2 
pee 2 pen te 1 Ou, =a, (7.557) 
or or =r Or ae? or? or Or +r @@ 
It is because uw is a harmonic function. Therefore, we have 
3 2 2 2 
0% 16 Oy O'un Lou, 20, (7.558) 
arr Or ae? ar? or Grr ae? 
Substitution of this result into (7.556) gives 
eu Oty. Vey. 1 Ou 
V7(r—2) =2 tt 2 2) =2V7u, =0 7.559 
(r a" 7 7 Ee r Per, 2 ( ) 


This completes the aot of nes Then, we can apply the Poisson integral formula 
for the Laplacian again: 
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7 2 
ee a ae | e EM) ay (7.560) 
Or = 2n4-11+r° —2rcos(O-y) 
Differentiation of (7.551) with respect to r gives 
Ou 1+1r7)cos(O- 2r 
2a J fT Cr ees WO gyn (7.561) 
or 2 l+r? —2rcos(O- vy) 
Substitution of (7.561) into (7.560) gives 
, r [" (1+r*)cos(@—y) —2r 1 [" (l—r*)h(y) 
=_-_ee —-. sO —— 
1 
20 


g(w)dy +— 
-z 1+r?—2rcos(O-y) An J-71+r? —2rcos(@-y) 
(7.562) 


Finally, substitution of (7.551) and (7.562) into (7.549) gives 
cae [" hy) F [" reo Wile) 4 
{ ; + V3 
2 2¢-71+r° —2rcos(0-w) [l+r? —2rcos(6-w)]° 
(7.563) 


Note that this formula was derived in Hua (2009). The kernel functions in Hua’s 
integral are illustrated in Figure 7.6. 


[l-rcos(@-y)] 
4 -[l+ r? —2r cos(@ -y)P 


3 (l-r’) 
+r? —2rcos(O—y) 1+r? —2rcos(O-y) 
2 
1 
0 
0 0.5 z a 1.57 20 


Figure 7.6 Kernel function for Hua’s integral 


This shows that the solution at any point inside the circular domain are functions 
of the weighted average of the boundary values A(y) and g(w). Actually, this four- 
volume series by Hua was originally published in 1962 (Volumes 1 and 2), 1978 
(Volume 3) and 1981 (Volume 4). The 2009 edition was a reprint celebrating the 
100th anniversary of Professor Hua Loo-Keng’s birthday. This 2009 edition has been 
translated into English (Hua, 2012). The same result given in (7.563) has been re- 
derived by Dong et al. (2005) using the result of Zheng and Zheng (2000). Neither of 
them cited the book by Hua (earlier versions), and their derivations are more 
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complicated than the analysis given here. It can be seen that Hua’s recognition of the 
validity and usefulness of (7.538) simplifies the analysis tremendously. As a side note, 
we should mention that Hua Loo-Keng is the most influential mathematician of 
modern China. He did not have a formal education, and he is an excellent self-learner. 
A full biography is given at the end of this book. 


7.10 SECOND ORDER PDE WITH NON-CONSTANT COEFFICIENTS 


The main contributor for general nonlinear second order PDEs is Gaspard Monge. 
For PDEs of second and higher order, we normally use the following notations for the 
first and second derivatives: 

ee a a a a 

pal F, galt, tee, paS, ged (7.564) 

ox OxOy oy ox oy 
Some second order PDEs can be integrated immediately by inspection. Here are 
some examples. For the case of two variables, a general PDE of the second order 
can be expressed as: 


F(x, ¥,2,1,55t, Pq) =9 (7.565) 
This PDE is in general nonlinear and very difficult to solve. In the following 
sections, we will consider some particular forms for which analytic methods can be 
used to find the solution. 
First, we will look at some linear PDEs that can be solved by inspection. The 
most general linear form is given by 
Rr+Ss+Tt+Pp+Oq+Zz=F (7.566) 


where the coefficients R, S, T, P, O, Z, and F are only a function of variables x and 
y. There are four types of second order PDEs that can be solved easily. 


7.10.1 Type I (Direct Integration) 


In Type I, if only one second-order derivative term appears in our PDE, the 
problem can be solved by direct integration. In particular, the Type I PDE is 


Oz O*#F 
Oz F 
= =— =F, (x, 7.568 
s axdy  S > (x,y) ( ) 
2 
je AFG (7.569) 


dy? 7 
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7.10.2 Type II (ODE of p or q) 
If there is only one second order derivative term together with one first derivative 


term (either p or q) such that it becomes an ODE in either p or q. In particular, the 
Type II PDE appears as 


Rr + Pp= RP + Pp = Fxy) (7.570) 

xX 
Op 

Ss + Pp = S—+ Pp = F(x,y) (7.571) 
oy 
gq 

ergs eee) (7.572) 
0q 

cS arms aa (7.573) 
y 


Note that all of these become a first order ODE for either p or q. 


7.10.3 Type ILI (First Order PDE) 


Certain special cases of (7.566) can be expressed as a first order PDE for either p 
or q. They are 


RAs Poe ROS? tere) (7.574) 
Ox oy 


5+ 114.09=SL+7 SF P+ Oq= Flay) (7.575) 
Xx ~ 

Both of these are linear first order PDEs. The Lagrange method discussed in 
Section 6.7 of Chapter 6 can be used to solve them. 

7.10.4 Type IV (Second Order ODE) 

Certain special cases of (7.566) can be expressed as a second order PDE for either 


p or q. They are 
2 


Rare eR - po are ree (7.576) 
Ox Ox 

1409422270 240% 47 = F(x y) (7.577) 
ey? ey , . 


These are second order ODEs for z. 


Example 7.4 Consider the solution of the following PDE: 


07z 


=2x+2 7.578 
axdy y ( ) 
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Solution: Integrating with respect to x, we get 
Oz 2 


x” +2xy+ @(y) (7.579) 
oy 
Integrating (7.579) with respect to y, we find 
z=xytxy?+ [ora +(x) (7.580) 
Since both o and y are arbitrary functions, we can rewrite it as 
z=x’ytxy" + f(x)+FO) (7.581) 


Example 7.5 Consider the solution of the following PDE: 


2 
go 0 (7.582) 
Ox ox 
Solution: By using the definition of p given in (7.564), we have 
go 556 (7.583) 
Ox 
This can be rearranged as 
ae = ec) (7.584) 
77) x 
Integration gives 
Pp - £0) (7.585) 
x 
where fis an arbitrary function of y. Using the definition of p in (7.564), we get 
6/4 1 
—=p=—=f0) (7.586) 
Ox x 
Integrating (7.586) with respect to x, we find 
1 
eB 7 OPO) (7.587) 


7.11 MONGE AND MONGE-AMPERE METHODS 


Recall from (7.565) that the following general second order PDE is considered: 

F(x, y,2,1,5,t, p,q) =0 (7.588) 
We will consider a general method in this section, called Monge’s method for the 
case of quasi-linear. The extension of this method, called the Monge-Ampere 
method, will also be considered for the case of nonlinear. 
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7.11.1 Some Examples 


Let us consider the following particular problem, and then we will generalize the 
observation to the so-called Monge method. In particular, a first order PDE is given 
as 


Oz Oz 


2; 
a a os qy = G(x" y) (7.589) 
Differentiation of (7.589) with respect to x, we have 
92 a 7p page (7.590) 
Ox Ox 
Using the definition given in (7.564), we can write (7.590) as 
2rx+2p—sy =2xyd'(x’y) (7.591) 
Differentiation of (7.589) with respect to y, we have 
2sx—ty—q =x°¢'(x’y) (7.592) 
Eliminating the arbitrary function from (7.591) and (7.592), we get 
(2sx —ty—q)2y = x(2rx +2p—sy) (7.593) 
Rearranging (7.593), we obtain 
21x? —5xys +2y°t + 2( px+qy) =0 (7.594) 


In explicit form, (7.574) is 
2 2 2 
9 as 7 a en (7.595) 
ex? OxOy oy’ ox oy 

Note that this is a linear second order PDE in r, s, and t. Inversely, we can consider 
(7.589) as a partial integral of PDE in (7.593). However, we can show that this is 
not the only partial integral of (7.595). In fact, applying the procedure in getting 
(7.595) to the following first order PDE, we find the same (7.595): 


px—2qy =y(xy") (7.596) 
That is, two different first order PDEs lead to the same second order PDE. This is 


what Monge observed and this is the basis of formulating Monge’s method. 
For a second example, we consider a nonlinear first order PDE 


p’+q=@(2x+y) (7.597) 
Differentiation with respect to x and y respectively gives 
2pr+s=29'(2x+y) (7.598) 
2ps+t=9'(2x+y) (7.599) 
Eliminating the arbitrary function @, we find 
—2pr+(4p—1)s+2t=0 (7.600) 


This is a linear PDE in, s, and ¢. In explicit form, it is 
az 6"z aia OZ az Oz 
Ox Ax? Ox axdy = ay" 
Finally, we consider the following example 
y-p=P(x-4) (7.602) 
Differentiation with respect to x and y, respectively, gives 


=0 (7.601) 
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—r =(l—s)¢'(x-q) (7.603) 
l-s =-t¢'(x-q) (7.604) 
Eliminating the arbitrary function ¢, we find 
2s+(rt-s?)=1 (7.605) 
Note that the terms in the bracket are nonlinear in r, s, and ¢. This is equivalent to: 
@z O28z 672, 
+[ ( j=l (7.606) 


Oxéy ex dy’ Oxdy 


7.11.2 Generalized Form 


Guided by examples in the previous section, we will consider the more general 
form in this section. More specifically, two first integrals of the second order PDE 
given in (7.588) are assumed as: 


u=Uu(X,V,Z, Pq) (7.607) 
V=V(X,Y,Z, D9) (7.608) 

In general, we can write intermediate integral of (7.588) 
u=@(v) (7.609) 


That is, if both (7.607) and (7.608) correspond to the same second order PDE, u 
and v must relate to each other. Taking the total differential of (7.609), we find 
OE i OE OE Oa 
ox a OZ Op oq 


s 3 3 ; (7.610) 
=¢' one d+ dy +dz+—dp+§—dq} 
oy OZ Op oq 
Dividing through dx and dy eae we find 
Ou Ou Ou Ou Ov 2p ne Ov 
pt =9' Mie ray (7.611) 
ax az ie oq ed oq 
a a as (yf eae aa (7.612) 
Oy oz” &p ay az" Op oq 
Eliminating the unknown a i. we get 
Rr+Ss+Tt+U(rt—s?)=V (7.613) 
where R, S, T, U, and V are all functions of x, y, z, p and g: 
CAD A CA) MCAD BC pet 
O(p,z) O(p,y) O(x,q) — O(Z,q) 
BECCARY CCAD CCAD BRC eis) 
O(x,p) — O(z,p) — (9,2) O(4,y) 
_ wry) ,_ Aw) Aw) | Au) Gas 


Ap.’  ay.z) 1 (zx) Ax) 
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Note that (7.613) is a nonlinear second order PDE because of the bracket term. 
However, if wu and v are not functions of p and gq, U will be zero and in turn (7.613) 
is reduced to a linear PDE (or quasi-linear PDE). This special case is the focus of 
Monge’s method. 


7.11.3 Monge’s Method 


We first consider the case that U is zero, and thus the PDE becomes 

Rr+Ss+Tt=V (7.617) 
The following method was proposed by Monge in 1784. Let us take the total 
differential of p and q as: 


pe da wey (7.618) 
ox oy 
Fea  eeiacs ie Se (7.619) 
ox oy 
Substitution of (7.618) and (7.619) into (7.617) gives 
es 7 yy (7.620) 
dx dy 


This can be rewritten as 

Rdpdy + Tdgdx —Vdxdy — s( Rdy* — Sdydx + Tax’) = 0 (7.621) 
Now we set the first three terms and the bracket term in (7.621) to zero separately. 
This can be rewritten as 


Rdpdy + Tdqdx — Vdxdy = 0 (7.622) 


Rdy? — Sdydx + Tdx? =(A,dy + B,dx)(A,dy + Bydx) = 0 (7.623) 
These are the intermediate integrals of (7.617) and are known as Monge’s equations. 
Note that we can always factorize (7.623) as shown in the second part of (7.623). 
Then, this leads to two systems, namely 
Ady+B,dx=0, Rdpdy+Tdqdx —Vdxdy =0 (7.624) 
A,dy + B,dx=0, Rdpdy+Tdqdx —Vdxdy = 0 (7.625) 
If these equations can be integrated, we have the intermediate integrals. We may have 
either one or two intermediate integrals, depending on whether both of them are 
integrable. The solution that satisfies (7.624) or (7.625) is also a solution of (7.617), 
but a solution of (7.617) may not satisfy (7.624) or (7.625). 


Example 7.6 Consider the solution of the following PDE: 


2 2 2 
Df SF By OF ao OE ay ng (7.626) 


Ox? OxOy dy’ Ox Oy 


Solution: In simplified form, (7.626) can be written as 
2x°r —5Sxys + 2y*t + 2(pxtqy) =0 (7.627) 
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Using the result of (7.622) and (7.623), we have 


2x" dy? + Sxydydx +2y'dx* =0 (7.628) 
2x" dpdy + 2y"dqdx + 2( px + qy)dxdy =0 (7.629) 
Factorizing (7.628), we have 
(xdy + 2ydx)(2xdy + ydx) =0 (7.630) 
Setting the first factor in (7.630) to zero, we get 
xdy + 2ydx = 0 (7.631) 
Integration of (7.631) gives 
uy =x’y=a (7.632) 
Dividing (7.629) by xdy and noting (7.631), we get 
2xdp — ydq + 2 pdx — qdy =0 (7.633) 
This can be integrated as 
Uy =2px—yq=b (7.634) 
These two characteristics give the solution as 
2px—yq = $(x"y) (7.635) 
Similarly, setting the second factor of (7.630) to zero, we find 
xy =c (7.636) 
Following a similar procedure, we get 
px—2qy = 00") (7.637) 
Solving for p and qg from (7.635) and (7.637), we get 
p= 2dy)-vOr") (7.638) 
oe! 2 2 
q= By y)—2y(xy")} (7.639) 


By definition, we can integrate z by its total differential 


de = pdx + qdy = ~ (29x?) -y?)}de + 507) ~2y(xy?)}dy 


1 Qdx dy. 1 dx 2d, ae) 
xX Xx 
== 9? y( 4 ) <a (tS) 
3 x y 3 x y 
Integrating both sides, we get 
1 1 

z= [oy In(x?y)—5 [vay )d In(xy?) (7.641) 

Therefore, a solution of (7.626) is 
z= f(x’ y)+ F(x’) (7.642) 


However, this is only one of the possible solutions of (7.626), but not the most 
general solution. 


Example 7.7 Consider the solution of the following PDE: 
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2 2 2 
2O0°Z 3 Oz  0z_& 


J x? ‘ Oxdy Oy? ax ) 
Solution: In simplified form, (7.643) can be written as 
y’r—2yst+t = p+6y (7.644) 
Using the result of (7.622) and (7.623), the two intermediate integrals are 
y dy? + 2ydydx + dx” =0 (7.645) 
y’dpdy + dqdx —(p+6y)dxdy =0 (7.646) 
Factorizing (7.645), we obtain 
(ydy+dxy’ =0 (7.647) 
Integration of (7.647) gives 
2x+ y =a (7.648) 
Dividing (7.646) by ydy and noting (7.647), we get 
ydp —dq+(p+6y)dy =0 (7.649) 
Integration of (7.649) results in 
py-q+3y" = (7.650) 
These two characteristics give the solution as 
py-q+3y =d(2x+y’) (7.651) 
In explicit form, we can express (7.651) 
Sy ay = g(2x+y") (7.652) 
ox” Oy 


Applying the Lagrange method discussed in Section 6.7, we find the following 
subsidiary equation 


dx dy _ dz (7.653) 
y -l -3y?+¢(2x+y’) 
The first two parts of (7.653) give the first characteristics as 
uy = x+y? =a (7.654) 
In view of (7.654), the last two of (7.653) gives the second characteristics as 
dz+[-3y" + d(a)]dy =0 (7.655) 


Integration of (7.655) gives the second characteristics and back substitution of the 
value of a from (7.654) leads to 


Uy =z—y + yh(2x+y’)=b (7.656) 
Thus, the solution for (7.643) is 
y{z—y + yh(2x+y*),2x+y"} =0 (7.657) 


Finally, the solution in (7.657) can be given in explicit form 
z=y —yh(2x+y?)+ f(2x+y7) (7.658) 
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7.11.4 Monge-Ampere Method 


Monge’s method, discussed in the last section, was extended to the nonlinear form 
given in (7.613) by Ampere in 1814. In particular, we consider the following PDE 


Rr+Ss+Tt+U(rt—s?)=V (7.659) 
Recall from (7.618) and (7.619) that r and ¢ can be found as 


r= dp—sdy (7.660) 
dx 
t= dq—sdx (7.661) 
dy 
Substitution of (7.660) and (7.661) into (7.659) gives 
—sd —sd. dp- - 
RO ip OOO er G68) 
dx dy dx dy 


Expanding (7.662), we get 
s[Rdy? — Sdxdy + Tdx? + U(dxdp + dydq)| 


(7.663) 
—[Rdydp + Tdxdq + Udpdq —Vdxdy|=0 
Setting both the bracket terms to zero, we obtain 
Rdy* — Sdxdy + Tdx” +U(dxdp + dydq) =0 (7.664) 
Rdydp + Tdxdq + Udpdq —Vdxdy = 0 (7.665) 


We now assume the existence of a function 2 = A(x,y,z,p,q) such that the following 
group can be factorized: 


A[Rdy* — Sdxdy + Tdx* + U(dxdp + dydq)|+ Rdydp + Tdxdg + Udpdg —Vdxdy 
= (ady + bdx + cdp)(ady + Bdx + ydq) 
=aady* + (af +ba)dxdy + bBdx* + cBdxdp + aydydg + cadydp + bydxdq + cydgqdp 
=0 

(7.666) 
This is the most important step in the Monge-Ampere method. Equating coefficients 


of all differential terms on both sides of (7.666), we find 
AR=aa, -SA-V=af+ba, AT=bB, cBh=AU =ay, 


7.667 
ca=R, by=T, cy=U ( ) 

We can choose the solution for the unknown coefficients as: 
Jo gam % =: G2, 22. 2G (7.668) 


Substitution of (7.668) into the second equation of (7.667) gives a governing 
equation for i: 


TR 
a 


Uae =s1=F (7.669) 


Rewriting (7.669), we find a quadratic equation for A 
U?A? +SUA+TR+UV =0 (7.670) 
There are two distinct roots for i: 
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A=’, A=A, (7.671) 

Finally, we can factorize (7.666) as 
(A, Udy + Tdx + Udp)(Rdy + A,Udx + Udq) = 0 (7.672) 
(A,Udy + Tdx + Udp)(Rdy + 2,Udx + Udq) = 0 (7.673) 


By setting the bracket terms to zero in turn, we obtain four combinations. However, 
two of them give unacceptable results. In particular, the first and second systems 
are 

(Rdy + A,Udx +Udq) =0, (Rdy + 2,Udx + Udgq) =0 (7.674) 

(A,Udy + Tdx +Udp) =0, (A,Udy + Tdx + Udp) = 0 (7.675) 
Both of these lead to 4; = Az, which is not acceptable. Finally, we have the 
following two systems of equations: 

(Rdy + A,Udx +Udq)=0, (A,Udy + Tdx + Udp) =0 (7.676) 

(A,Udy + Tdx +Udp)=0, (Rdy+A,Udx +Udq) =0 (7.677) 
If any of these two systems can be integrated, we have two equations for p and q. 
The solutions of them can be used to give the exact differential of z: 

dz = pdx + qdy (7.678) 

We will illustrate this method in the following example. Thus, the failure or success 


of the Monge-Ampere method depends on whether we can solve for p and g from 
(7.676) and (7.677). 


Example 7.8 Consider the solution of the following PDE: 
az O28z 672, 
+[ ( yJ=1 (7.679) 
Oxy ax? Gy? — Axdy 


Solution: In simplified form, (7.679) is written as 

2s+(rt—s?)=1 (7.680) 
This is the nonlinear PDE type given (7.659) considered by Ampere. Thus, we 
have 


R=T=0, S=2, U=V=1 (7.681) 
Therefore, the characteristic equation for 1 given in (7.670) becomes 
A? +244+1=0 (7.682) 
Both roots are —1, and the corresponding system is 
dy-—dp=0, dx-—dq=0 (7.683) 
Integration gives the characteristics as 
uj=yY-p=a, uy =x-qz=b (7.684) 
Therefore, the intermediate integral is 
y—p=f(x-@) (7.685) 


Note that an arbitrary function is involved in (7.685). The Monge-Ampere method 
does not lead to the general solution but instead yields a particular solution. 
Therefore, without loss of generality we can set 

x-q=a (7.686) 
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With (7.686), (7.685) gives 


y-p=flaj=b (7.687) 
By making this assumption, we can solve for p and q as: 
p=y-b, q=x-a (7.688) 
Substitution of (7.688) into (7.678) yields 
dz = pdx + qdy =(y—b)dx +(x-a)dy (7.689) 
A particular solution is 
z=xy—bx-ayt+c (7.690) 


Another particular solution can be obtained by assuming a linear functional form 
for (7.685) 


y-p=m(x-q)t+n (7.691) 
Thus, we have 
Oz Oz 

mq =——-m— = y-n-mx 7.692 
a ee (7.692) 

Adopting the Lagrange method discussed in Section 6.7, the subsidiary equation is 
dx dy _ dz (7.693) 

1 -m y-n-mx 


The first pair gives the following characteristics 
uy =y+mx=a (7.694) 
The second pair of (7.693) gives 
dx dz dz 


= = (7.695) 
1 y-n-mx = a-—n—-2mx 
Integrating both sides we get 
Uy =z—(a—n)x+mx* =b (7.696) 
Therefore, another particular solution is 
zZ=-nx+xy+@(yt+mx) (7.697) 


This solution is slightly more general, but the bottom line is that this is still a 
particular solution. Thus, the Monge-Ampere method will lead to a “particular” 
solution only. 


Example 7.9 Consider the solution of the following PDE: 
az Oz Oz a26'z az 
— +3 —— + — +[—- —_( 
dx? Oxdy- Gy? ax Gy Oxy 


)}=1 (7.698) 


Solution: In simplified form, (7.698) is written as 
r+3s+t+(rt-s?)=1 (7.699) 


This is of the nonlinear PDE type given in (7.659) considered by Ampere. Thus, we 
have 


R=T=1, S=3, U=V=!1 (7.700) 


Therefore, the characteristic equation for 1 given in (7.670) becomes 
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24° +324+1=0 (7.701) 
The two roots of A are —1 and —1/2. From (7.676) and (7.677), we have two 
systems as: 


—dy+dx+dp=0, dy =e + dg =0 (7.702) 
~5dy tdi + dp 0, dy-dx+dq=0 (7.703) 
Integration of (7.702) and (7.703) gives two intermediate integrals as 
1 
Preys eed) (7.704) 
1 
presale a) (7.705) 


where f and g are arbitrary functions. Next, let us assume 
1 
ae aa q-x+y=f (7.706) 


where a and B are not constants. Using (7.706), we can rewrite (7.704) and (7.705) 
as 


ptx-y=f(@), ptx-zy= 8B) (7.707) 

Solving for x, y, p and g from (7.706) and (7.707), we get 
x=2(a-f), y=2[g(f)-f(@)] (7.708) 
p=y-xt+f(a), q=x-yt+P (7.709) 


Then, the total differential of z is expressed as 
dz = pdx + qdy =|y—x+ f(a@)|dx + (x—y+ B)dy 
=(y—x)(dx—dy) + f(a)dx + Bdy 
Note from (7.708) that 
dx =2(da-dB), dy=2[g'(P)dB- f'(a)da] (7.711) 
Substitution of (7.711) into (7.710) gives 


dz= -Sa(x-y)" -2f(a@)dB+2f(ajdat+2Be'(PdB-2Bf'(a)da (7.712) 
Integration gives 


z=-5 (xy) +2] f(@da +2] Be'(BdB-4p f(a) 


(7.710) 


=F (xy)? + (a) +2f8(8)-2] e(BydB-2B0(@) (7.713) 


== (x=)? + 6a) + By'B)-(B)-2f6(@) 
where the arbitrary functions ¢ and w are defined as 
wWa)=2[ fade, y(B)=2| sBdB (7.714) 


The final solution is summarized as 
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z= -F (0-9) + $(a)+ By"(P)-yW(B)-289'(a) (7.715) 
x=2%U(a-f), y=2[g(P)-f(@)] (7.716) 


In this solution, a and B are considered as parameters. Once they are given, x and y 
can be evaluated and, subsequently, z can be determined by (7.715). There are two 
arbitrary functions involved in this parametric solution, and this is the general 
solution. 


712 FACTORIZATION OF PDE WITH CONSTANT COEFFICIENTS 


The method considered here can be viewed as a generalization of Section 3.5.10 
for ODEs. We now consider a special form of PDE with constant coefficients: 


(D" +a,D"'D+a,D""D? +...+4,D")z = f(x,y) (7.717) 
where the differentiation with respect to x and y are written symbolically 
p=, pa. (7.718) 
ox oy 
We first consider the homogeneous case 
(D" +a,D"'D+a,D"*D? +...+.a,D")z =0 (7.719) 
Let us consider the simplest case 
(D—mD)z =0 (7.720) 
More explicitly, it can be written as 
aa Sle (7.721) 
ox oy 


Using the Lagrange method discussed in Section 6.7, the auxiliary equation of this 
first order PDE is 


ca 


aa (7.722) 
Integrating (7.722) gives two characteristics 
uj=Zz=a (7.723) 
Ur =mx+y=b (7.724) 
Therefore, the general solution is 
z=F(mx+y) (7.725) 
This suggests that the solution of (7.719) can be expressed in the following form 
Z=Fi(y+mx)t+ Fy(y tmx) +...+F,(y+m,x) (7.726) 
where m; (i = 1,2,3,...,n) are the roots of 
m" +am"|+a,m"* +...+4, =0 (7.727) 


For the case of equal roots, we can consider the problem as 
(D-—mD)* z =(D—mD)(D-mD)z =0 (7.728) 


Thus, we can rewrite (7.728) as a system of equations as 
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(D—mD)z=u, (D-—mD)u=0 (7.729) 
From the result in (7.725), we have 
u=F(y+mx) (7.730) 
The first equation of (7.729) becomes 
p-mgq = F(y+mx) (7.731) 
The Lagrange method gives an auxiliary equation as 
a ae (7.732) 


1 -m F(y+mx) 
The first two parts of (7.732) give 


uy =yt+mx=a (7.733) 
The second and third parts of (7.732) result in 
dz— F(a)dx =0 (7.734) 
Integration gives 
Uy =zZ—-xF(y+mx)=b (7.735) 
Thus, the general solution is 
o{z—xF(y+mx), y+mx} =0 (7.736) 
Therefore, equivalently the solution is 
z=xF(y+mx)+ Fi(y+mx) (7.737) 
This procedure can be generalized to give the solution of the following PDE 
(D-mD)"(D-m,D)---(D—m,D)z =0 (7.738) 


The general solution is 


z=x"'F(y+mx)+x" °F (y+mx)+...+ F,_(y +mx) (7.739) 

+H (y tmx) +...+ A, (vy +m,x) , 
Note that in this technique the order of differentiation for each term must be of the 
same order. 


Example 7.10 Consider the solution of the following PDE: 
az az az 


a Bee + axoy? 0 (7.740) 
Solution: In factorized form, we have 
(D® -3D°D+2DD")z =0 (7.741) 
The associated algebraic equation for m is 
m> —3m? +2m=0 (7.742) 
which can be factorized as 
m(m—2)(m—-1) =0 (7.743) 


Using (7.739), we have the solution 
Z=F(y)+ hy (vt+x)+ (yy +2x) (7.744) 
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Example 7.11 Consider the solution of the following PDE: 
pO gO Ban O 2 


—+5 +2—=0 (7.745) 
-e axdy = ay 
Solution: In factorized form, we have 
(2D* +5DD+2D7)z =0 (7.746) 
The associated algebraic equation for m is 
2m? +5m+2=0 (7.747) 
which can be factorized as 
(2m +1)(m+ 2) =0 (7.748) 
Using (7.739), we obtain the solution as 
z= R(y-53)+F(r-22) (7.749) 


This solution can also be found by the classification discussed in an earlier section, 
and two characteristics can be found. The canonical form can be solved. 
Apparently, the present technique is more efficient. 


7.13 PARTICULAR SOLUTION BY SYMBOLIC METHOD 


The symbolic method for determining a particular solution of an ODE has been 
discussed in Section 3.5.11. In this section, this method is applied to consider a 
particular solution of a PDE. Consider a nonhomogeneous PDE of the form 


F(D,D)z = f(x,y) (7.750) 
Using the symbolic method, we get the ae solution as: 
Zz 7.751 
> FD) ———= f(%y) ( ) 


Following the procedure discussed in Section 3.5.11, the symbolic method involves 
factorization, finding partial fractions, and expanding functions in infinite series. 
The following example illustrates this method. 


Example 7.12 Consider the solution of the following PDE: 


2 2 
O24 +922 - 1997 +36xy (7.752) 
ax? axdy ay 


Solution: In factorized form, we have 
(D? -6DD+9D*)z =12x? +36xy (7.753) 


The homogeneous PDE becomes 
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(D* -6DD+9D*)z =0 (7.754) 
The associated characteristic equation is 
m: —6m+9 =(m-3)" =0 (7.755) 
The homogeneous solution becomes 
Z, = F\(y+3x)+xFy(y +3x) (7.756) 
The particular solution can be expressed as 
ig= mee = ne (12x +36xy) 
= + 4012? +3639) poe 
D i 3D,3 
D 


Treating the differential operators as algebraic quantities, we can expand it using 
Taylor series as: 
1 
3D.> 
{a= 
ae, 
Substitution of (7.758) into (7.757) gives 


D Dee 
=1+6—+27 +... 7.758 
= i, ( ) 


1 D D 2 2 
= 1+6—+27 +..)(12x* +36x 
Zp =p H2UEY +H y) 


1 
ee +36xy) + 6; 363+... (7.759) 


DP: 
=x" +6x°y+9x* =10x* +6x°y 
The general solution now becomes 
z, = Fi(y +3x) + xFy(y + 3x) +10x* + 6x3 y (7.760) 


Let us now consider a more general form of the symbolic method. Consider the 
following linear order PDE: 


BanDe= “on “=p omgafes) (7.761) 
ox ov 
Using the Lagrange method in Section 6.7, we have 
a (7.762) 
1 —m IG, y) 
The first two parts of (7.762) give 
—mdx = dy (7.763) 


The corresponding characteristics is 
uy =ytmx=c (7.764) 
Then, the first and third parts of (7.762) can be combined to give 
dz = f (x,c—mx)dx (7.765) 
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where we have used the first characteristics (7.764) to replace y by x. Integration 


gives 
= | f(%,c—mxdx (7.766) 
Therefore, symbolically we can write the particular solution as 
1 
a= = f(x, y)= x,c —mx)dx 7.767 
p= Dp l =] f,e—m) (7.767) 
Let us illustrate by example. 
Example 7.13 Consider the solution of the following PDE: 
o 0.0 O i 
( 2—)\(— + —)z =(y-De’ (7.768) 
ox Oy Ox oy 
Solution: In factorized form, we have 
(D-2D)(D+ D)z =(D? — DD -2D”)z =(y—l)e* (7.769) 
The homogeneous PDE becomes 
(D? —- DD-2D7)z =0 (7.770) 
The associated characteristic equation is 
m —m—2=(m—2)(m+1) =0 (971) 
The homogeneous solution becomes 
2, = Fi( + 2x) + (y-x) (7.772) 
Define a function wu in (7.769) such that it becomes 
(D-2D)u =(y—-le* (473) 
The first characteristics of (7.769) is 
uy =yt2x=c (7.774) 
The particular solution can be found by first considering 
= [ f@,c-2x)a = [(c-2x—Derax =(c—2x+Ne* =(y+Ne* (7.775) 
By virtue of (7.767), we get 
1 ; 
u = ———(y-lDe* =(y+ De" 7.776 
(p-2b)” Jer =(y+D (7.776) 
Substitution of (7.776) into (7.769) gives 
(D+D)z =u =(y+le* 
The first characteristics is 
Uy =y-xX=c, (7.777) 
Thus, taking the inverse of the first fractional operator in (7.777) 
Z = —— +1 e* 7.778 
(D+D) (y+) (7.778) 


Using (7.777), we have 
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Zz +Det = [(¢, +x+De*dk = c,+x)e* = ye* 7.779 
s “os? ) (¢ ) (q+ x)e" =y (7.779) 

The general solution now becomes 
Z, =F(y+2x)+ Fy (y—x) + ye* (7.780) 


Let us now examine another form of nonhomogeneous PDE 

(D—mD—a)(D-nD -b)z = f (x,y) (7.781) 
Let us consider the homogeneous case first. In particular, we first consider the 
simple case: 


(D-—mD~—a)z = p—mq-az =0 (7.782) 
The Lagrange method gives an auxiliary condition of 
dk _ dy _& (7.783) 
1 -m az 


The first two parts of (7.783) give 


=y+mx=a (7.784) 
The first and third parts of (7.783) give 
adx = = (7.785) 
Zz 
Integrating this on both sides, we find 
uy, =ze “ =b (7.786) 
Thus, the general solution is 
o(ze “, y+mx) =0 (7.787) 
Equivalently, we can solve for z as 
=e"y(yt+mx) (7.788) 
Extending this analysis to a more general form of homogeneous PDE, we find: 
(D—mD—a)(D-nD—b)z =0 (7.789) 
=e" f(y+mx)+e"F(y+nx) (7.790) 


This is only true if a and b are constants (recall the factorization of ODEs discussed 
in Chapter 3). Let us illustrate the method with the following example. 


Example 7.14 ee ~ _= of ie following PDE: 


(<4 SWE 42S Dem de 3e46y (7.791) 


Solution: In factorized form, we have 
(D+ D-1)(D+2D-3)z = 4+3x+6y (7.792) 
According to (7.789) and (7.790), the homogeneous solution is 
z, =e" f (y—x) +e" F(y—2x) (7.793) 
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The particular solution of (7.792) can be considered as 
1 


“p “(D+ D—1(D+2D~—3) 


The inverse operator can first be expanded in series form: 


1 _l eel a ae] 
(D+ D-1)(D+2D-3) 3 pee gree 


(4+3x+6y) (7.794) 


=F 1+ (D+D)+..}{l4+5(D+2B)+..} (7.795) 


1 1 = 
=—{1+-(4D+5D)-+... 
5 5 | +s 
Substitution of (7.795) into (7.794) gives 

z, = F114 54D +5B)+..J(443x46y) 


= 5 443046744410} (7.796) 


=6+x+2y 


Finally, combining the homogeneous solution and particular solution gives the 
general solution 


Zp =e* f(y—x) +e" F(y—2x) + ye* +64+x+42y (7.797) 


7.14 SUMMARY AND FURTHER READING 


In this chapter, we started with the classification of second order PDEs, leading to 
three different types of differential equations. They are hyperbolic, parabolic, and 
elliptic. The canonical forms of the three types are considered. It was shown that 
for any linear second order PDE with constant coefficients, we could always 
convert it to three types of second order PDE: they are the nonhomogeneous Klein- 
Gordon equation (for hyperbolic type), the nonhomogeneous diffusion equation 
(for parabolic type), and the nonhomogeneous Helmholtz equation (for elliptic 
type). The solutions of these PDEs are then investigated briefly. Adjoint and self- 
adjoint general second order PDEs are considered. The mixed type of PDE is 
discussed in the context of two-dimensional steady gas flows. By hodograph 
transform, ideal gas flows is converted to Chaplygin’s equation and the Tricomi 
equation. The solution of the Tricomi equation is considered for the sake of 
completeness. Riemann’s integral of hyperbolic PDEs is also discussed. Further 
discussions of the mixed type PDE are available in Smirnov (1978), Landau and 
Lifshitz (1987), and Tricomi (1923). In view of its importance in mechanics and 
elasticity, the biharmonic equation is considered in detail. Four mechanics 
problems leading to the biharmonic equation are defined and derived: they are 
plane elastic problems, three-dimensional elasticity, bending of thin elastic plates, 
and two-dimensional viscous flow with low Reynolds number. Uniqueness of 
solution of the biharmonic equation is demonstrated. The biharmonic functions are 
considered through the use of Almansi theorems. The biharmonic solution for the 
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circular domain is considered in detail, using an integral formula that can be 
considered as an extension of the Poisson integral for potential theory. This 
solution was first obtained by Hua (2012). The solution technique for second order 
PDEs with non-constant coefficients is considered, including the Monge and 
Monge-Ampere methods. Finally, we discuss the factorization technique for higher 
order (higher than two) PDEs. Finally, we discuss the symbolic method for solving 
PDEs with constant coefficients. 

The mathematical analysis of PDEs of second order has been covered in all 
textbooks on PDEs, but PDEs of higher than second order have been relatively 
untouched in most textbooks on PDE. The only exceptions are the coverage of 
biharmonic equations by Selvadurai (2000b) and by Ayres (1952). In this chapter, 
we cover the biharmonic equation in more detail and we also discuss the technique 
of factorization for solving higher order PDEs and the symbolic method for higher 
order PDEs of constant coefficients. 


7.15 PROBLEMS 


Problem 7.1 The validity of (7.99) can be established from the following equation: 
OE =iMy2 , p AE=I) AE-in) , E=IMy9 _g 7.798) 
ox Ox Oy Oy 


Show the details. 
Problem 7.2 Show that 


RP = 44 (Ep £402(22)2(O2y2 +44B oS on 86 én , 0g On 
Ov Oy Ox Ox Oy Ox Ox Oy 
were _ erledwedr: 2) + 1% a (7.799) 


sac yey, 26 On, = 86 bn 86 on 
Oy Oy Ox Ox Oy Oy Ox Ox Oy 


Problem 7.3 Show that 
4AC = 4A? Couey + scr(Sr(CI +4 pce 
Oy Ox Ox Oy 
aay a a(S 28 ssa (Ey (Sb Can 
ox Ox Oy 
ony, 0g OF | O52 On On 
oy Oy Ox oy Oy Ox 


+4BC[( 


(7.800) 


Problem 7.4 From the results of Problems 7.2 and 7.3, show that 
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Bh. pain, WlnO Sas _ Oo yp oy a POT eee 
Bo =4AG BIGGS OPP Go? BE GF! 
06,2-0N,2 _4/06,,0N,,05,,On,  -ON,2 06 
MAGOG Oe? BF 
= (B? -4AC) J” 
Thus, the validity of (7.19) is established. 


| (7.801) 


Problem 7.5 Classify the following second order PDE: 


> Oru Ou > Oru 
x +2xy +y = 
ax? Oxdy dy" 


Also, find the canonical form of this differential equation. 
2 


0 (7.802) 


Ans: Parabolic, ‘= 0 


Problem 7.6 Classify the following second order PDE: 
Oru Cru Ou 
y +—= 
ax? OxOy Ox 


Also, find the canonical form of this differential equation. 


(7.803) 


Ans: For y > 0, hyperbolic, = 
Onog 


Problem 7.7 Generalize the self-adjoint conditions given Section 7.6 to the three- 
dimensional case (i.e., three variables). In particular, the linear differential operator 
becomes 


2 2 2 2 2 
Ly (u) = Ay 5 + By ZS + Cy 5+ 2Dy SA 428 
IX . IXOZ 
a yy 7 (7.804) 
eG ae, as eo 
Ovoz Ox Gy "Gz ~* 
Ans: 
AeA, B= By, C=C, (7.805) 
D, =D, Ej =£ 4, F; =f (7.806) 
0A, OD; OF; 1 
==(G,+G,) (7.807) 


ij Ji Ji 
ao a (7.808) 
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0E, OF, OC 1 
+ =—(K,+K,) 
ox = Oy a 2 4 
Problem 7.8 Find the solution of the following second order PDE: 
2 
Cad +AW =0 
Ouog 


Ans: 
W(u,6) = AJ (2fAue ) + BY (2) Aus ) 


Hint: Apply a change of variables of € =./¢ and then z= 2éVA . 


Problem 7.9 Find the Riemann function v for the following PDE 
uC 

—— +—u= f(x, 

Bxoy 4 L(Y) 


Ans: 


W(x, ¥) =Jo(}C —X My —Yo)) 


Hint: See Problem 7.8. 


(7.809) 


(7.810) 


(7.811) 


(7.812) 


(7.813) 


Problem 7.10 Show that the Riemann function for the case of A = B = C=0 in 


(7.443) is 
w(x, y)=1 


(7.814) 


Problem 7.11 Consider the following second order PDE, which is a gas dynamic 


problem considered by Riemann: 
av a ov ov 2av 
+ + >= 
OxOy x+y Ox Oy (x+y) 
(i) Consider the following change of variables 


lore F(), pa e-em) 


(x+y)(S+m) 
Show that 
WS 4 7+ Z Fe 
Ox xt x+y Ox 
Ov _/€+M\a, —a OZ 
5 I FOtS Fe) 
av &+n.,,a(a+l) a oz Oz, _, 
ayox “lg US ay F(z) a ex ey 
oz Oz Oz 


* ax FG ar z)) 


(7.815) 


(7.816) 


(7.817) 


(7.818) 


(7.819) 
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(ii) Show that (7.815) can be reduced to 


2 
ee ie 2 Se ? F(z) =0 (7.820) 
Ox Oy Oyox (x+y)? 
(iii) Prove the following identities: 
od >2(z-1) (7.821) 
Ox Oy (x+y) 
2 ay 
Orz _ 2z x (7.822) 
dyox (x+y) 
(iv) Show that F satisfies the following equation 
2(z-lF"(z)+ (1-22) F(z) + a(a+1)F(z) =0 (7.823) 
(v) Find the solution for v 
Ans: 
v= ye - Ai 20); (7.824) 
(x+y)S+m) 
Problem 7.12 Prove the nite vector identity that we use in Section 7.9.4: 
V'u=V(V-u)-2Vx@ (7.825) 


Problem 7.13 Prove that the biharmonic equation in Cartesian coordinates cannot 
be solved by using separation of variables. 


Problem 7.14 It is given that 

Viw=0 (7.826) 
Consider a circular two-dimensional domain such that w = w(r,0). Now define a 
new function w* as: 


w*(r,0) = rw(2,0) (7.827) 
F 
Prove that w* also satisfies biharmonic equation (7.826). 


Hint: 

(1) Assume a change of variable of 7’ = 1/r and 0'= 0. 
(ii) Assume next that w = w*(r', OV?°. 

(111) Use the second Almansi theorem to express w*. 

(iv) This result can be found in Eq. (13) of Duffy (1961). 


Problem 7.15 It is given that u satisfies the following diffusion equation 
Ou _ Oru 
ot ax? 

Consider a function v defined in terms of u(x, f) as: 


(7.828) 
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x -l 


1 = 
v(x,t) =e *u(—,— 7.829 
(x,t) a C : ) ( ) 
Prove that v also satisfies the diffusion equation (7.828). 


Hint: This is Problem 498 of Gelca and Andreescu (2007) as a training problem 
for the William Lowell Putnam Mathematical Competition for college students in 
North America. 


Problem 7.16 Find a second order PDE that has a first or intermediate integral of 
the following form: 

py—-qt+3y =d(2x+y’) (7.830) 
Ans: 

2 2 2 
oe 2 0 (7.831) 
ox OxOy dy” Ox oy 

Problem 7.17 Find the solution of the following second order PDE by direct 
integration 


2y 


2 
OF aay (7.832) 
Oxdy 
Ans: 
1 1 
Za SUV ZVXtAO +0) (7.833) 


Problem 7.18 Find the solution of the following second order PDE by direct 
integration 


Oz 
=x-y (7.834) 
Oxdy 
Ans: 
1 1 
z= gE gy tate, (y) (7.835) 


Problem 7.19 Find the solution of the following nonhomogeneous diffusion 
equation 


u, =aCUu,, + f(x)sin(t), O<x<l, t>0 (7.836) 

subject to the following initial and boundary conditions: 
u(0,t)=0, u(l,t)=0, t>0 (7.837) 
u(x,0)=0, O<x<l (7.838) 


Ans: 


(x1) py aze sint+e" Mt Soy ies (7.839) 
u(x,t) = |, sin(nzx : 
par l+a‘as 
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1 
[a I, f(x)sin(nzx) dx (7.840) 
A, =n, n=1,2,3,.. (7.841) 


Problem 7.20 Extend Hua’s formula derived in (7.563) to the case of a circular 
domain with radius 79: 


_@ my oI [" hy) 
201% 2J-2/? +H —2rry cos(O -y) 
™ [7% —rcos(@— 
+f L% ! O-wlsy) sy} 
[1 +r° —2rrm cos(O-y)] 


(7.842) 


Problem 7.21 Derive the following mean value theorem for biharmonic problems: 


(0,0) =f" sway" nonaw (7.843) 


Hint: Compare the mean value theorem given in Chapter 9 for harmonic problems. 


CHAPTER EIGHT 


Green’s Function Method 


8.1 INTRODUCTION 


In 1828, self-taught genius George Green, at the age of thirty-five, published An 
Essay on the Application of Mathematical Analysis to the Theories of Electricity 
and Magnetism (Green, 1828). It is amazing that Green derived this result and 
published it as a book at his own expense before he received any formal education. 
The report was sent to 51 subscribers of the Nottingham Subscription Library. To 
be exact, he had only attended one year of primary school at the age of nine before 
this discovery. Green is probably the best self-learner in the history of mathematics 
and physics. 

Green’s work remained relatively unknown until Lord Kelvin in 1845, four 
years after George Green’s death, rediscovered it. He recognized its importance 
and helped to publish Green’s essay in Crelle’s Journal. Poincare summarized our 
knowledge of Green’s functions near the turn of the twentieth century. In 1946, P. 
M. Morse and H. Feshbach published their classnotes as Methods of Theoretical 
Physics. They laid out the four main properties that a Green’s function must 
possess. Morse and Feshbach showed that “Green’s function is the point source 
solution [to a boundary-value problem] satisfying appropriate boundary 
conditions.” Thus, Green’s function could be found by simply solving the 
differential equation subject to a Dirac delta function with homogeneous boundary 
conditions. With this understanding, the powerful techniques of eigenvalue 
expansions (e.g., Chapter 10) and integral transform methods (e.g., Chapter 11) 
could be used in a straightforward manner to find Green’s functions. Green’s 
function method is especially useful in solving nonhomogeneous differential 
equations. It always provides the basis of integral equations and consequently the 
boundary element method (Brebbia et al., 1983). 

Shortly after the publication of Green’s monograph, German mathematician 
Carl Gottfried Neumann (1832-1925) developed the concept of Green’s function 
as it applies to the two-dimensional (in contrast to three-dimensional) potential 
equation. He defined the two-dimensional Green’s function, showed that it 
possesses the property of reciprocity, and found that it behaves as In(r) as r > o. 
A. Harnack gave the Green’s function for a circle and rectangle. All of these 
authors used eigenfunction expansions in obtaining the Green’s function, which 
becomes one of the fundamental techniques in constructing a Green’s function. 
Later on, John Dougall (1867-1960) derived three-dimensional Green’s functions 
in cylindrical and spherical coordinates. Subsequently, Green’s functions for many 
different differential equations were derived. For example, the Feynman diagram 
for elementary particle interactions actually describes the interaction between 
particles by Green’s function. Nobel Prize laureate Julian Schwinger confessed that 
it was Green’s function method led him to the Nobel Prize in physics. He shared 
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the Nobel Prize with Tomonaga and Feynman in 1965 in quantum electrodynamics 
(Beiser, 2003). 


8.2) POTENTIALS 


Whenever a vector field can be determined by the vector derivative of a scalar 
function, we say that potentials exist. In nature, many phenomena can be modelled 
and expressed in terms of potentials, such as gravitational field, electric field, 
incompressible flow, etc. 
Mathematically, a velocity field v is expressed as 

v=V¢ (8.1) 
where ¢ is the scalar potential. From the divergence theorem of Gauss, the volume 
integral can be converted to the surface integral as 


[Nav t= Je -nas (8.2) 


Substitution of (8.1) into (8.2) gives 
ify V24dQ = {I V¢-ndS (8.3) 
Q r 


If the velocity field v represents fluid flow and it is incompressible (i.e., V-v = 0), 
we have 

V7¢=0 (8.4) 
which is the Laplace equation. Therefore, for any problem that can be modelled by 
the Laplace equation, there must exist potentials. 


8.3. GREEN’S FUNCTION FOR LAPLACE EQUATION 


The Laplace equation is one of the most fundamental second order partial 
differential equations because of its repeated appearance in the modelling of 
physical problems, including electrostatics, incompressible fluid flow, gravitational 
theory, and membrane deflection. In Cartesian coordinates, it is given as 
Ou Ou 2 Oru _ 

ax? oy" 62? 

To consider the three-dimensional Green’s function, it is more convenient to 
consider the Laplace equation in spherical problems 

10,50 1 @€8,. eu 1 Ou 

=r? —)+=— (sind —)+ ~—.—-—— =0 (8.6) 
r Or Or rsin@ 00 00 rr sin’ 0 dp 

For the case of a point source or Green’s function in an infinite domain, the 
solution must be symmetric with respect to the origin, and thus the solution is 
independent of @and g. For this case, we write the problem as 


V’G(r—1%) = O(r —1%) (8.7) 
where is the Dirac delta function, which is infinite at r = ro and otherwise zero: 


V-u= 


(8.5) 


V-u= 
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O(r-%)=0 rF#NH (8.8) 


=0 r= 
In addition, we have that 
ff Ol Q=1 (8.9) 


where 2 is any volume integral embracing the source point ro. If the source is not 
included in Q, the integral will be identically zero. This Dirac delta function is 
named in honor of electrical engineer and physicist, Paul Dirac, who received the 
Nobel Prize in physics for his major contribution to quantum mechanics. The idea 
of using the Dirac delta function actually occurred much earlier than its usage in 
quantum mechanics by Dirac. For example, it can be used to prescribe point force 
in the case of beam bending subject to a concentrated force. However, its 
successful application to quantum mechanics leads to the development of a 
completely new theory by mathematicians. To validate the mathematical analysis 
involving the Dirac delta function, the theory of distribution of Schwarz and 
Gelfand or the so-called generalized theory, was proposed. According to the 
generalized theory, the Dirac delta function is defined using the process of 
integration. An introduction on Dirac delta and the associated distribution theory 
will be given in a later section. 

Note that strictly speaking, Green’s function should be defined in (8.7) by 
using the adjoint differential operator of (8.5). But for the case of the Laplace 
equation, it is self-adjoint. Therefore, (8.5) and (8.7) are valid only for the case of 
the self-adjoint problem (Greenberg, 1971). 

More generally, if a Green’s function is formulated in a finite volume, we can 
express it into two parts: 

G=G)+G, (8.10) 
The first part, Go, is called the fundamental solution with a singularity at the source 
point r = ro for an infinite domain (this is actually a particular solution of the PDE), 
whereas the second part is the homogeneous solution or satisfies the following 
homogeneous form with given a boundary condition (Greenberg, 1971): 


V’G, =0 (8.11) 
This two-part Green function will be illustrated again in Section 8.6. 
For functions independent of @and gq, (8.6) is reduced to 


1 d,,dG 
——(r* —)=0 8.12 
r- dr u dr ) oe 
except at point r = ro. Integrating (8.12) once, we find 
dy za (8.13) 
dr r 
Integration of (8.13) gives 
Gy oo ee (8.14) 
r 


For the case of a point source at the origin, the solution Go should decay to zero as 
r — (recalling that the fundamental solution is for the infinite domain), and this 
gives 
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C, =0 (8.15) 
To find the first constant, we have to consider the volume integration embracing 
the source point ro as shown in Figure 8.1. An arbitrary small spherical domain 
containing the source point is denoted by Qo and the corresponding boundary is Xo. 
Then, we apply the Gauss theorem to get 


fff, "oe ff, GPs 


2a ea 
= | | Oe Sy in eqbde (8.16) 
0 40 0r sr 
=4nC, =1 
The last part of (8.16) results from (8.9). Therefore, we have 
1 
C=— 8.17 
An ( ) 


Finally, the fundamental solution is 


Gy) = =e (8.18) 
4ar 
However, in the literature related to electrostatic problems, a negative sign is used 
on the right hand side of (8.7). That is, (8.7) and its fundamental solution are 
normally given as 


V’G(r—1%) =—d(r—1) (8.19) 
eel 
7 4ar 


eo (8.20) 


Figure 8.1 A 3-D body containing the singular point that a point source ro is applied 
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The fundamental solution given in (8.18) or (8.20) is finite everywhere except at 
the origin (i.e., 7 # 0). This is the three-dimensional Green’s function for the 
Laplace equation for an infinite domain. Actually, Chau (2013) showed that this 
harmonic solution (i.e., solution of Laplace equation) is the granddaddy of many 
other harmonic functions, and is of profound importance in solving elasticity 
problems using Papkovitch-Neuber displacement potentials (or the so-called P-N 
potentials). 

For the two-dimensional Green’s function, it is convenient to consider the 
cylindrical coordinates of the Laplace equation 


10, Ou 1 07u 


Vou = + =d(r-1 8.21 
u or ee ? op (r-1) (8.21) 
For this case, we write the solution as 
u= g(r) (8.22) 
The governing equation of Green’s function given in (8.21) becomes 
d’g 1 dg 
ee (8.23) 


As discussed in an earlier chapter, we can introduce a change of variables to reduce 
the order of the ODE as 


= =Z 8.24 
dr ee 
Then, (8.23) becomes 
acl + Z Z=0 (8.25) 
dr r 
This is a separable first order ODE. Rearranging and integrating, we get 
Z= L (8.26) 


F 
Thus, combining (8.24) and (8.26), we obtain 

g=C,nr+C, (8.27) 
Following the procedure for the 3-D Green’s function for the infinite domain, we 
take an area integration of a circle containing the source point. That is, 


[[vsin-[ os 
z eae Inr+C, dO (8.28) 


=27rC, =1 
We have set Cz = 0 in (8.8) for simplicity. Thus, we have the fundamental solution 
as 


g= sh (8.29) 


where r # 0. This is the two-dimensional Green’s function for the Laplace equation. 
Again, in electrostatic literatures it is sometimes given as: 
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10. Ou. 1 uy 


sae 4: =-d(r—- 8.30 

u= a ~ oe (r-1H) (8.30) 
1 1.1 

= Inr=—1 8.31 

Bg ae a (8.31) 


For higher dimensions (i.e., m > 2), the infinite Green’s function can be 
expressed as (Zachmanoglou and Thoe, 1986): 
_ 1 1 
" In(3—m)! r™? 
If we ignore the constant term, this agrees with the result given in Zachmanoglou- 


Thoe (1986). For m = 3, we recover the three-dimensional space Green’s function 
given in (8.20). 


(8.32) 


8.4. GREEN’S IDENTITIES 


Green’s identities were derived in 1828 by George Green, in his attempt to provide 
a general mathematical theory for solving electricity and magnetism problems. It is 
amazing that Green derived this result and published it as a book at his own 
expense before he received any formal education. To be exact, he had only 
attended one year of primary school at the age of nine before this discovery. Green 
is probably the best self-learner in the history of mathematics and physics. Let us 
start with the divergence theorem of Gauss of the ae function: 


NN & + * \in- Jf. [P = +02 4RZ as (8.33) 


where P, Q, and R are some arbitrary functions of space within the body of Q with 
boundary /“as shown in Figure 8.2. The unit normal is denoted as n. 

We now assume particular forms of P, Q, and R, which are expressed in terms 
of two functions u and v: 


Poy, O25. Rag (8.34) 
Ox ov 


Oz 
Differentiation of (8.34) gives 
OP _ Ou dv | ay 


8.35 
Ox Oxo ae ( ) 
2 
ee ag (8.36) 
dy dye ay 
2 
OR _ ou Lae v (8.37) 


Oz Oz & 2° 
Substitution of (8.35) to (8.37) into (8.33) gives 


[lJ uv e= ,S 7 | ae | ot a0 | [fu Sas (8.38) 


This is called Green’s first identity. 
Now, we can redefine P, QO, and R in (8.34) by reversing wu and v: 
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pa. Gas, nap (8.39) 
Ox oy Oz 
dn 
ds 
F 


Figure 8.2 A body with domain Q and boundary I showing the unit normal 


Following the same procedure, we have 


[IJ ev7e= {a2 ! - ° ! _ Sdn ! Ihe et dS (8.40) 


Taking the difference of (8.38) and (8.40) gives 
Ne (uV?v—vV"u)dQ = II. uv Shas (8.41) 


This is Green’s second identity, also known as aw S — This equation 
relates u, v, V7u, and V7v inside the body to u, v, Ou/On, and Ov/On on the surface of 
the body. This Green’s identity is physically related to the uniqueness of the 
solution to the Laplace equation, the conservation of mass, and the Maxwell- 
Rayleigh reciprocity law. It lays the mathematical foundation for Green’s function 
method and the boundary integral method. Therefore, it is a very important 
mathematical theorem. Note that the differential operators on the left hand side of 
(8.41) are Laplacian. In fact, a similar identity can also be formulated for a more 
general differential equation. This will be done in later sections. 
We now consider a special case that 


ee ! (8.42) 


| 2 2 2 

: (x-X9)" +(Y- Yo) + (2-2) 

where the observation point Mo (xo, yo, Zo) and the distance r are shown in Figure 
8.3. Note from the last section that we have actually set v as the three-dimensional 


Green’s function for the Laplace equation. This is the idea behind Green’s function 
method. Substitution of (8.42) into (8.41) gives 


Moo. [uV? © = “Vv*ujda = Voce [u <-* a < Shas (8.43) 
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A small sphere of radius ¢ around point Mo has been excluded from the integration 
whereas an extra surface integral of the sphere is added to the surface integral. Note 
that once the point Mo has been excluded in the integration, we have 


v2) =0, V-u=0 (8.44) 
r 
everywhere within . oe Q-Q:. Therefore, (8.43) is reduced to 


1 Ou 0 1, lou 
—(-)--—]ds dS =0 8.45 

W, lu on <0) r an! Ma > On! ( ) 
On the spherical surface Iz, we have 


a a 


7 8.46 
an *? ar“ re ¢? — 

Therefore, we can simplify a surface integral as 
MM. une “ds =— J udS = Ania (8.47) 

r, 


where the superimposed bar indicates the average value of u on the spherical 
surface I’;. On the other hand, on the spherical surface we have 


fl. <= ad --{l. ot as = Ané @ (8.48) 


Back substitution of these results into (8. 45) ce to 


1 
I]. [u << ~)-- Sus + 4a - Ane =0 (8.49) 
in 
We now take the limit ¢ > a 
lim 4zu = 4zu(M,) (8.50) 
é>0 
lim Ae =0 (8.51) 
630 On 
In view of these results, (8.49) can be rewritten as: 
1 Oo 1, lou 
u(M,) = {I dS 8.52 
(Mo) 4n aD? r an Se 


This is Green’s third identity. In potential theory, the solution in (8.52) can be 
considered as the summation of two potentials on the boundary. In particular, the 
solution due to the so-called double layer potential is defined as: 


1 Oo) 
u(My) = — real leas (8.53) 


where pt is the double layer potential (given value of wu on the surface). 
Alternatively, the solution due to the single layer potential is defined as: 


u(Mo) = -=|f Eos (8.54) 


where o is the single layer potential given on the boundary (as the normal 
derivative of w). The methods of single- and double-layer potentials are apparently 
due to Gustave Robin in his PhD thesis on potential theory. We will mention Robin 
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again in Chapter 9 when we discuss the Robin problem in potential theory. For the 
case that u is not a harmonic function, (8.52) can be modified as 


4 0.1. 1a 1 i 
u(My) [eso ds real amen (8.55) 


4n 


(x, y,Z) 
M (x9, ¥o>2Z0) 


Figure 8.3 The integration domain for Green’s third identity 


From (8.55), if we know u and Ai/c on the boundary, we can find the value 
of u anywhere within the body. However, in reality, we could not impose both of 
them on the same surface. This is similar to the fact that you cannot independently 
impose both moment and rotation at a hinged support of a beam at the same time. 
Once moment is applied, the beam will rotate according, depending on the bending 
stiffness of the beam, and vice versa. 

The following example shows the reason why (8.53) and (8.54) are called 
double and single layer potentials (which was proposed by Robin) and what their 
relation is. 


Example 8.1 It is given that a harmonic function for a single layer potential o can 
be evaluated by the following surface integral 


1 1 
u(My)=—7- II. [olds (8.56) 
where 


V-u=0 (8.57) 
Consider two layers of single potentials distributed on two surfaces, which are 
separated by a distance h. One has strength o whereas the other has strength —o, 
such that the limit of A — 0 and o > o are taken such that oh > wu, which is 
uniform on the whole surface. Show that the solution is given by 


1 él 
u(Mo)=—3—|[ lu<C lds (8.58) 
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Solution: The two single layers are the source and image as shown in Figure 8.4, 
and they can be related as: 


o(S)dS(¢) =—o(S')dS(¢') (8.59) 


Thus, the solution for the two single layers can be superimposed as 


1 1 1 1 
(My) =-F= | eS) Fe If eye MASE 
1 1 1 
“Flees 5 OO (8.60) 


1 1 
——— A(E,E' dS 
el ee es ¢') ra x) enh i 


We consider the limit that h + 0 and o > is taken such that oh > wu. Note that 
the bracket term can be replaced by 


1 1 1 a) 1 
= 8.61 
en TEs ess Bn) r(x) sae 

Substitution of (8.61) into (8.60) gives 
9 dS(é) (8.62) 


1 
u(My)= - [vox on(g) r(S,x) 


This is exactly the integral equation for double layer potentials given in (8.58), and 
ut is known as the surface density or dipole of the source and its image. 


Dipole layer/double layer 


= = lim [oh] 


— 0 
~n ek image h © h>00>0 
©. 


source ~“@._ 


T(é)° 


Figure 8.4 Double layer potential as a superposition of two single layer potentials 


8.5 BOUNDARY INTEGRAL FOR HARMONIC FUNCTIONS 


Green’s third identity given in (8.52) is only valid when the observation point Mo is 
not on the surface. If the observation point is on the boundary, special 
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consideration needs to be made. Figure 8.5 illustrates the limiting process of an 
interior point Mo moving in the direction of unit normal n to the boundary J? First, 
the surface is divided into two parts, one is the total surface excluding a circular 
disk containing the boundary singular point, and the other one is the circular disk 
containing the singular point. In view of this, the volume integral remains zero 
because and both w and 1/r are harmonic functions, and the singularity point is not 
inside the domain. Only the surface integral given in (8.52) needs to be evaluated 
and can be rewritten as 


la alle 
tim (lf Ey es + ff es [u _- <_1dS} = 0 (8.63) 


Note that the circular disk is tangent to the surface ae there is a unique tangent in 
the process of approaching the boundary. The first integral on the right of (8.63) 
over the domain /—/; is analytic and regular, and thus 


finn ff. ei es if “= (8.64) 
e>o0JdJr-r, ares > r On an 7? r On 
The second integral on the left of (8.63) can be evaluated as 


tim ff i? ey = Sas = tim ff uSCoas~ lim ff ¢ + Sas (8.65) 


é>0 é>0 


an - r On é>0 
The second feel on the right of (8.65) can be determined as 


Figure 8.5 The observation point My approaching the boundary 


lim ff Le ta =e dS = tim ne? = tim ne! =0 (8.66) 
e>09JT, r On 6206 9dr, On Ee 0 € On «>0 On 
The last part of (8.66) is zero provided that the double layer potentials prescribed 


on the surface are bounded. 
The first integral on the right of (8.65) can be determined as 


01 : 01 61 
li —(—)dS =1 M)-u(M,,)|—(-)d M,)—(—)dsS 
tim ff was = time [f tu( Mt) —u(M, Cds + ff a.) as} 

(8.67) 
The second integral on the right hand side of (8.67) can be evaluated as 
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ol . é€ 1 Or 
= Las = iat 2 Gd 
I, = lim | J, MMod=- Cds u(M.,) lim I, 3 5,2dp (8.68) 


Referring to Figure 8.5, we find that 
or A 


=sina =— (8.69) 
On r 
In addition, we apply the following change of variables from p to r: 
p=Nvr-/* , or pdp=rdr (8.70) 


The integral in (8.68) becomes 


‘Wind aomar ina — ea 
esl pea ee fe 
However, we can take the limit 4 — 0 (Mo approaching the boundary 7) much 
faster than ¢ — 0, such that 


lim é =0 (8.72) 
éeo0 € 
Finally, we get 
I, =-2nu(M,) (8.73) 


By following a similar procedure, the first integral on the right hand side of (8.67) 
can be evaluated as 
f= tim ff fu) —ue, as 
é>0ddT, On r (8 74) 
=-27 lim A | [w(M)—u(M,)] dr 
é>0 JZ r 

Now, we impose the Holder condition on the single layer potential (e.g., 
Muskhelishvili, 1975): 

\u(M)—u(M,)| < Ar? (8.75) 


where 4 and 0 < 8 < 1 are positive constants. Substitution of the equality of (8.75) 
into (8.74) gives 


VPre 
I, =-27Alima ro dr 
éE>0 a 
a tim A{(e? + A?)P 9? — 49} 
ey: (8.76) 
2A bg 
lim (— 4 Paes A ep- D2 (a “ys 4 
“pete 
=0 


as 4 — 0 (Moapproaching the boundary) and ¢ > 0 with A/e > 0. 
Substitution of these results into (8.63) gives the following boundary integral 

equation 

} a aay 


uM) == ff [um <()- (8.77) 


Green’s Function Method 505 


where My is on the boundary and M is the source point for the surface integral. 
Note that (8.77) and (8.52) can be combined to give a single integral equation. 


1 ool. 10u 
M,)= ds 8.78 
ayu(M,)=—>— [| uw )—— (8.78) 
where 
1 M,inQ 
a, = 8.79 
2 . M, onT ( ) 


As discussed by Kellogg (1929), as long as the double layer potential is continuous 
and the single layer potential satisfies the Holder condition, the boundary value of 
u can be evaluated from a boundary integral. 

The boundary integral shown in (8.78) also forms the basis of the boundary 
element method (Brebbia et al., 1983). 


8.6 GREEN’S FUNCTION METHOD FOR LAPLACE EQUATION 


In the Green theorem given in (8.41), if both wu and v are harmonic functions, we 
have 


If. wy Shas = 0 (8.80) 
Adding (8.80) to (8.52), we obtain 
1 1 Ou 
u(M,) = Sj NE vy bds (8.81) 
on = Tia. 4rryy, On 
If we impose the following boundary condition when we determine v: 
1 
v| = ; (8.82) 
7 Arryn, f 
we can simplify (8.81) as 
0 1 
M ds 8.83 
uM) =-|fu Ge) (8.83) 
Let us now define the function inside the bracket as 
G(M,M,)= -v (8.84) 
4aryn, 
Then, 
u(M,) = ff vas (8.85) 


This function G is called Green’s function. 
On the other hand, if the boundary condition is given such that 
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oi 
T 


Ov 
rp ON Aatryy, 


on 


(8.86) 


for this boundary condition, (8.81) can be simplified to 


Ou 
u(My) = eZ} dS (8.87) 


For this example, we can see that Green’s function appears naturally and the 
structural form of Green’s function depends only on the differential equation. 

Note that this structural form of the two-part Green’s function in (8.84) was 
mentioned in (8.10) in Section 8.3. Physically, these two parts of Green’s function 
appears naturally. For example, for the case of the electrostatic problem for a finite 
domain V, the fundamental problem can be recast as: 


V’G= -— 5%) (8.88) 
0 


where & is the permittivity in a vacuum as discussed in (2.103) in Chapter 2. If this 
problem is recast into an infinite space: 


V°G =-—[5(r-m) +@(D)5r] (8.89) 
0 


where or is the delta function on the boundary of domain V (i.e., only nonzero on 
the boundary), the second term on the right of (8.89) is physically the felt electric 
potential on the boundary induced by the point source at location r = ro (i.e., 
induced by the first term on the right of (8.89) at the boundary of V). Thus, Green’s 
function can be recast into two parts: 


G=G)+G, (8.90) 
where these two parts satisfy 
1 
V’Gy) =-—(r-n) (8.91) 
ri) 
1 
VG, =-—@() 6, (8.92) 
€ 


Physically, the first part Go is Green’s function in an infinite space (or called the 

fundamental solution) and the second part of Green’s function in this case is the 

electric potential induced by the felt potential on the finite boundary of domain V. 
Thus, Green’s function for the finite body is 


G=- : Inr+G, (8.93) 
279 


The exact form of G; depends on the geometry of the boundary as well as the 
boundary conditions. 


8.7 GENERAL GREEN’S THEOREM FOR SECOND ORDER PDE 


For the hyperbolic equation, the following generalized Green’s theorem was 
derived by Riemann in 1860 in two-dimensional space. This work by Riemann was 
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not known until du Bois-Reymond noticed it in 1864 and was subsequently 
publicized by Darboux. Actually, similar results were also obtained by Kirchhoff in 
1895 and Volterra. In particular, the following hyperbolic differential operator is 


considered: 
2 
ii i ee, (8.94) 
OxOy ox oy 


The adjoint differential operator of F' is derived in Section 7.5 as 


av a é 
G(v) (Av) (By) +Cv (8.95) 
Oxdy Ox oy 
From the result of Section 7.5, we have the following identity 
vF (u) —uG(v) = = + & (8.96) 
ox Oy 
where 
pe (8.97) 
2 ay oy 
0-50 ou) + Buy (8.98) 
Integrating (8.96) over the two-dimensional ae we have 
[ore- uG(v) dxdy = {eS oe livdy = [ea Odx) (8.99) 


This is generalization of Green’s second ne for differential operators other 
than the Laplacian given in (8.41). 

More generally, this Green’s theorem has been extended to any linear general 
second order differential operator with m variables by Hadamard with the 
following differential operators: 


‘ Ou wo, ou 
F(u)= A, ——— + » B,—-+Cu 8.100 
«) py : Ox; OX;, 2, Ox; ( ) 
The corresponding adjoint operator of F is 
m e m 
G(v) = v Byv)+Cv 8.101 
= 2 aaa! 4D 8») (8.101) 
The starting identity for Green’s theorem estines 
IP, IP. 
vF(u)—uG(v) = OF og, ln (8.102) 
Ox, ve, 
where 
m m m OA. : 
P=) Ay ia uv HB.) (8.103) 
ra Ox, ai Ox, = Ox, 


For this operator, the generalized Green’s theorem becomes 
{I [vF(u)—uG(v)\dV = -{ (aP+mP+..+2,P,)dS (8.104) 


where dV and the directional cosine are defined as: 
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dV = dx,dx,:"" dx,, (8.105) 
1, = cos(n, x;) (8.106) 


The hypersurface of the m-dimensional space is denoted by dS. In addition, 7; is the 
cosine of the angle between the normal to the surface and the x; axis. 


8.8 GREEN’S THEOREM FOR BIHARMONIC OPERATOR 


More generally, Stakgold (1968) showed that the general Green’s theorem can be 
formulated in terms of an n-th order differential equation: 
L(u) =0 (8.107) 
The associated adjoint problem is 
L*(u)=0 (8.108) 
In fact, the adjoint problem is defined by the following generalized Lagrange 
identity: 
vL(u)-—uL*(v)=V-J (u,v) (8.109) 
where vector J is a bilinear function of u and v involving derivatives of up to order 


n—1. The formula given in (8.109) is known as the Lagrange identity. It was 
discussed in Chapter 3 for the case of ODEs. In this case, (8.109) is reduced to 


E(u) —u*(v) = J(u») (8.110) 
X 
The general form of Green’s theorem for (8.109) is 


[[Jezeo-uz* oar = [[ sas (8.111) 


For the biharmonic operator, it can be shown that the adjoint operator is also a 
biharmonic operator (Stakgold, 1968). Equation (5.76) of Stakgold (1968) gives 
the following result for Green’s theorem for the biharmonic operator as 


O 0 Ou Ov 
Viu—uV*vldV = Vv? Vey" Vu 112 
it u—uV "vd [fo cs u aro v i ae (8.112) 


This result was first obtained by Mathieu in 1869 and rederived - ‘Koialovich i in 
1903. 

Green’s theorem in (8.112) can be proved easily by starting from Green’s 
second identity given in (8.41). First, we make the following substitution: 


v=V"¢, u=w (8.113) 
Substitution of (8.109) into Green’s second identity (8.41) gives 


[Jw'e-ver'vnaa= fw we -: 


Secondly, we make the following substitution into en 
v=¢, u=Vy (8.115) 


wv 


v-¢ Mas (8.114) 
nN 


Ny, (V>yWV"p—V"y)dQ = If. vy _g— yas (8.116) 


Adding (8.114) and (8.116), we get 
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2 2 
[[[ oreo 'nan= [fv bevy Fp 2Y_v2g Mas (8.117) 
Q r On On on On 
This is identical to (8.192) if we make the following identifications: 
ywev, ou (8.118) 

This completes the proof of (8.112). 

Now we can use (8.117) to formulate the solution of the biharmonic equation 
in terms of Green’s function. Consider a nonhomogeneous biharmonic equation as: 


Viu= f(x), v=o, onT (8.119) 
To solve this problem, we consider the eee Green’s function defined by 
V1G=6(x—-X); G-L 0, onT (8.120) 
We can now make the following identifications for o and y in (8.117) 
yw<G, geu (8.121) 


Then, we have 


2 2 
{fy (GV4u—uV"G)dQ = il (GO Gg Ot gy Sas 81D) 
Q r On On On On 
Substitution of the results from (8.119) and (8.120) into (8.122), we have all the 
boundary terms on the right of (8.122) zero and obtain 


u(x)= [ff og. sda (8.123) 


where G(é, x) is the response at point & due to a force term applied at x. This seems 
strange and confusing. For the present case of the biharmonic operator, it is self- 
adjoint and we have the reciprocity of 
G(€, x) = G(x, &) (8.124) 

More discussion on this will be given in a later section. 

If the boundary conditions are nonhomogeneous, the problem cannot be 
solved by using (8.123). This scenario is considered next using the boundary 
integral equation approach. 


8.9 INTEGRAL EQUATION FOR BIHARMONIC PROBLEMS 


The boundary integral equation for the two-dimensional biharmonic equation was 
derived by Christiansen and Hougaard in 1978 for the case of a circular domain. 
The formulation was extended to the general domain by Fuglede (1981). Weber et 
al. (2012) gave a slightly different derivation of Fuglede’s result. In particular, 
Fuglede considered the following biharmonic Dirichlet problem with domain Q 
and boundary S: 

Viu=0 inQ (8.125) 

re) 


u= fo; paar! onT (8.126) 
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Before we formulate the solution in terms of a pair of integral equations, the 
following fundamental functions for biharmonic and harmonic operators are 
defined: 


ViG=-6 inQ (8.127) 
V’g=-6 inQ (8.128) 
For two-dimensional cases, we find 

g= ay a (8.129) 

2x rT 

i oo 
G=—r* In(-) (8.130) 

870 r 


The proof of these expressions will be deferred to a later section. In addition, these 
two fundamental functions can be related by: 


VG=¢-2 5 (8.131) 
20 
To show this, we can substitute (8.130) into ei left eu side of (8.131) as 
1d, 0 
VG= "In 
r Or r or G a = 
1 2 2 
= 2r ins r 8.132 
8ar S(2r ) ( ) 
1 
= ine ae 


2n r 22 
In view of (8.129), the validity of (8.131) is established. 
Now, we recall Green’s theorem from (8.41) with the following substitution u 
= g to give 


{| (gV7v-wW? gd = | EE (8.133) 
Q r On On 


Note that we have rewritten the integral in two-dimensional space. Substitution of 
(8.128) into (8.133) and the use of the integral property of the Dirac delta function 
gives 


apex [I eV7vdQ= fe oy Bas (8.134) 


where a, has been defined in (8.79) and equals 2 when x is on the boundary or 1 
when x is inside the domain Q. In obtaining the first term, we have applied the 
following integral property of the Dirac delta function: 


J] ,o@ mF 4O= f(r) (8.135) 


More discussion of this integral property of the Dirac delta function will be 
discussed in a later section. Note that (8.134) is the first integral equation for the 
biharmonic operator. 
Next, we make the following substitution into Green’s theorem: 
u<eG, ve Vu (8.136) 


For the two-dimensional domain, we obtain 
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MI. (GV‘u—VuV?G)dQ = ls (Go Vu 92, as (8.137) 
nN 
If uw is a biharmonic function (i.e., satisfying (8. ee (8.137) is reduced to 
ff, VuV2GdQ = if (Go Vu 2, as (8.138) 


Substitution of (8.131) into (8. - gives 


If, g(V"u)dQ+— — {I V7udQ = c= uy as (8.139) 


Using (8.134) to rewrite the term on the left of ae (i.e., use uv instead of v 
in (8.134)), we get 
V2 
anucc)+— [f V7udQ =| go age was 0) 
27 40. ron On On On 
The fundamental solution for harmonic operator g can be rewritten in terms of the 
biharmonic fundamental function G by using (8.131) as: 
2 2 
ayucxy+— ff V2udQ =[ (ea ae ais 
27 40 r on on On on (8.141) 


Finally, we can apply the following Gauss theorem to the second term on the left: 
} V-vdQ= | y-ndS (8.142) 
Q r 
Thus, we have 
fl V2udQ = {| V-(Vu)dQ = I (Vu)-ndS = I Ouas (8.143) 
Q Q r ron 


With this result, (8.141) is simplified to 


Vv Vv? 
aru) = fv? Gann SI cl 

On On 
We can now specify (8.144) on te boundary and substitute the given boundary 


condition from (8.126) to get 


-V*u Sas (8.144) 
nN 


I et) =| (Av GHz, OV'G  OVu ya, OCas (8.145) 
2 FE on on on 
We now substitute the following identification for v in (8.134): 
ve Vu (8.146) 
where u is a biharmonic function. Then, (8.134) becomes 
aVrutl eV‘udQ = i. (g Vu 92, s )ds (8.147) 
on 


The second term on the left of (8.147) vanishes identically as a consequence of the 
biharmonic function. Thus, on the boundary we get 


2 6V-u 2 Og 
= V dS 8.148 
u [te a Ue) (8.148) 


2 
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Equations (8.145) and (8.148) provide a pair of integral equations on the boundary 
but the unknown is not wu, but is instead 


2 
en sa (8.149) 
on 
Making the following identifications, we have two unknowns defined as 
2 
V-uev and wen (8.150) 
n 
The pair of coupled systems for v and w becomes 
l Vv? 
= fot) = | Pe af o  eny as (8.151) 
2 r on On 
1 og 
=v=[ (ew-v yas (8.152) 
2 iy On 


where fo, fi, are given on the boundary by (8.126), and g and G are the fundamental 
solutions for the Laplace and biharmonic equations respectively, and have been 
given in (8.129) and (8.130). Numerical evaluation of this pair of integrals is 
discussed in Weber et al. (2012) using the boundary element method in 
applications to smooth data in computer graphics. 

It has been shown by Payne and Weinberger that the nonhomogeneous 
biharmonic Dirichlet problem can be transformed to the homogeneous one 
discussed in this section defined in (8.125) and (8.126). In particular, a 
nonhomogeneous Dirichlet problem can be formulated as: 


Viv=F inQ (8.153) 
v=f, ca onT (8.154) 
On 


We can decompose the unknown v as 
V=u,+Uy (8.155) 


The formulation for uw; and uw2 are Problems 1 and 2: 


Problem 1: 
Viu,=F inQ (8.156) 
u, =0, cL a (8.157) 
On 
Problem 2: 
Vu, =0 inQ (8.158) 
te) 
tof, —2=g onl (8.159) 
on 
Problem | can be solved by using Green’s function method as 
“= =| Fr? InrdS (8.160) 
82 JT 


Problem 2 is the homogeneous problem defined in (8.125) and (8.126). Thus, the 
solution u2 can be obtained by solving the pair of integral equations given in 
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(8.151) and (8.152). Thus, substitution of the solution of uz and the solution in 
(8.160) into (8.155) gives the final solution for the nonhomogeneous problem. 

Thus, the pair of integral equations approach discussed in this section can 
also be applied to the nonhomogeneous biharmonic problem. 


8.10 DIRAC DELTA FUNCTION 


For simplicity, we first recast the definition of a Dirac delta function defined in 
(8.8) and (8.9) for the one-dimensional case: 


O(x)=0 x#0 
(8.161) 

=o x=0 
[. 5(x)dx =1 (8.162) 


This can be considered as a special case of the one given in (8.8) and (8.9). 
Another basic definition of the delta function is the shifting property of the Dirac 
delta function: 


[£8 de = £0) (8.163) 


This Dirac delta function is named in honor of electrical engineer and physicist, 
Paul Dirac, who received the Nobel Prize in physics for his major contribution to 
quantum mechanics. The idea of using the Dirac delta function was actually much 
earlier than its usage in quantum mechanics by Dirac. For example, it can be used 
to prescribe point force in the case of beam bending subject to a concentrated 
force. Physically, in the domain of mechanics, the delta function defined above is a 
point force or called a concentrated point. 

According to our normal understanding in mathematics, there is no function 
that is nonzero only at one point and yet still has a finite integral as defined in 
(8.162). To justify the rigorous meaning and the use of the Dirac delta function, 
mathematicians developed a new theory for the delta function defined above. This 
theory is called the theory of distribution or the so-called generalized theory. The 
main contributors of this new mathematical theory include Bochner in 1932, 
Sobelev in 1936, Schwarz in 1940, Mikusinski in 1948, Temple in 1953, Lighthill 
in 1958, and Zemanian in 1965. It was Schwarz who put this theory on a firm 
foundation, and was awarded the inaugural Fields Medal in 1950 because of his 
contribution to distribution theory. He linked the distribution theory of the Dirac 
delta to the Fourier transform and such analysis was found very fruitful in solving 
partial differential equations. Schwartz’s student Hormander was also awarded the 
Fields Medal in 1962 on related work. Some regard the Fields Medal as equivalent 
to the Noble Prize in physics (although their award criteria are quite different). In 
this sense, the importance of the Dirac delta function and the associated theory of 
distribution should not be overlooked in the development of mathematics. 
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8.11 DISTRIBUTION THEORY 


The Dirac delta function is not a regular smooth continuous function that we are 
familiar with. However, its physical significance in physics and engineering leads 
to mathematicians attempting to rationalize its usage. It is a very powerful 
mathematical tool to model the point charge in electromagnetism or quantum 
mechanics, point mass in gravitational theory, point forces in mechanics, and point 
sources in heat conduction problems. A whole new branch of mathematics called 
distribution theory or theory of generalized function is introduced. The distribution 
theory is rather theoretical for most engineers and scientists. We will only discuss 
the basic concept of distribution theory in this section. The study of the Dirac 
function is also known as the study of generalized functions (in contrast to the 
regular functions that we have learned). A new set of rules on its manipulation are 
defined rigorously for this strangely behaving Dirac delta function. As mentioned 
in the last section, major contributors to its development include Bochner, Sobolev, 
Schwartz, Zemanian, Mikusinski, and Temple. 

In the following subsections, the properties of the Dirac delta function in the 
sense of distribution will be summarized, and the concept of using generalized 
functions (called 6-sequence functions) to model the Dirac delta function is 
discussed. 


8.11.1 Properties of Dirac Delta Function 


According to the theory of distribution, the Dirac delta function is not treated as a 
pointwise function, but instead it is defined in terms of how it operates or integrates 
with other well-behaved functions. The integration given in (8.163) is first 
rewritten as: 


[Pedder =< f(9),5(8) >= FO) (8.164) 


where integration is replaced by < , > and is called functional. The function f(x) is 
now called a testing function. The function f(x) needs to be infinitely differentiable 
and converge to zero at too. We will replace the Dirac function by a distribution 
(x) such that 


< f(0),0) >= [PO ax (8.165) 


According to the rules of integration, we have the following properties in the sense of 
distribution: 
(i) 
<¢,af+Bg>=a<¢,f>+P<¢,g> (8.166) 
where a and B are constants. For a regular function A(x), we have 
(ii) 
<h¢, f >=< ¢,hf > (8.167) 
(iii) 
< @(x—a), f(x) >=< G(x), f(x+a) > (8.168) 
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(iv) 

< ax), f(x) =p HOLD > (8.169) 
(v) 

< b\(x), f(x) >=— < A(x), f(x) > (8.170) 
Properties (i11) and (iv) can be proved by change of variables. Note that Property 
(v) can be proved by integration by parts. The proofs of these identities will be left 
as an exercise for readers. 

Some amazing properties of the delta function can be established by using 

these integral definitions of distribution theory by identifying the distribution ¢ as 
Dirac delta function 6. 


Example 8.2 Find the value of the following function in the sense of distributions: 
h(x)d(x) (8.171) 
where A(x) is a regular function. 


Solution: Let us consider the functional of this function with a testing function f(x) 
as: 

<ho, f >=< 6,hf >=h(0) f(0) (8.172) 
The last result in (8.172) is a result of applying (8.167) and (8.164). On the other 
hand, we observe that 


h(0)f(0) =h(0) < 6, f > 
=<h(0)6, f > 
By comparing (8.172) and (8.173), we obtain the following amazing result in the 
sense of distribution 


(8.173) 


h(x)d(x) =h(0)d(x) (8.174) 

Note that this result is true for all testing functions f(x). 
Let us look at the special case of h(x) =x; we have in the sense of distribution 
x6(x) =06(x) =0 (8.175) 


This is an amazing result that comes from distribution theory. 


Using Property (1i1) given in (8.168), we have 
‘ O(x—a)f (x)dx =< 6(x—-a), f(x) >=< 6(x), f(xt+a) >= f(a) (8.176) 


This is the shifting property of the delta function that we reported in (8.163). Using 
Property (iv) given in (8.169) 
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[aan fedr=< dar), f0) >= 4 <0, SAB >= F/O) 
: jj Og 
“a 5(x), f(a) > (8.177) 
a 


1 
=< 6(x), f(x) > 
i 
Thus, we have the scaling property of the Dirac delta function in the sense of 
distribution 
1 
a 


0(ax) =— 6(x) (8.178) 


Using Property (v) given in (8.170), we have in the sense of distribution 
[SF pede =< 5), $0) >=-<8),f'0)>=-/'O) 6.179) 
-o dx 
This expression can be generalized to a higher order derivative of the Dirac delta 


© d"(x te 
[SP rende=y" sO) (8.180) 
2 dx 
Let us define another function called the Heaviside step function as: 
A(x)=1 x>0 
=0 x<0 
This step function is illustrated in Figure 8.6. Using distribution theory, we can 
establish a relation between this Heaviside delta function and the Dirac delta 
function. 
We start with the functional of the Heaviside step function and a testing 
function 


(8.181) 


} © H(x)f (ode =< H, f >= } . f(xdx (8.182) 


The functional of the derivative of the Heaviside step function is 


A(x-a) 


a 


Figure 8.6 Heaviside step function 
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Ir BED Fes flak Ff Sse [ ” f(xadx 

-0 dx 0 (8.183) 
=-f(|p = fO =< 6, f > 

Comparing the first and the last terms of (8.183), we find the following identity in 

the sense of distribution 


GTB (8.184) 
dx 
Inversely, we can rewrite it as 
H(a)=[_ oe (8.185) 


Using ordinary analysis, the derivative of a step function does not exist, but using 
the theory of distribution, its derivative is defined in (8.184). This property actually 
allows us to define the derivative of a discontinuous function by using Dirac delta 
function. 

In particular, considering a piecewise continuous function containing a 
number of jumps, we can denote this discontinuous function f(x) as depicted in 
Figure 8.7. We can define a continuous and piecewise differentiable function g(x) 
in terms of f(x) and a series of Heaviside x“ functions: 


g(x) = f(x)- Says, H(x-a;) (8.186) 
j=l 
where the jump is defined as 


Af, = f(a;)-f@) (8.187) 


The function g is actually the dotted curve shown in Figure 8.7. We have assumed 
in (8.186) that there are & discontinuous points. By ignoring these jumps at a, 
2, ... dn, We define the derivatives of function fas 


a 
1 ay a3 x 


Figure 8.7 A discontinuous function in terms of Heaviside step function 


Chk Fee) (8.188) 


The function g(x) defined in (8.186) is a function whose derivative g’ coincides 
with its distributional derivative. Note, however, that [/’] is in general different 
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from the distributional derivative of function f Let us now consider the 
differentiation of g in the sense of distributions as: 


k 
g(x) =f'(x)- A F5(x-4;) (8.189) 
j=l 
But, since g is continuous, we must have a= [f] and thus from (8.189) we have 
f'=[f'@] Danae a;) (8.190) 


Thus, each jump in the fmultiplying ih ue contributes to the distributional 
derivative of fas shown in (8.190). 

We can also find a higher derivative of f with a similar procedure, and we 
have the distributional derivatives as 


k 
fP@O=[/]+ VAs5@) (8.191) 
j=l 
k 
PO=[PO]+ DIA F}5(2) + A FS) (8.192) 
j=l 


(0) 


f™@= [Ameo Das f, A(x) +t Af, SY (X)] (8.193) 


Our discussion of the theory of eee stops here. 


8.11.2 Delta Function as Sequence of Functions 


There is no function in a regular sense that satisfies the definition of (8.164) to 
(8.170). However, there are perfectly regular functions which get as close to the 
Dirac delta function as one like. Thus, we can define the Dirac delta function as a 
sequence of these functions. Let us consider a sequence of infinitely differentiable 
functions defined as /;, 


lim [- f,.@)8@dr = 80) (8.194) 


The Dirac delta function is considered as the limit of a sequence of functions. 
These functions are normally referred to as generalized functions. Using the sense 
of distribution, we define 


[pe ede = lim J" fear = 2) (8.195) 


However, this definition is not equal to pointwise equivalence. In other words, we 
do not assume that the sequence function is equal to the delta function directly. 
That is, 
O(x) # lim f,, (x) (8.196) 
no 


Instead, we need the integration, or in a distributional sense, to enforce the 
closeness of this sequence function to the delta function (see (8.195)). 


Green’s Function Method 519 
Here are some examples of sequence functions: 


(i) First 6-sequence 


fr Q) = ae : (8.197) 


m 1+n?x 
This function is illustrated in Figure 8.8, and, as expected, it approaches the shape 
of the Dirac delta function as n increases. sai of (8.197) into (8.195) gives 
lim of f(xg(a)dx = lim * s@) = dx 
© 147 x? 
-jiml[” &¢/ 4 
non fT d—«o J+ es 
— 0] dt (8.198) 
1+ 


120 


dg 


2~—Itan '(o) - tan” '(-00)] 


= 4(0) 
where we have applied a change of variables of € = nx. However, the calculation in 
(8.198) is not strictly valid as € + 0. This is because we do not have the behavior 
of a delta function, although it decays to zero as € > . To fix the non-uniformity 
problem, we can integrate the left side of (8.198) as: 
n { g@) = dx = n g() = ds 4 7 g(x)- g(0) dx (8.199) 
nr d- 


o]+n°x Mo] +4+n°x m0 l+n?x 


-0.1 -0.05 0 0.05 0.1 
Figure 8.8 The first 5-sequence 


The first term on the right of (8.199) has been evaluated in the second part of 
(8.198). Thus, we have 


520 Theory of Differential Equations in Engineering and Mechanics 


Bf £O) aen gst [? SOO 


ol+n -o lin 
_ n € @(x)- g(0) 5 & g(x)—g(0) 
g(0)+ al, nat ls pee (8.200) 
+f" g(x)-g(0) g(0) E310) 75 
l+n?x? 


=g(0)+/_,+/, +1, 
The integrals for the tails that approach negative and positive infinity are denoted 
as [-» and Iw respectively. If the value of g(x)—g(0) is bounded such that 
|g(x)-g(0)|<_M, (8.201) 
then, we can establish the inequality 
os 1 


T,=1, <M— ha 


elt a’x (8.202) 
are pera 
a 


Let us consider the limiting case that 


é7>0, now, nero (8.203) 
Note that this can be achieved easily by setting 
1 
é=— (8.204) 


This is clearly not the only choice for ¢ that satisfies (8.203). Thus, both /-~ and Jo 
approach zero as long as g(x) is bounded near x = 0. Finally, the integral around the 
origin can be evaluated eee the mean value theorem: 


aoe * g(x)- oe as [e(é) coef” = 
me-e l|+n m¢-el+n°x (8.205) 


dx 


= [e(€)—@(0)]—tan(ne) 


where —e < &<e. Since ne > ~, and s— 0, we have /: > 0. Therefore, we have 


tim 2 [° $@— ae= g(0) (8.206) 
noo 74-0] 4+n°x 
In addition, if we set g(x) = 1, we see that the requirement of (8.162) is also 
fulfilled: 
lim 2 Ecsta ody an 1 (8.207) 
1 


n> 7T -0]+n7x2 


(11) Second 6-sequence 


fn (x) = mo (8.208) 
a 
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This is a bell-shaped function and is shown in Figure 8.9. We can see that this 
function approaches the Dirac delta function much faster than that given in (8.197). 
We will skip the details of proving (8.162) and (8.163). 


-0.05 -0.03 -0.01 0.01 0.03 0.05 


Figure 8.9 The second 5-sequence 


(iii) Third 5-sequence 


- 2 
(oe (8.209) 
ni 


Xx 
This 5-sequence is shown in Figure 8.9 for different values of 7. 


-0.05 0 0.05 


Figure 8.10 The third 5-sequence 
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(iv) Fourth 5-sequence 
sin nx 


f,(%) = (8.210) 


So far, all sequences discussed are saar aike This function is illustrated in 
Figure 8.11 for various values of n, and, clearly, this sequence converges much 
slower than the first 6-sequence. 

As shown in Figure 8.11, this is a weak 6-sequence for the Dirac delta 
function, but this sequence is very important since it is related to the Fourier 
transform of the Dirac delta function. The Fourier transform is defined as (see 
Section 11.2.2 in Chapter 11) 


f(i= [. fae dx (8.211) 
#52 } ” Pel dk (8.212) 
V2n J 
We now formally substitute the Dirac delta function into f That is, 
f(x) = d(x-2) (8.213) 
By virtue of (8.211) and (8.212), we have 
b(k,é)= [. 6(x—£)e™ dx = me (8.214) 


Back substitution of this into (8.212), we have 


il oo 
5(x-6)-——[ — 
V2 /-~ /27 
= —{- chk 
27 J-« 
For the case of € = 0, we obtain an integral representation (or more precisely the 
Fourier transform) of the Dirac delta function 


es ei dk 
(8.215) 


6(x) = me el dk (8.216) 
271 I-00 
Finally, we can rewrite — as 
O(x) = lim — ae dk= = lim — (cosh +isin kx) dk 
R>0 27 R>0 27 
1 a. met) 
= lim Saat = lim 
Row 27x Roo 7x 


If we take R as a large integer, we obtain the second 6-sequence given in (8.210). 
This d-sequence actually gives the formal proof of the Fourier transform. In 
particular, if we multiply (8.213) by f(é) and integrate with respect to € from minus 
infinity to plus infinity, we have 


f(x)= i 5(x-E)f (dé = —|. [- eC) ¢(\dkdE (8.218) 


We can reverse the order of integration and rewrite the last integral in (8.209) as 
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_ ~ike _ 1 (? ite A 
f(x)= fG)dg\ak-=— Ie Fi(k)dk (8.219) 


1 [ ikx 1 [ 
ar ——s e — é 
V2 J-~ V2n J 


Figure 8.11 The fourth 5-sequence 


This actually gives the formal representation of the Fourier transform. Thus, an 
appropriate form of the 5-sequence within the framework of the distribution theory 
can be used to derive the Fourier transform. In principle, other transforms can be 
recovered if proper 5-sequence representation is used. 

In the next paragraph, we will see that Fourier series expansion can also be 
linked to the Dirac delta function if an appropriate form of 5-sequence is assumed. 


(iv) Fifth 5-sequence 


(8.220) 


=0 |x| > 
The interval for x has been limited to from —z to m such that (8.162) will be 


satisfied. This 6-sequence is essentially the same as that given in the fourth 6- 
sequence. To see this, we can rewrite (8.220) as 


sin(n + aye . 
5(x) = lim[ 2 if ] (8.221) 


ae me 2sin ka 


We can see that for large n, the first square bracket term on the right of (8.221) is 
essentially the same as the fourth 5-sequence given in (8.210). The second square 
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bracket term approaches | as x + 0. Indeed, when we plot (8.221) using the same 
scale as in Figure 8.11, we essentially get the same plot (although the actual values 
are slightly different) and, thus, this plot will not be given. 

Next, we use the following identity 


. 1 
sin(n + —)x fate 
—4-=—) (8.222) 
ot x an = 
7 Sin > k=-n 
The left hand side of (8.222) was known as the Dirichlet kernel having a period of 
2n. To show the validity of (8.222), we first write the summation on the right as 


n 0 n 
> et =-1+ Di +e (8.223) 
k=-n k=—-n k=0 

We now apply the following change of index j = —s for the first sum on the right of 


(8.223). Then, we get 
S=> cP =-14+ e+) (8.224) 
k=0 k=0 


k=-n 
More explicitly, we can express it as 
S=-1+fl+e™ +e? 4..4e™ 4 
{ (8.225) 
l+e "+e!" 4+..4e 
It is clear that both of them are geometric series. That is, we can apply the 
following formula for the sum of geometric series 


ey 


a+artar’ +...tar"| = “ (8.226) 
—¥ 
Application of (8.226) to both the series in (8.225) gives 
1- gilnt)x {= ge iets 
s I+ ix af 1x 
I-e l-e (8.227) 
_ cos(n + 1)x —cos nx 
l—cosx 
Note the following trigonometric identities 
cos A—cos B = d sin Come) sin uae (8.228) 
2 2 2 
1-cos24 =2sin? A (8.229) 
Substitution of (8.228) and (8.229) into (8.227) results in 
sin(n + co 
————— (8.230) 


_ Xx 
sin — 
2 


Using (8.230) into (8.225), we finally get (8.222). This completes the proof. 
Application of (8.222) to (8.220) gives another important form of the Dirac 
delta function: 
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5x8) = = >, efits). |x-El<a 


k=—0o 


(8.231) 
=0 |x =¢ | >a 
This indicates that there are infinite Dirac delta functions along the x-axis, and we 


must limit our consideration to | delta function only. We now multiply (8.231) by 
JAS) and integrate from —7 to 7 to get 


w 1 . ikx [7 -i 
fe)=| de-Hfede=s— Dd [f@e“ag, else 232) 


This formula can be recast as: 


1< i 
f= ey Ce™, alsa (8.233) 
where 
C=] se" ae (8.234) 


Therefore, we have shown that Fourier series expansion of an arbitrary function 
fix) can be proved using a special representation of the 5-sequence within the 
framework of the theory of distribution. More 6-sequences are given in Section 
15.2.5 in Chapter 15. 

In short, we have shown Schwartz’s result that the theory of distribution of 
Dirac delta functions can be linked to both the Fourier series and Fourier transform. 
Without going into deep discussion, we mention that in solving PDEs, the Fourier 
series expansion technique is applicable to problems with a finite domain and 
PDEs with eigenvalues in the form of a discrete spectrum (like the buckling load of 
a column and their multiples for higher modes of buckling); whereas the Fourier 
transform technique is applicable to problems with an infinite domain and PDEs 
with eigenvalues in the form of a continuous spectrum (like wave propagation in a 
half-space). More discussions will be given in Chapter 10. 


8.12 GREEN’S FUNCTION METHOD FOR PDE 


Let us now consider Green’s function method for a nonhomogeneous linear ODE 
of second order defined as: 
2 


d“u du 
L(u) =a +a +agu = f(x 8.235 
(u) 2 ae Od (8.235) 
where a <x <b with boundary conditions: 
B (u) = @ ula) + a u'(a) + B,u(b) + Biu'(b) =0 (8.236) 
By (u) = @y\u(a) + @ygu'(a) + By\u(b) + Bogu'(b) =0 (8.237) 


We define Green’s function of the operator ZL with homogeneous boundary 
conditions as the solution of 
L(G) =6(x-€), B(G)=0, B,(G)=0. (8.238) 
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The adjoint Green’s function of the adjoint operator L* with homogeneous 
boundary conditions is the solution of 
L*(H)=6(x-7), (8.239) 
Bi(H)=0, B,(H)=0, (8.240) 
where 


2 k 2 
. a (a,u d*u ; du pos 
Lu) =! A «a, F420} ay) Has a +4, 


k=0 

(8.241) 
By (u) = & u(a) + &u'(a) + Bub) + Byu'(b) = 0 (8.242) 
B3(u) = aya) + yyut'(a) + Byyu(b) + Byyu'(b) = 0 (8.243) 


This adjoint operator has been derived in an earlier chapter. We can multiply 
(8.238) by H(x,n) and (8.241) by G(x,€), subtract, and integrate from x = a tox =b 
to get 


| “[HLG) ~GL*(H)|dx = [He m)5(x—)dx— | : G(x, 2)8(x—n)de 


(8.244) 
= H(E,n)- G7, 6) = GH, 
where a < &, 7 <b and 
J(u,v)| = {a,(vu'—uv')+(a, —a),yuvy? (8.245) 
Thus, in explicit form we get 
A(§,1)— G7, 6) = {ag[H (x, G(x, 6) — G(x, 6) H'(x,7)] (8.246) 


+(a, — 43 )G%, A(X), 
This is the relation between the two Green’s functions of the operator L and adjoint 
operator L*. In obtaining the above result, we have employed the well-known 
Green’s formula given in (8.111). Depending on the boundary conditions of the 
original problem given in (8.236 and (8.237), we normally pick the boundary 
condition of the adjoint problem given in (8.242) and (8.243) such that (8.245) is 
identically zero. The following example demonstrates how to do it. 


Example 8.3 It was given that the boundary conditions in (8.233) and (8.234) are 
B,(u) =u(a) =0 (8.247) 
B,(u) =u'(b) =0 (8.248) 

Find the adjoint boundary conditions in (8.242) and (8.243) such that (8.245) is 


identically zero. What would be the relation between two Green’s functions of the 
operator L and adjoint operator L*? 


Solution: With (8.247) and (8.248), (8.245) becomes 


T(uyv)l = {Lay 0) — 45 ()]r(b) — ay (b)v'(b)}u(b) —a,(a)(au'(a)_ (8.249) 
To set this to zero, we have 
Bi (v) =v(a) =0 (8.250) 
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a,(b)— a3(b) 


By (v) =v(b)-[ |v(b) = 0 (8.251) 
a,(b) 
Thus, we have 
A(S,7) = G(4,¢) (8.252) 
For the special case that the system is self-adjoint, we can further have 
G(o,7) = G(7,5) (8.253) 


The reversibility of the source point and the observation point of Green’s function 
is called the Maxwell-Rayleigh reciprocity law. In mechanics, the self-adjoint PDE 
implies existence of an energy function or energy conserves. The principle of 
virtual work only applies to the case that the corresponding PDE is self-adjoint. 
Most PDEs arising from classical mechanics and physics are for problems with the 
existence of an energy function and potential, and thus, the associated PDEs are 
mostly self-adjoint. For example, if we formulate problems with frictional forces, 
the resulting PDE is expected to be non-self-adjoint. Most of the existing books on 
Green’s function only cover the case of self-adjoint, but without mentioning that 
their results are only valid for self-adjoint PDEs. 


Now we return to seeking the solution of (8.235) using Green’s function method. 
Let us consider a more general problem with nonhomogeneous boundary 
conditions 

Lu)= f(x), Blu)=a, Bu)=f8 (8.254) 
where @ and f are constants. Now multiplying (8.254) by A(x,é) and (8.241) by 
u(x), subtracting and integrating from x = a to x = b, we obtain 


* THL(u) — ul *(H)] dx = " onaDie } "u5(x—E dx (8.255) 


Applying integration by parts or Green’s formula, we obtain 


b 
u(é)= | fO)HC,2)de- J(u HI, (8.256) 
Let us rename the variables; we get 
b 
u(x)=[ fH Eds -JuS). HE), (8.257) 


This is the formula for evaluating the solution of (8.254) using Green’s function 
method. 
Now we can consider the special case that a = £ = 0, and in view of the 
proper choice of adjoint boundary condition (see Example 8.3), we will have 
J(u(x), H(E,x))|, =0 (8.258) 


For this particular case, we also have the validity of (8.252). Thus, for the 
homogeneous boundary condition, (8.257) can be simplified to 


u(x)= J /OGSae (8.259) 


This explains the strange appearance of Green’s function method that the Green’s 
function in the integrand has the source point x and observation point & being 
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reversed whilst on the left hand side, x is the observation point. Only for the special 
case of the self-adjoint PDE, we have in view of (8.253) 


ua=] fOGExae (8.260) 


which is given in most textbooks without mentioning the importance of the adjoint 
operator. 


Example 8.4 Consider the case of a simply supported beam subject to a non- 
uniform distributed load g(x) as depicted in Figure 8.12. The problem is formulated 
as 
rae 
a a (8.261) 
where u is the deflection of the beam, and EF and / are the Young’s modulus and 
moment of inertia of the bean section. The boundary conditions are 
2 
u(a)=u(L)=0, M(0)= eT 
he 


2 
=0, M(L)=EI 7a = 0 (8.262) 
X 


x=0 x=L 


Find the solution in terms of Green’s function method. 


Figure 8.12 Green’s function method for beam bending 


Solution: For the case of a prismatic beam (i.e., beam with uniform cross-section), 
we can consider the beam deflection at point x subject to a unit point force applied 
at €& Mathematically, this is the formulation for Green’s function 
4 
ry gals = P6(x-é) (8.263) 
dx 
As shown in Figure 8.12, we have set the applied force at unity (1.e., P = 1). 
Balancing the unit on both sides of (8.263), we find that the dimension of the Dirac 
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delta function is 1/length. Similarly, one can find that Dirac delta functions in 2-D 
and 3-D are of dimensions 1/length? and 1/length?. By integrating (8.263), we have 
the following results 


=) 
me = H(x-6)+C, (8.264) 
dx 
du 
EI = (x-E)H(x-E) + Ox 4+ Cy (8.265) 
du 1 2 x 
Pl == BGC eG (8.266) 
dx 2 2 
1 3 2. 
3 x x 
Elu=—(2- EP He) 4G E+ T+ Ort Cy (8.267) 


Applying the first two boundary conditions at x = 0 given in (8.262), we have 
C,=C,=0 (8.268) 
Using the boundary conditions at x = L given in (8.262), we get 
3 


s(b-6 +C, Z+GL =0 (8.269) 
(L-£)+C,L=0 (8.270) 
The solutions of these equations are 
¢,-£-02L-6 (8.271) 
6L 
C=- fo (8.272) 


Therefore, Green’s function for deflection is 
G, (58) = u%,$) = Sale) Hx 6) +E? 4 2EL-F x] 6.273) 
Similarly, we can get Green’s function for moment as 
Gy (6) =M (a, 8) = (0-H (x) (8.274) 


Different forms of G, and Gy can be found in Problems 8.2 and 8.3. The beam 
deflection and moment can be expressed in terms of Green’s functions as 


ux) =fG,e.8a@aé (8.275) 


M(x) =f Gy (.8)q(aé (8.276) 


In structural mechanics, Green’s function for the moment given in (8.274) is also 
known as the influence line. 


530 Theory of Differential Equations in Engineering and Mechanics 
8.13 SOME RESULTS ON GREEN’S FUNCTIONS 


In this chapter, we have discussed Green’s function method mainly related to the 
Laplace equation and biharmonic equation. In this section, we will summarize some 
of the other commonly used Green’s functions or fundamental solutions. The proofs 
of some of these Green’s functions are out of the scope of the present chapter. 


8.13.1 Helmholtz Equation 


The Helmholtz equation is known as the reduced wave equation, and it results from 
the harmonic wave equation. 


V°G+k’G =-6(x) (8.277) 
with k? > 0. The fundamental solution for 1-D, 2-D, and 3-D are respectively: 
: ik|x| 
ie 
G(x) = 8.278 
109) oF (8.278) 
i 
G,(x) = qito lx) (8.279) 
elf 
G(x) = ——_ (8.280) 
47|.x| 


where in (8.279) Green’s function is expressed in terms of the Hankel function of 
the first kind of order zero. Note that the fundamental solution is of the wave type. 


8.13.2 Diffusion Equation 


For a diffusion equation subjected an initial pulse, the problem can be formulated as 


i =aVu, ul _,=5(x-4) (8.281) 
The fundamental solution in 1-D is 
u(x,t;6,T) = : 57 expt on y° , t2T 
Anat 4a“ (t—T) (8.282) 
=0 t<T 


Similarly, for the 2-D case we have 


oi Go +O-M, G5 


2 > e 
4a’ (t—T) (8.283) 
=0 t<t 


Physically, the diffusion equation relates to problems of heat conduction, 
consolidation of soils, and diffusion of pollutants. 


1 
u(x, y,0,6,7,T) = e 
Ara’t 
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8.13.3 Wave Equation 


The wave equation subject to an initial velocity pulse is formulated as 
Ou Ou 


202 
—=aVu, ul .=0, — =d(x-x 8.284 
ot lo Gt si ( 0) ( ) 
Physically, it is a velocity pulse. For a one-dimensional wave equation 
1 
u(x,t; é,7) = — x-—El<a(t-—t 
mie 2a | fl ) (8.285) 
=0 |x—€| > a(t—-7) 
For a two-dimensional wave equation 
1 
u(x,t; 6,7) = ——_—————— rs<at 
2nayart —r? (8.286) 
=0 r>at 


where 


r=y(E-9 +Q7-y) (8.287) 
For three-dimensional wave equation 
O(r—at) 


8.288 
Azar ( ) 


u(x, ¥,256,79,0,t) = 


where 


r= \(E-x) +(n-yy +H -2) (8.289) 


8.13.4 Biharmonic Equation 


For a two-dimensional biharmonic equation, Green’s function can be obtained by 
solving the following PDE 


V’V’G =6(x—Xp) (8.290) 
The corresponding 2-D Green’s solution for the biharmonic equation is 
2 Ba 22 
Ge ine = ) in fx? + y? —1} (8.291) 
1 1 
This Green’s function can be obtained by direct integration. Let us define 
V’-G=0 (8.292) 
Thus, (8.290) becomes 
V’® = 5(x— Xp) (8.293) 


This is, however, precisely the Green’s function problem for the Laplace equation and 
its solution for the 2-D case is known 


® = ae (8.294) 
2a 


Rewrite (8.292) in polar form (due to axisymmetry), and we get 
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10, 0 
_—(r—G)=0 8.295 
r Or ( or ) ( ) 
Substituting (8.294) into (8.295), rearranging and integrating, we find 
0G =i 
= 55 |rinrar 
. is (8.296) 
=—[r’ nr-—r’]+a 
4 2 
The last part of (8.296) can easily be obtained by integration by parts. Thus, we have 
OF Mappa age (8.297) 
Or 42 2 r 
Integration of (8.297) gives 
2 
G=—(nr-l)+anr + (8.298) 
1 


The unknown constants can be found from the boundary conditions. If we set a = b = 
0, we have (8.291). The fundamental solution is the singular part of (8.298), and thus 
the fundamental solution Go is normally given as: 


2 
G)=—Inr (8.299) 
870 


For the three-dimensional biharmonic equation, Green’s function can be obtained by 
solving the following PDE 
V’°V°G = 6(x— Xp) (8.300) 
Using the same idea of the 2-D case, we can employ the fundamental solution of the 
3-D Laplace equation as 
1 60, 450G 1 
(r 


=@O= 8.301 
r? Or or ) 4nr ( 
This equation can be readily integrated to get 
Gs-=2 (8.302) 
8m Fr 


Setting a = b = 0, we have the fundamental solution for the 3-D biharmonic 
equation as 
ae (8.303) 
8x 


8.13.5 Multi-Harmonic Equation 


For n-dimensional space, the fundamental function for the multi-harmonic equation is 
obtained from the following equation 


(V7)"G,, =-8(x —Xq) (8.304) 
G,=CQr""lnr, 2m>n (n=even) 


Ean (8.305) 
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where m is an integer. Note that m = 1 corresponds to the harmonic fundamental 
solution whereas m = 2 corresponds to the biharmonic fundamental solution, and so 
on. For physical applications, we are mainly concerned with 2-D and 3-D cases. For 
such cases, the constants in (8.302) can be found explicitly and, more specifically, the 
fundamental function of an m-harmonic equation is 


=o i 5 r"™ Inr, 2m>n (n=2) 
82 2"[(m 1] 


m 


(8.306) 
1 1 2m-3 
= —_—___ (n =3) 
4z (2m—2)! 
The validity of (8.302) can be proved by starting from the fundamental solution of the 
Laplace equation. 


Two-dimensional case 
For the 2-D case, the biharmonic Green’s function is the solution of the following 
problem (i.e., m = 2) 


V‘G, =V’G, =-6(x- Xp) (8.307) 
where 
G, =V’G, (8.308) 
The second part of (8.307) is the Green’s function problem for the Laplace equation, 
and has been solved previously. Thus, we have 


G,= SF in (8.309) 
20 
Back substitution of (8.309) into (8.308) gives 
10, 066 1 
V’G, = 2) =—1 8.310 
2 Joe a a ( ) 


This was solved in the previous section (see Section (8.299)) and the fundamental 
solution is (i.e., only retaining the singular term) 


G; 2 fine (8.311) 
8x 
Following a similar procedure, for m = 3 we have 
V°G, = V'V’G, = V*G, = 5(x- Xp) (8.312) 


The last two terms represent the biharmonic Green’s function and the solution is 
given in (8.311). Thus, the tri-harmonic Green’s function problem becomes 


VG, =G, (8.313) 
More explicitly, it can be expressed as 
ue (r OCs r? Inr (8.314) 


ror Or 82 
Integrating once, we obtain 


3 
a tinge 5 e8 (8.315) 
r 


Integrating one more time, we get 


534 Theory of Differential Equations in Engineering and Mechanics 


1 1 
Gy= gor (nr =) talnr+b (8.316) 


Thus, the fundamental solution is 


rin (8.317) 


3 1282 


By employing the same procedure, it is straightforward to show that the multi- 
harmonic Green’s problem with m = 4 is reduced to solving the following equation: 


be Ge Se = tp (8.318) 
ror or 1287 
The fundamental solution of G4 is found equal to 
C= r°inr 8.319 
+ 46087 a 
By observation, we can generalize the coefficient in the multi-harmonic function as 
= u u r-"2 Inn, 2m>2 (8.320) 


82 2”[(m-1)!P 
This is the formula given in (8.306). 


Three-dimensional case 
For the 3-D case, we can follow a similar procedure. In particular, 


V‘G, =V’G, =-6(x- Xp) (8.321) 


where 

G, =V’G; (8.322) 
The second part of (8.321) is the Green’s function problem for the Laplace equation, 
and it has been solved previously for the three-dimensional case. Recall from (8.20) 
that 

Ca (8.323) 

4ar 

Back substitution of (8.323) into (8.321) gives 


18 20G,,_ 1 


VG, = 8.324 
2? Or ( or 4ar ( ) 
This was solved in the last section and the fundamental solution is given in (8.303) as 
G=— (8.325) 
8x 
Following a similar procedure, for m = 3 we have 
V°G,; =V‘V’G, = V'G, = 6(x- Xp) (8.326) 


The last two terms represent the biharmonic Green’s function and the solution is 
given in (8.325). Thus, the problem for the tri-harmonic Green’s function problem 
becomes 

V’G; =G, (8.327) 
More explicitly, it can be expressed as 


Green’s Function Method 535 


8.328 
r ar | or 870 ( ) 
Integrating once, we obtain 
2 
i eA (8.329) 
or 322 
Integrating one more time, we get 
3 
r a 
G,; =——-— +b 8.330 
> 96n Fr con 
Thus, the fundamental solution is 
ee 
G, =—— 8.331 
> 96 a 


By employing the same procedure, it is straightforward to show that the multi- 
harmonic Green’s problem with m = 4 is reduced to solving the following equation: 


1 8 2 9G, _ r 


8.332 
r ar‘ or” =: 96 : 
Integration on both sides gives G4 as 
5 
ip 
G, = 8.333 
4 28807 oe 
By inspection, we find that 
Lot ams (8.334) 


Gi, = 
42 (2m-—2)! 
This completes the proof of (8.306). 


8.14 SUMMARY AND FURTHER READING 


In this chapter, the powerful technique called Green’s function method was discussed. 
The method was founded by George Green when he considered the solution of 
problems of electricity and magnetism. The method was founded on the validity of 
Green’s first, second, and third identities, which were originally derived for the 
Laplace equation that governs the problems of electricity and magnetism. Green’s 
second identity is also known as Green’s theorem, whereas Green’s third identity is 
actually the formula for Green’s function method. We extend the discussion of 
Green’s theorem to the biharmonic equation, in view of its importance in mechanics. 
The most general Green’s theorem for second order linear PDEs was also derived. 
The evolution of the boundary integral equation from Green’s function method is 
discussed in view of its importance in numerical analysis. The boundary integral 
equation for the biharmonic equation was also considered and it turns out that two 
coupled integral equations need to be solved. In view of the role of the Dirac delta 
function in the derivation of Green’s function, the definition of the Dirac delta 
function and its justification using the theory of distribution is introduced. The Dirac 
delta function approximated by generalized functions called the 6-sequence is 
discussed. It was demonstrated that both Fourier series expansion and the Fourier 
transform were intimately linked to the theory of distribution if appropriate forms of 
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the 5-sequence were chosen. Green’s function method for general second order PDEs, 
which are not self-adjoint, is discussed. Finally, a number of Green’s functions for 
commonly encountered PDEs were summarized. They include Helmholtz, diffusion, 
wave, biharmonic, and multi-harmonic equations. 

The present chapter differs from most of the existing textbooks on differential 
equations in that detail of the Green’s function method for biharmonic equations is 
discussed. The presentation on multi-harmonic equation is also original and cannot be 
found in any textbook on Green’s function method (e.g., Duffy, 2001). 

Green’s function method is one of the most powerful mathematical techniques 
in solving PDEs and a short chapter like this can cover the main concepts and ideas of 
this method. An excellent and elementary introduction to Green’s function method is 
given by Greenberg (1971). The book series by Stakgold (1967, 1968, 1979) also 
provided a comprehensive discussion of Green’s function method and are also highly 
recommended. A handbook on Green’s function is given by Butkovskiy (1982), 
whereas a compilation of exact solutions in terms of Green’s function can be found in 
Polyanin and Zaitsev (2002). More colorful stories written on George Green can be 
found in Cannell and Lord (1993). 


8.15 PROBLEMS 


Problem 8.1 Show the following identity: 


NX, . NX 
cos[(n + 1) —]sin — 
cosx+cos2x+...+.cos nx = 2 2 (8.335) 
sin~ 
2 


Hint: Use the result related to the fifth 6-sequence. 


Problem 8.2 Show that the Green’s function of deflection given in (8.273) for a 
simply supported beam subject to unit point load can be recast into the following 
forms: 


G, (1,6) = pu )-(L-2" -(L-x)] <2e8 


_ (L=x)5 r : 
on ee eee) 


(8.336) 


Problem 8.3 Show that the Green’s function of moment given in (8.274) for a 
simply supported beam subject to unit point load can be recast into the following 
forms: 

(L-¢)x 


7 Osxsé) 


Gy (x,¢) a 
; (8.337) 
(E<x<L) 


Problem 8.4 For the beam bending problem discussed in Example 8.4, show that 
Green’s function of moment is continuous at x = € and there is a unit discontinuity 
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of the derivative of the Gy with respect to x at x = € More specially, show the 
following: 


[Gy 2)]-. =0 (8.338) 
ce 

[s = ai (8.339) 
Ox 5 


Problem 8.5 Discuss the physical meaning of the results obtained in Problem 8.4. 


Hint: What is the physical meaning of the derivative of moment? What changes 
suddenly within the beam across the unit point force? 


Problem 8.6 Consider the case of a fixed end (or built-in support) supported beam 
subject to non-uniform distributed load g(x) as depicted in Figure 8.13. The 
problem is formulated as 

Or coe 
where u is the deflection of the beam, and EF and / are the Young’s modulus and 
moment of inertia of the bean section. The boundary conditions for fixed end 
support are 


tazenee 4) a OE 26 (8.341) 
dx x=0 dx SE. 
Find the Green’s function method of the problem. 
Ans: 
I 3 aL-sy" ,_ b+28 2 
G, (x,€) =——[(x-éy H(x + x4 8.342 
(XS) aaa sy H(x—6) 7 ( 7 g)x°] (8.342) 


Figure 8.13 Green’s function method for built-in beam bending 
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Problem 8.7 Define a possible form of Dirac delta function as: 


0 t<0 
6,(t)=sl/é 0<t<é (8.343) 
0 t>é 
Show that 
[- 6, (x)dx =1 (8.344) 
lim I (x)6, (x)dx = f (0) (8.345) 


Thus, it is a potential candidate for the Dirac delta function. 


Problem 8.8 Derive the following result for G4 for the 2-D case (multi-harmonic 
Green’s function with m = 4): 
so r°inr 

46087 138247 
Thus, the validity of (8.319) can be demonstrated. 


r°+alnr+b (8.346) 


4 


Problem 8.9 Show the following identity using the theory of distribution 
x"d(x) =0 (8.347) 


CHAPTER NINE 
Wave, Diffusion and Potential Equations 


9.1 INTRODUCTION 


Arguably, wave, diffusion and potential equations are the three most fundamental and 
important second order PDEs in engineering, science, and mathematical physics. 
They are the standard topics covered in nearly all textbooks on partial differential 
equations and mathematical physics. They are the PDEs that have been studied most 
extensively since the discovery of calculus. The main contributors in the studies of 
these equations include Bernoulli, Euler, D’Alembert, Huygens, Lagrange, Fourier, 
Laplace, Riemann, Hilbert, Poisson, Cauchy, Hadamard, and Sommerfeld, to name a 
few. In Chapter 7, we have demonstrated that there are three general types of second 
order PDEs, namely hyperbolic, parabolic, and elliptic types. The most commonly 
encountered type of hyperbolic, parabolic, and elliptic PDEs are wave, diffusion and 
Laplace equations. 

The wave phenomenon has long been recognized as one of the most distinct and 
fundamental features observable in nature. When we throw a stone into a calm pond, 
a series of circular waves of water will radiate from the impact. This propagating 
wave seems to suggest that pockets of water particles are moving outward. In fact, 
energy is propagating along these waves, instead of water particles. Water molecules 
are simply oscillating around its equilibrium positions. This is a phenomenon of 
propagating energy. Wave phenomena influencing our daily lives include sound, 
light, other electromagnetic waves, earthquakes, and water waves. The direction of 
such wave signals is associated with characteristics. When a whip is lashed, a one- 
dimensional wave is generated. Similarly, vibrations of violin strings can be modelled 
as one-dimensional waves. Energy propagation in solids is considered as stress waves. 
As shown by Chau (2013), both shear stress waves and compressional stress waves 
result in classic three-dimensional elastodynamic equations. In Chapter 2, we have 
demonstrated that both Maxwell equations for electrodynamics and elastodynamics 
can be converted to nonhomogeneous wave equations. Therefore, the study of wave 
equations is of fundamental importance, especially nonhomogeneous wave equations. 

Heat conduction has been considered since the time of Newton (Newton’s 
cooling law) and Fourier (Fourier law of heat conduction). This phenomenon can be 
modelled as a diffusion equation. Subsidence of ground due to settlement is caused by 
consolidation of clay. When loading is applied to a fully saturated clay, the small 
value of permeability of clay does not allow the water to be squeezed out 
immediately. The pore water pressure will increase to balance this externally applied 
load. This excess pore water pressure (excess compared to the long term value) will, 
however, induce seepage through the clay. It is a very slow process of squeezing 
water out of the soil skeleton and at the same time the decreased pore water pressure 
will be converted to effective stress (portion of the total stress taken up by the soil 
skeleton interactions). This results in the diffusion of water and thus the phenomenon 
is governed by the diffusion equation. 
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Many physical problems are governed by the Laplace equation. These include 
the incompressible potential flow of fluid, electrostatic problems, gravitational 
attraction of bodies, steady state temperature distribution, twisting of elastic bars, and 
membrane deflection (such soap film). The solution of the Laplace equation is also 
known as the harmonic function. It is also known as potential theory as it is related to 
the problems of gravitational potential. For the two-dimensional case, it also closely 
relates to the analytic function in the complex variable technique. 


9.2 WAVE EQUATIONS 


Mathematically, the linear wave phenomenon can be modelled by the following wave 
equation 
2 
av7g= 22 9.1 

ae (9.1) 
where ~ is a wave potential function, V? is the Laplacian operator, and a is the 
wave speed of the propagating energy. For the case of waves in solids, readers can 
refer to Section 9.4 of Chau (2013). For the special case that the function @ is 
independent of time, (9.1) reduces to the Laplace equation. In fact, the resulting 
Laplace equation governs the equilibrium problem of steady distribution of the 
potential subject to certain boundary conditions. Mathematically, as we have 
discussed in Chapter 7, (9.1) is a hyperbolic type of PDE. There exists a pair of 
curves called characteristics, along which there is a discontinuous solution 
propagating. The wave type phenomenon is the easiest to visualize in nature, 
compared to the equilibrium type of elliptic PDE and the diffusion type of 
parabolic PDE discussed in Chapter 7. 

The next section will discuss the simplest case of 1-D waves. 


9.2.1 D’Alembert Solution for 1-D Waves 


The one-dimensional wave equation is one of the first differential equations ever 
formulated in modelling physical phenomena. For the case of a vibrating spring of 
a violin shown in Figure 9.1, the deflection of the spring is governed by 


Ou Ou 
2 
ae oP “ 
where 
ak (9.3) 
p 


The tension in the vibrating spring is 7 and p is the mass per unit length of the 
spring. 
‘Figure 9.2 shows the vertical and horizontal components of the spring tension 
T. Horizontal force equilibrium leads to 
T(x + Ax,t)cos(@ + AO) —T (x,t) cos(@) = 0 (9.4) 
Similarly, vertical force equilibrium leads to the following equation of motion: 
T(x + Ax,t) sin(@ + AO) — T(x, t)sin(@) = pAxu,, (x,t) (9.5) 
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where x is the center of mass of the spring element shown in Figure 9.2. As shown 
in Figure 9.2, we can denote the vertical component of spring tension as V, and 
(9.5) can be simplified as: 


u(x,t) 


Figure 9.1 Vibrating violin string as 1-D wave phenomenon 


V(x + Ax,t) -V(x,0) 


= pu, (%.t) (9.6) 
Ax 
Using the fundamental definition of differentiation, we get 
Vi(x,t) 7 Puy (x, 0) (9.7) 
The vertical force component can be related to the horizontal force component by 
V (x,t) = H(t) tan 6 = H(t)u,.(x,1) (9.8) 


To obtain the last part of (9.8), we have set the slope of the spring equal to the 
derivative of u taken with respect to x. 
Substitution of (9.8) into (9.7) gives 


(Hu,.),, = Puy (9.9) 
Finally, for small deflection (it is normally the case), we have cos@—> | as O—> 0: 
H(t)=Tcos0 ~T (9.10) 


Using (9.10), we can further simplify (9.9) as 
2 Ou Ou 
q a Be 
ox” Ot 
This completes the proof for the one-dimensional wave equation given in (9.2). 


a’ =T/p (9.11) 


Txt) v9. ng 

tT 

f 1 

i 1 

\ 1 
[ 

ae | 

° i Ties? Tt) x a xb At 


Figure 9.2 Force equilibrium in a vibrating spring of violin 
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Example 9.1 Consider the solution of the following finite spring of length L fixed 
at two supports and subject to an initial deflection f(x): 


nw O<x<L, t>0 
u(0,t)=0, u(L,t)=0, +20 (9.12) 
u(x,0) = f(x), u,(x,0)=0, O<x<L 


2 
au,, =u 


Solution: Using separation of variables, we have 


u(x,t) = X(x)T(0) (9.13) 
Substitution of (9.13) into (9.12) gives 
aX"T=XT" (9.14) 
This can be rearranged as 
Xx" = 1 7" 
¥ a a (9.15) 
This leads to two ODEs 
X"+2X =0 (9.16) 
T"+a’AT =0 (9.17) 
Thus, the boundary and initial conditions become 
u,(x,0) = X(x)T'(0) =0, O<x<L => T'(0)=0 (9.18) 
u(0,t) = X(0)T(t) =0, u(L,t)= X(L)T(t) =0, + >0 (9.19) 
The governing equation for X can be summarized as 
X"+AX =0, X(0)=X(L)=0 (9.20) 
The general solution of (9.20) is 
X =¢ cosVAx +c, sin Ax (9.21) 
The boundary condition in (9.20) leads to 
X(0)=0>¢, =0 (9.22) 


Substitution of (9.21) into the second boundary condition of (9.20) yields the 
eigenvalue equation 


sin VAL =0 (9.23) 
The eigenvalues are 
A, =n m1, n=1,2,3,+0 (9.24) 
The quantities 1, =nza /L, for n = 1, 2, ..., are the natural frequencies of the string, 
that is, the frequencies at which the string will freely vibrate. The vibration mode is 
X,, (x) = sin (nz x/L) (9.25) 
The solution of T is 
T(t) =k, cos(nzat/L)+k, sin(nzat/L) (9.26) 
T'(t) =(n7a/L){-k sin(nat/L)+k, cos(nzat/L)} (9.27) 


Boundary condition (9.18) will lead to 
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k, =0 (9.28) 
The fundamental solution becomes 
u,, (x,t) =sin(nax/L)cos(nzat/L), n=1,2,3,--., (9.29) 


The general solution becomes 


utes Dea Cyl, (Xt) = 3 sin(nzx/L)cos(nzat/L) (9.30) 


u(x,0) = f(x) = Ye sin(nzx/L) (9.31) 


Multiplying both sides by the sine aad an and integrating from 0 to L, we have (in 
view of the orthogonal properties of the resulting sine integral given in (7.186)): 


L 
c, == | f(x)sin (nx! L) dx (9.32) 
0 
Thus, the final solution is 


ued = > 2; sin(nz x/L)cos(nzat/L) (9.33) 
n=l 
The constant c, for the sine Fourier expansion of the initial deflected shape f(x) can 
be carried out explicitly once the function is given. This completes the solution of 
the 1-D wave phenomenon of violin string vibrations. 


We now introduce the classical solution proposed by D’Alembert for 1-D wave 
problems. This solution is beautiful and is a triumph in the history of solutions of 
the wave equation. 

Let us rewrite the solution given in Example 9.1 by defining the following 
function: 


h(x) = Dc, sin(nx/ L) (9.34) 
n=l 


Next, we consider the expansion of this function with two special arguments: 


h(x—at) = a [ sin (na x/L)cos(naat/ L)—cos(nx/L)sin(nzat/L)| (9.35) 


n=l 


h(x +at) = ae [ sin(nzx/L)cos(nrat/L) +cos(nzx/L)sin(nzat/L)| (9.36) 
n=l 

These expressions are the results of direct application of the following sum rule for 
the sine function: 

sin(A+ B) =sin AcosB+cos Asin B (9.37) 
Adding (9.35) and (9.36) gives exactly twice (9.33), the solution that we obtained 
by solving the initial boundary value problem of Example 9.1. More precisely, we 
can write 
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u(x,t) = pa sin(nzx/L)cos(nzat/L) 
n= 


=[h(x-at)+h(x+at)|/2 


(9.38) 


Physically the first A function is an outgoing wave or a right going wave and the 
second / is an incoming wave or a left going wave. The argument in the / function 
can be considered as the “characteristics” of the propagating solutions as we 


introduced in Chapter 7. 


Naturally, one may ask whether all solutions of one-dimensional wave 
problems can be written as superposition of two waves. The answer is yes, and this 
general result is D’ Alembert’s seminal result. Let us consider the general 1-D wave 


equation as 
Au Ou 
eo ae 
Ot Ox 
Introduce the following pair of change of variables: 
6=x+at, n=x-at 
Using the chain rule for partial differentiation, we get 
Ou Ou Og | Ou On _ Ou | Ou 
Ox 0€ Ox On Ox O0€ ON 


aud Ou, Qu\dg | 0 (du , du \on 
ax? O€\0E On )ex Adn\aE On) & 
Ou Ou Ou Oru 
= qe +—; 
ax* OE 0é0n On 


Thus, we have 
2 2 2 2 
a? 2h ge P4 ,g2 Ou g? 2H 
Ox 0g 0é0n on 
Ou Oudgé Oudn Ou Ou 
= + =a a 
ot 0& Ot On Ot 0& On 


a= OE GE On )ax an 

2 Ou Ou Ou 

a (a ae 

0g 0g0n On 

Substitution of (9.44) and (9.46) into (9.39) leads to 
O7u 7 
0&0n 

Integration once with respect to n gives 


Ou _ 
oe 1 


Integration one more time with respect to € gives 


0g On 


2 n e 
Oru 6 (S oH FE ag 0 (s |e 
ne 


(9.39) 


(9.40) 


(9.41) 


(9.42) 


(9.43) 


(9.44) 


(9.45) 


(9.46) 


(9.47) 


(9.48) 
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u(xst) = | SOE +A) 
= f\(xt+at)+ f,(x-at) 


In the derivation of (9.49), we have not specified any initial and boundary 
condition, and thus this result is general and can be applied to any initial boundary 
problem of a one-dimensional wave. More importantly, the solution in (9.49) is 
expressed in terms of an arbitrary function of the two arguments (or the two 
characteristics for this hyperbolic equation). 

We will consider the case of an infinite spring subject to certain initial 
conditions using the solution in (9.49). More specifically, we consider an infinite 
spring subject to initial deflection and velocity: 


(9.49) 


U |,0= P(X), < x < +00 (9.50) 
as =y(x),-0 <x < +00 (9.51) 
Ot |r=0 
Adopting the general solution form (9.49), we get 
Ou Ox +at Ox —at 
— (x,t) = f(x +at + f3(x—at =af/—afs 9.52 
ar )= fi +at) a fy(x—at) a if} — of (9.52) 


Applying the initial conditions (9.50) and (9.51) (i.e., at t= 0) to (9.49) and (9.52), 
we obtain two equations: 
Ai) + fo(X) = P(x) (9.53) 
af(x) — afz(x) = v(x) (9.54) 
Integration of (9.54) gives 
1 px 
A) fi) =— | Wag +C (9.55) 


Equations (9.53) and (9.55) provide a system of two equations for two unknown 
functions, and the solutions of it are: 


1 1 px C 
A) => 90) +5- | was +> (9.56) 
1 Ll pe C 
fie) =590)-5- | vOaé-F 0.57) 
Back substitution of these results into (9.49) leads to 
wet=ioexed eoG-ch ie |  y(Od€ (9.58) 
2 2a x—at 


This is the renowned solution by D’Alembert. For the case of zero initial velocity, 
Figure 9.3 illustrates the right going solution or outgoing solution. Because in the 
formulation of the wave equation we have not incorporated any damping term, the 
initial deflected shape will be preserved during the propagation. Similarly, the left 
going solution or incoming solution is illustrated in Figure 9.4. 

More generally, once an infinitely long spring is initially deflected, waves 
will be generated to both left and right as illustrated in Figure 9.5 at three different 
times. In this illustration, the disturbance was imposed at t = fo; as time increases, 
this initial disturbance generated a left and a right going wave. The wave is 
propagating at a speed of a. 
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Figure 9.3 Right going solution of 1-D wave equation 
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Figure 9.4 Left going solution of 1-D wave equation 
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Figure 9.5 Both left and right going solutions of an infinitely long spring 


9.2.2 Domain of Dependence and Influence Zone 


Recall in the general solution of a 1-D wave that there are two characteristics and 
one corresponds to a left-going wave and the other the right-going wave. In the 
time-space, there are certain forbidden regions that the wave solutions cannot travel 


to. This is a consequence of causality. 


Figure 9.6 illustrates the domain of dependence. Consider that at a point P in 
space-time, the wave signals that the point P received can only come from the 
influence of the space-time triangle of ABP. To see the mathematical details of this, 


we consider the following example of the nonhomogeneous wave problem. 
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Au(x,0)=9(x) B ~* 
u,(x,0) =y(x) 


Figure 9.6 Domain of dependence 


Example 9.2 Consider the solution of the following 1-D nonhomogeneous wave 
problem (with external forcing term given) with prescribed initial deflection and 
velocity: 


Ur = 2 Uy = I (x,0), t>0 


(9.59) 
u(x,0) = p(x), u,(x,0) = w(x) 


Solution: The problem can be solved by using superposition. First, we recognize 
that the original problem can be decomposed into two sub-problems: 


Problem I 
Ur “au, =0, 1¢>0 (9.60) 
u(x,0) = p(x), u,(x,0) = w(x) 
Problem II 
Uy — 2 Uy, = f (XE), t>0 (9.61) 


u(x,0) =0, u,(x,0) =0 


Problem I has actually been solved by using the D’Alembert solution discussed in 
the last section as: 


1 1 x+at 
u(x.t) = lox tat) + o(x—anl+ | w(edé (9.62) 
For Problem II, we can use the classification approach in Chapter 7 that 
A=1, B=0, C=-a’ (9.63) 


Thus, we have 
B? -4AC = 4a’? >0 (9.64) 
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This system is of course the hyperbolic or wave type. The two characteristics are 


dx B+ B°-4AC dx _ B~\B’ -4AC 


a, = =-a (9.65) 
dt 2A dt 2A 
The corresponding characteristics can be obtained by integrating the ODE as 
€=x+at=C,, n=x-at=C, (9.66) 
Then, we get 
Sait hig Sly, Slay (9.67) 
ox Ot ox Ot 
Thus, 
ag 2 OT pp OF 5, OF OS 5 9 05 Ot 
Ox Ox Ox ay ox Oy oy Oy (9.68) 
=-4q? 
Therefore the canonical form of the wave equation becomes 
Ou 1 
=-—~ f(é.7) (9.69) 
0&6n = 4a? Moon 


Note that we have used the same symbols for uw and f even though we have changed 
the variables. Integration of (9.69) with respect to n gives 


OH I" Edt +e (9.10) 
6& = da? te 
The lower limit for the integral is obtained by virtue of the fact that at tf = 0, we 
have n = & (see (9.66)). In addition, we note that 
Ou _ Ou Ot ou ox 1 Ou | Ou 
0& oOtd0g Oxdg adt Ox 
At zero time, we have 


(9.71) 


Ou Ou 
,0) =0, 0)=0 9.72 
At (x,0) re (9.72) 


The first is the second initial condition of Problem IJ given in (9.61) and the second 
is a direct consequence of the first initial condition in (9.61) (i.e., the initial data is 
identically zero for all x as is its derivative with respect to x). Thus, from (9.71) we 
must have 


aetE8)=0 (9.73) 

Using this information, we find that (9.70) becomes 
g(g)=0 (9.74) 
Ae -a5] ener (9.75) 


Note that we absorb the negative by reversing the limits of integration. Integration 
of (9.70) one more time with respect to € yields 


_1 {fre mate (9.76) 
4 n len os , 


Note that the integration is for € > n, thus we have 
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HAE <Ss SS SE (9.77) 
The domain of integration can be illustrated in Figure 9.7. The domain of 
integration can be written as: 


1 — 
uF), Seman (9.78) 


where Q, is the triangular domain shown in Figure 9.7. 
We map the variables back to the physical domain as: 


q=X-cl, 2=X+cr (9.79) 
us ss } } f(&D|J|aedr (9.80) 
4a QYy 


where the domain of integration shown in Figure 9.8 can be mapped to another 
triangular domain shown in Figure 9.8. 


Figure 9.7 Domain of integration for (9.76) 


The horizontal line AB in Figure 9.7 can be mapped as 

7=n, => X=x-a(t-Tt) (9.81) 
That is, in the physical domain, AB becomes an inclined line shown in Figure 9.8. 

Similarly, the vertical line BC can be mapped as an inclined line in the physical 


domain: 


&=& => X=x+a(t-t) (9.82) 
Finally, the inclined line AC becomes a horizontal line in Figure 9.8 as: 
f=n, > f=0 (9.83) 
In addition, the Jacobian for the mapping is 
oz oF 
yal OF -| # ate (9.84) 
on On| |l -a 


ox Ot 
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(x+at,0) (x+at,0) 


Figure 9.8 Domain of integration for (9.80) 
Substitution of (9.81) and (9.84) into (9.80) yields 
1 —_— = 
j= } } f(&F ded? (9.85) 
2a 44 Q4 
Finally, we can combine the solutions of Problems I and II as 


u(t) = Slots at) +o(e—at)] + |" (ede 
es (9.86) 
on | I. f(®,P ded? 


This completes the solution. The solution of 3-D nonhomogeneous wave equation 
will be considered in Section9.3. 


The domain of integration found in Figure 9.8 is exactly the domain of dependence 
shown in Figure 9.6. 

The region of influence is shown in Figure 9.9 for the case of initial 
deflection and velocity prescribed at x = xo and t= 0. 

Figure 9.9 shows the space-time event horizon for the one-dimensional case. 
That is, we could not receive any signal from a point source as long as the chosen 
point in space-time is outside the wedging zone in Figure 9.9. The domain of 
dependence and region of influence can be considered as a consequence of 
causality. This is a distinct feature of the wave type solution. Figure 9.10 is the 
cone of the event horizon including both past and future cones. 

In Chapter 15, we will see that this causality of the wave also provides a 
stability criterion on the numerical integration of wave equations (i.e., the Courant- 
Friedrichs-Lewy criterion or CFL criterion in (15.2.7)). 
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9.2.3 Two-Dimensional Waves 


For the case of radially symmetric spherical waves (i.e., waves in unbounded 
solids), the wave equation is 


1 vp ao _ldog 


(9.87) 


aa oe rér 


(9,0) os 
u(x,0) =u,d(x—X,) 
u,(x,0) =u,d(x —X) 


Figure 9.9 Domain of influence for a source point 


Future cone 


Domain of influence 


Past cone 
Domain of dependence 


Figure 9.10 Cone of event horizon (past and future) 
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For the case of radially symmetric spherical waves (i.e., waves in unbounded 
solids), the wave equation is 


1 ep _ Fo, 10g 


9.88 
a a Or or or oo 
Consider a time harmonic wave as 
p=yw(rje (9.89) 
2 2 
i As A (9.90) 


dr’ ordr @ 
This is the Bessel equation of zero order. For the wave phenomenon, it is 
customary to express its solution in Hankel functions instead of the Bessel function 
as 
y/(r) = CH” (kr) +c,H§” (kr) (9.91) 
where the Hankel functions can be expressed in terms of Bessel functions of the 
first and second kinds as (see Section 4.8): 


Hy” (kr) = Jo(kr) + i% (Ar) (9.92) 
Hy (kr) = Jg(kr) 1% (Ar) (9.93) 
rel (9.94) 

a 


See also the discussion related to Table 4.1 in Chapter 4 on the role of Hankel 
function in wave phenomenon. Now, consider the asymptotic form of the Hankel 
functions for r 00 as discussed in Chapter 4: 


H (kr) ~ = exp[i(kr — / 4)] (9.95) 


H (kr) ~ in exp[—i(kr — 2 / 4)] (9.96) 
wkr 
Substitution of (9.94) and (9.95) into (9.90) gives 
2 
y(r)= i expli(kr -)] +c, exp[-i(kr -)}} (9.97) 
awkr 4 4 


The first and second terms on the right reflect contracting wave and expanding 
waves respectively. For the time harmonic solution of (9.87), we have 


o(t,r) =e {aH (kr) +¢,H) (ir)} (9.98) 


This solution can be expressed in terms of integrals by noting the following integral 
representation of Hankel functions as (p.180 of Watson, 1944): 


H (kr) = =|" aacunaide (9.99) 


HO (kr) = -=f" exp {-ikr cosh p}d¢ (9.100) 


Substitution of these results into (9.97) gives 
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(oe) 2 oo) 
P(t,r) =c, =| exp {iat + ikr cosh g}d¢— cy — I, exp {iat —ikr cosh ¢}d¢ (9.101) 
71 0 71 


The general solution of the arbitrary time function can be founded by summing 
different time harmonics as: 


o(t,r)= } 7 [, exptiot + cosh ¢)} f (o)dddo 
es 7 (9.102) 


+ | e } * exp fia(t - cosh ¢)}¢(w)ddda 
—o # () a 


Therefore, this solution suggests that the general solution can be expressed as 
arbitrary functions of the form (Copson, 1975): 


v(t,r) = ie F(t- “cosh dot [ G(t+ “cosh dd (9.103) 


The validity of this solution can be verified by direct substitution of (9.102) into 
(9.87). The main feature of this solution is that for large time ¢ the integral in 
(9.101) does not drop to zero. In other words, there are tails to the disturbance. The 
physical meaning of this will be discussed in a later section. Another solution for 
two-dimensional wave in a finite domain in integral terms will be given in a later 
section (or the so-called Poisson integral). 


9.2.4 Three-Dimensional Waves 


So far, we have considered the solution for both one-dimensional and two- 
dimensional wave equations. In this section, we will consider the solution of a 
three-dimensional wave problem defined as: 


a2 a 


2 2 2 
1 ou Ou Ou Ou 


=— ++ (9.104) 
eo &  & 
Ou 

ul _ = (AX, y,Z), ae = w(x, y,Z) (9.105) 

Ot |,-9 

The solution is found expressed as: 
1 | a Q y 

1.x, yz) = ds + || ¥ as 9.106 
ate a2) Sq at Sq at ( ) 


where Sa is the surface of a sphere with origin at (x, y, z) and radius at and dS 
denotes the surface integral over the sphere (see Figure 9.11). This formula is 
known as Poisson’s mean value formula, and is also known as Kirchhoff’s formula. 
The proof of (9.105) will be considered in Section 9.2.6. Physically, it means that 
the solution at the point (x, y, z) and time ¢ only depends on the average value of 
the prescribed data on the surface of a sphere of radius at, and is independent of the 
value of the initial data within the sphere. As time ¢ increases, once the spherical 
surface passes the initial disturbance zone, the spherical integral given in (9.105) 
will drop to zero. Thus, there is a sharp tail. This is referred to as the Huygens 
principle, which will be discussed further in Section 9.2.8. Actually, Poisson’s 
formula can also be expressed as the mean value of the initial data on the sphere of 
integral as: 
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1 = = 
u(t,xX, y,Z)= 4) 2 tatptan) + aia (9.107) 
a 
where the superimposed bar denotes the average value taken over the spherical 


surface of radius at. In spherical coordinates, the mean value of @ over the sphere 
with center at (x, y, z) and radius of at can be evaluated as: 


radius = at v= ff, u,dS 


=| 


Figure 9.11 Poisson’s mean value solution for 3-D wave 


1 a pla ; dee 
D(x, N= >| "| o(x+atsin¢dcos 6, y+atsin gsin O, 
— Ara’t? 40 Jo (9.108) 


z+atcos¢)a°t’ sin gd0d¢ 


The average of y can be defined similarly. The validity of this solution is shown in 
later sections. 


9.2.5 Three-Dimensional Symmetric Waves 


In this section, we will first consider the solution for spherically symmetric cases. 
More specifically, we have the prescribed values in (9.104) as: 


P=9(r), y=y(r) (9.109) 
Consequently, we must have the solution of wu being: 
u=u(r,t) (9.110) 


For this case, the wave equation becomes 
1 au Ou 2a 


+ 9.111 
aoe a rer ( ) 
It is straightforward to show that (9.110) can be written as: 
2 27, 
o (Tu) _ 2 oO (ru) (9.112) 


ar’ or? 
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Accordingly, the initial conditions can be recast as 


O(ru 
(rw). =ro), LO! =o (9.113) 
Ot |,-9 
In addition, we clearly have an additional condition that 
(ru)|_, =0 (9.114) 


This problem is mathematically equivalent to the one-dimensional wave problem 
for a vibrating string discussed earlier. Therefore, the D’Alembert solution 
discussed in the previous section can be applied directly as: 


u(rst) = Ce a OT FP ey(O)dé, rat >0 
27 2ar ¢r-at 
_ (rt at)p(r + at) —(at—r)ep(at -r) 
2r 


at+r 
+t ey(oae, rar <0 
2ar Yat-r 


(9.115) 
This solution will be used again in the next section, when we discuss the general 
solution for a non-symmetric three-dimensional wave. 


9.2.6 Poisson or Kirchhoff Formula for Three-Dimensional Waves 


Let us define the surface average of the solution as 
mr. =—s [I u(é.n.e.0ds (9.116) 
4ar S 


where the integral over dS is for variables (€, n, ¢). In spherical polar form defined 
in Fig. 1.27, we can rewrite it as: 


(r,t) = =f, u(xtrsin Ocos¢,y+rsin Osin¢,z+rcosO,t}d@ (9.117) 
1 


where the surface integral is conducted over a unit sphere such that 
dS =r°da (9.118) 
Clearly, the value of uw at the center of the sphere can be found by taking the 
following limit: 
lim 4 (7,1) =u(x, y,Z,¢) (9.119) 
r> 


Differentiation of both sides of (9.116) gives 
oh = Pal ua ey OE ising + OF eg O)do 
on 0g 


- Oe (9.120) 
ou. ou. : Ou 
= | (—sin O0cos ¢ + — sin Osin ¢ + — cos O)dS 
Anr? Ids, 0 on 0g 
Applying Gauss’s theorem to (9.119), we obtain 
— 2 a2 2 

. lee ae L fll, 
or Arr a on OC 4nr (9.121) 


re 2 aaa Mlle ov 
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The last line of (9.120) results from the wave equation. The volume integral can be 


rewritten as 
ana’r? S oir f= ff, a _ y-=, git udSdp (9.122) 


Differentiation of (9. oe gives 


6) a r Ou a 
Fe ae Sl. udS (9.123) 
This expression can be rearranged as 
vt 8 (2 5% i @ On 
= dS =— 9.124 
r? or ” or Anr? at? I] e or? 


The following ideaniies can be proved easily 
2p — 2— 
AD zg 28 (9.125) 
or? or Gr? 


Oo ou 2 oi 


2r— 4 9.126 
or i ar or or? 
By using (9.124) and (9.125), we cay obtain 
2 6 2 6 
om = q” oH) (9.127) 
Ot or 
The boundary conditions (9.104) can be rewritten accordingly 
= — Oru = 7 
(ri)| =r@Q, uy) =rv, (ru)| =0 (9.128) 
Ot |-0 
where 
Q(r,t) = (9.129) 
r,t) = i W(En,S.0dS (9.130) 


The last boundary condition in een: is sided similarly to the argument in the last 
section. 


at= (r+at)p(r +at)+(r —at)p(r —at) < 1 [O" em@ae, ju 
2r 2, 
_(r+atyo(rt+ oe —r)p(at—-r) rn 7 fo(Ode, r—at<0 
(9.131) 


Considering the limit of r > 0, by virtue of (9.118) we have 
(r +at)p(r +at)—(at—r)p(at—r) 
2r 


u(x, V,Z,t) = te u(r,t) = lim 

rr r 
a ie. (9.132) 
+lim—["" eve 


r>0 2ar at-r 
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It is clear that both the first and second terms on the right of (9.131) are of the form 
0/0. Application of L’H6pital’s rule results in 

jaa (r +at)p(r + at) —(at—r)p(at—r) _ 
r>0 2r 

Applying L’ H6pital’s rule and Leibniz’s rule of differentiation on the integral given 
in (9.131), we find the second term as 


Q(at)+ ato" (at) = 22 pan (9.133) 
a Ot 


tim EEE =— [at (at) (9.134) 
Substitution of (9.132) and (9.133) into (9.131) gives 
me 1 _ 
u(x, y52,0) = —— [arp(at)] ++ [atip(at)] (9.135) 
a ot a 
Finally, in view of (9.128) and (9.129) we have 
P 
t,x, yz) = ds+|{_ = ¥ as 9.136 
u(t,x, yz) mk Ale at Sy, at ( ) 


This completes the proof of Poisson’s formula or Kirchhoff’s formula. 
In spherical coordinates, Poisson’s formula can be expressed as 


u(r, 0, ¢,t) = =" f- Q(x +atl, y+atm,z+atn)sin Odd ¢) 
4 ae (9.137) 


2a 
ae —{* | w(x-+atl, y+atm,z+atn)sin 0dbd¢ 
0 40 


where 
/=smin@cos¢, m=sin@sing, n=cos8, P +m +n? =1 (9.138) 


9.2.7 Hadamard’s Method of Descent for 2-D Wave 


For two-dimensional wave problems, an integral solution similar to (9.135) can 
also be established by specifying it to the two-dimensional case. This method is 
normally referred as Hadamard’s method of descent. First, the two-dimensional 
wave problem can be summarized as 


10u Ou dru 


w Gt ar? ay’ 


(9.139) 


uj, =e%), 4 =v) (9.140) 
Ot |,-0 
Since the surface integral is no longer a function of z, the incremental area dS can 
be projected to dX’ as shown in Figure 9.12. The contribution from the upper 
hemisphere to the projected circular surface is the same as that from the lower 
hemisphere. Thus, we have 


dS =2dx = 


dx 9.141 
cos@ ( ) 


As shown in Figure 9.12, the cosine function can be determined as: 
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2,2 2 2 
al —e=3) “= (9.142) 
a 


18 Be fe —(x-€Y -(y-ny? 


dz =cos@dS 
le —>| 


Va" +=) 


Projected 


Figure 9.12 Spherical surface projected to circular 


Using these results, we get from (9.135) 
a ee ee = | g 2 dz+|f v2 ay! 6.143) 
4ra| Ot 44z,, at cosO x at CcosO 
Substitution of (9.141) into (9.142) gives 


0S.7) 
E-x) -(n-y) 


dgdn| 


dgdn 


1 a 
u(t,x, y,Z) = 2nra IZ Ih, Je? = 


v(S,7) 

§-x) -@-yy 
This is the two-dimensional Poisson formula. In polar form, this formula can be 
written as 


(9.144) 


“fe Jer = 


1 0 ¢% p27 o(x+ pcosd, y+ psind 
ro [ { Ax+p yt+p ) ddd p| 
2a Ot 40 40 at? — p? 
+: vabee 4 nd (9.145) 
¥ y(x+ pcos, y+ psin ) edit 


2na Io Jo ar — p? 


Note that the integration given in (9.143) and (9.144) are evaluated for the whole 
circular region instead of the spherical surface. This difference in the 2-D and 3-D 
Poisson formulas makes a big difference physically, which will be discussed next. 
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9.2.8 Huygen Principle 


For the three-dimensional wave solution given by Poisson’s formula, Figure 9.13 
shows a particular situation that the initial disturbances @ and y are only given in 
domain 2. 


(i) Case I: 
at < at, (9.146) 
For this case, the wave signal from © has not arrived the center of the sphere. 


Therefore the solution given by Poisson’s formula gives a zero solution because the 
spherical surface has no initial values of @ and w. 


Figure 9.13 Interpretation of Poisson’s formula 


(i1) Case II: 
at, = at 2 at, (9.147) 
Once t = fj), the first wave signal arrives sharply. Within this time period, the 


intersection between the spherical surface of integration and the initial disturbance 
leads to a nonzero solution. 


(111) Case III: 

at > aty (9.148) 
Once ¢ > f, there is no intersection between the spherical surface of integration and 
the initial disturbance. Thus, the solution drops sharply to zero and there is no 
ripple once the wave energy has passed. That is, there is a sharp trailing edge of the 
wave solution. This is known as the Huygens principle. This Huygens principle 
actually does not apply to the two-dimensional wave solution. 


For two-dimensional waves, the solution given in (9.144) shows that the solution is 
a circular area integration. Thus, we have only two scenarios. 


(i) Case I: 
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at < at, (9.149) 


There is no wave signal overlapping the circular domain of integration, and thus the 
solution is zero. 


(ii) Case II: 

at > at, (9.150) 
The integration in (9.144) is conducted on the whole circular area instead of on the 
spherical surface in the three-dimensional case in (9.135). Therefore, there is 
always a tail of the solution in the two-dimensional case. Hence, the Huygens 
principle does not apply to the two-dimensional case. 


2-D space No sharp trailing edge 


Sharp trailing edge 


3-D space 


at—R at at+R 


Figure 9.14 Wave solution in 2-D and 3-D space according to 2-D and 3-D Poisson’s formula 


The existence of tails in the two-dimensional solution can also be seen in (9.102). 
In particular, the solution expressed in terms of an infinite integration is never zero 
even though the region of disturbance is passed. Huygens principle has a significant 
impact in our daily lives. Luckily, we live in a three-dimensional world, in which 
we can hear a clear voice without infinite echoes and ripples when someone speaks 
to you. The sound wave in three-dimensional space dies off instantly after it 
simulates our ears. Imagine that an animal living in a two-dimensional world will 
never be able to hear a clear voice. There are always infinite echoes flying around 
in the two-dimensional flatland world. Sound sources initiated at different times 
will all mix up. The surface waves of the sea can be viewed as a two-dimensional 
domain, and you never see a perfectly calm sea surface. 

More generally, the Huygens principle is true only for odd dimensions (e.g., 
3,5,7,..., except for 1-D) and is false for all even dimensions (e.g., 2,4,6,...). 


9.2.9 n-Dimensional Waves 


In this section, we will extend the analysis of waves in n-dimensional space. In 
particular, the wave equation reads as 


AD (Pur, + Pegs t+ Py, x, ) = Pu (9.151) 


An*n 


This distance in n-dimensional space can be defined as 
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n 
Pax 4x8 +22 = > XjXjp = XyXy (9.152) 
i=l 
Let us consider the case of symmetric radial waves such that @ = ¢(7,A) (i.e., there is 
no angular dependence of the wave function @. 
Differentiation of r with respect to any arbitrary variable x; gives 


Or 0,9, .2 21/2 _ 1 1 Xj 
— = — (ay +05 +. + x, = = 2x, = = «= 9153 
Ox; = [= ") eet to ne; : 
Using tensor notation and the chain rule, we have the following identities 
See. ee ee ee ee (9.154) 
Ox; Ox; 
0p Op Or _ x; OP (9.155) 
Ox; Or Ox; r Or 
2 
Fe ge ee (9.156) 
Ox; i r 
Using these identities, the Laplacian in n-dimensional becomes 
ap 0 x,09, Ox, 1d9 px, 2 (1 80) OF 
Ox,Ox; Ox; r Or Ox; r Or "Orr Or” Ox; 
1 0g oo 1 0g 
= + 9.157 
ae ar or? sr Or ( ) 
or? r Or 
Consequently, the n-dimensional wave equation for radial waves becomes 
2 2 _ 
1 0 p_ oO pin POP 8 ate (9.158) 


ao er r or or or 
This n-dimensional wave is also known as the Euler-Poisson-Darboux equation. Now, 
we look at some special cases: 
Special case: n = 1 


2 2 
ce al = oo (9.159) 
a at or 
The solution obtained from the method of characteristics is 
p= f(r—at)+ g(r+at) (9.160) 
Special case: n = 3 
2 2 
pew 2 (9.161) 


a ot?) O° ror 
It is straightforward to prove that it can be written as (compare Section 9.2.5) 
2 2 
é (9) _o 2 (9.162) 
a a or 
Again, the solution obtained from the method of characteristics is 
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= ] fea sea) (9.163) 
r r 


The first term on the right of (9.162) is an expanding wave whereas the second term 
on the right of (9.163) is a contracting wave. Therefore 3-D wave decay is 


Qn Z (9.164) 
r 


Special case: n = 2 


2a at + —- (9.165) 
This is the most difficult part, compared to the cases of 1-D and 3-D. The solution has 
been found in terms of Hankel functions in (9.90). 
To give insight to the 2-D wave, we consider an approximate solution for the 2- 
D wave. In particular, we first consider the following identity 


10°Wre)_ 1 af 1 ze} 
Vr r’ Jr Or apo 
1 dg, 1 — Op 
-+[-Ozne Sale OF toe rS } 


2 
a aN | (9.166) 


Using this identity, the 2-D wave can be expressed as 
1 &Wre)_@Wre), ¢ 
2 2 2 +7372 
a ot or 4r 
This cannot be solved easily as in the 1-D or 3-D case. However, for r > «, we 


have 
1 &eWre) _ &Wre) 


(9.167) 


(9.168) 
a at er? 
6= fou (9.169) 
vr vr 
Therefore, asymptotically the two-dimensional wave will behave as 
. (9.170) 


pxr—= 

vr 

This result of course agrees with the asymptotic solution for the two-dimensional 
waves given in Section 9.2.3 (see (9.96)). 
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9.2.10 Wavefront Condition 


In this section, we will consider the wavefront condition. Recall a wave equation of 
the following form: 


2 
i av~u (9.171) 
Ot 
Consider the case of harmonic waves such that 
u=ye'™ (9.172) 
Substitution of (9.171) into (9.170) leads to the following Helmholtz equation: 
Vyt+ky =0 (9.173) 
where 
b= (9.174) 
a 
Next, we look for a plane wave solution of the form: 
y(x)=e** (9.175) 
Combining (9.171) and (9.174) gives the solution form as 
u(x,t) = el io" (9.176) 


Since the wave type of the solution consists of a jump across the wavefront and at 
the wavefront the solution uv is a constant, thus we have the following condition on 
the wavefront: 

k-x-at=C, (9.177) 
where C{ is a constant. 

On the other hand, the wavefront can also be expressed generally as 
F (x,t) =0 (9.178) 

Taking the total differential of (9.177), we get 


UF (xt) = VF de + dt =0 (9.179) 


Since there is no change of F along the wavefront, the kinetics compatibility 

condition gives 

2S 24 (9.180) 
dt Ot 

The gradient of the wavefront surface F' is normal to the surface, or mathematically 

it requires 


VF 


VF =|VF\n (9.181) 


where n is the unit normal to the wavefront F' as shown in Figure 9.15. The velocity 
of the wavefront is defined as 


a 
dt 
The normal propagating speed of the wavefront surface is 


v (9.182) 


vy, =n (9.183) 
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F (x,t) =0 


Figure 9.15 Moving wavefront with unit normal n 


Substitution of (9.180), (9.181), and (9.182) into (9.179) gives 


IVF|v, += =0 (9.184) 
Ot 
Comparison of (9.176) and (9.178) gives the wave number and frequency as 
\VF|=|k| =<, = =-@ (9.185) 


From (9.183) and (9.184), the normal component of the moving wavefront is then 


pe vee” =e (9.186) 
at k 


Therefore, the wavefront is moving at the wave speed of the differential equation. 


9.3 NONHOMOGENEOUS WAVE EQUATION 


We now consider the case of the nonhomogeneous wave equation. We have 
demonstrated in Chapter 2 that the nonhomogeneous wave equation appears 
naturally in the solution of the Maxwell equation as well as in the dynamic problem 
of elastic solids subject to body force. 

The problem can be posed as: 


Oru 


ae AV ut f(x,¥,250) (9.187) 
ou 
ul 6 = Q(X, ¥,Z), “Su =w(x, y,Z) (9.188) 
Ot |,-9 


By the principle of superposition, the problem can be decomposed into two 
associated problems. 


Problem I: 


= 0 Vu (9.189) 
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Ou 

ti = P(x, Ys Z), be =wW(x, y,Z) (9.190) 

t=0 

Problem II: 
Ou 

a> >*V7ut f(x, YsZ0) (9.191) 
| ag 0 Gu) 56 (9.192) 

t=0 Ot =p 


The solution of Problem I has been given by Poisson’s formula in Section 9.2.6. 

The solution of Problem II can be found again by Poisson’s formula if a 
proper change of variables is introduced. In particular, we use the Duhamel integral 
as: 


t 
u(x, y,Z,t) = I W(x, y,Z,¢t-—73T)dt (9.193) 
0 
Application of the Leibniz rule of differentiation gives 
t 
cs =[ Ne pata eie eee On (9.194) 
ot Jo Ot 
This can be simplified by setting an initial condition for w as: 
w(x, y,Z,0;f) =0 (9.195) 
Differentiation of (9.193) again gives 
au tow 
=| oY a yZt—T dr + 2x, y,z, 031) (9.196) 
at? 40 ar? 
Taking the Laplacian of (9.192), we get 
t 
V7u(x, y, Zt) = | V?wdt (9.197) 
0 
Substitution of (9.195) and (9.196) into (9.190) results in 
1 2 
[ 4 -evwar=0, Ot) 3h (9.198) 
0° ar? Ot |,<0 
Therefore, the governing equation for w becomes 
2 
aes vw (9.199) 
ot 
ow 
w= S| =f» (9.200) 
= Ot |,-0 


The solution of w is clearly a special case of Poisson’s formula that we considered 
in Section 9.2.6. Thus, the solution for w is readily obtained by using Poisson’s 
formula as: 


L169 


Sa T) 


WX, Y, 2,657) = oll, (9.201) 


Substitution of (9.200) into (9.192) eee in the a of Problem II as: 
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f =: i = L(G,1,557) ag (9.202) 


Finally, the solution of the Py is eiieined by Soret (9.135) and (9.201) 
as 
a - S) a2 - c) 
= add, 
u(t,x, y,Z) = a al MJ, Rehends+ ||. IS 


Ta 
Lap “fen & $Emes0) ag 


” doe a Sat ra) 
The solution resulting from the ae term can be rewritten slightly by 
introducing the following change of variables: 


u(x, V,Z,t) = 


(9.203) 


R=a(t-t), dt= = (9.204) 
a 


The solution of Problem II becomes 


o 1 po ee f(Engt-R/a) i | 
wl oy20= I. | ‘ ae (——aR) 


_ I o S(6,7,6,t-R/a) 
err | } ; dSdR (9.205) 


_ I S(6,7,6,t-R/a) 
Ana’ I) Bp R id 


This solution can be regarded a retarded potential (e.g., Jackson, 1999). To see 
this, we recall the solution of Poisson’s equation in terms of the so-called 
Boussinesq potential. In mathematical terms, the Poisson equation is given as 


V7u(r) =—p(r) (9.206) 
The solution of this equation can be expressed as 
1 PO) a 
r)=— dV 9.207 
u(r)=—— [If (9.207) 


where 1/R is the Boussinesq potential or Newtonian potential. Comparison of 
(9.204) and (9.206) shows that the solution in the wave equation is indeed a 
solution resulting from a potential but at a reduced time ¢—R/a instead of at time ¢. 
If there is a time dependent solution at a point P(r) denoted by u(r, £) at time ¢, this 
solution at time ¢ can only be dependent on a signal from an earlier time. The time 
required to arrive at point P(r) is R/a (recall that a is the wave speed in (9.186)). 

Finally, the solution given in (9.202) can be expressed in terms of the 
retarded potential as 


mexn2=—) a fl, MEDD ass ff Lend gs 


Ta 


fs jf I(G,1,9,t-Ria) iy, 
4a Br R 


(9.208) 
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9.4 HELMHOLTZ EQUATION 


In this section, we will consider the solution of the Helmholtz equation in spherical 
coordinates. According to classification discussed in Section 7.2.2, the Helmholtz 
equation is of the elliptic type. But as we discussed in an earlier chapter, Helmholtz 
is also known as the reduced wave equation. The Helmholtz equation is somewhat 
between the wave equation and the Laplace equation. In general, the Helmholtz 
equation can be expressed as 


Veut+k-u=0 (9.209) 
More specifically, in spherical coordinates we have 
2 
7 Ope Oss ; Le (sing ae ; : 5 e S+hu=0 (9.210) 
r° or or “sing 0g 0p r*sin’? 0@ 
Applying the following change of variables, we obtain 
u = R(r)@O(A)@(¢g) (9.211) 
Substitution of (9.210) into (9.209) gives 
2 
OD L a (r? a + | a (sing Lon a ag = +k?R@®=0 (9.212) 
r- dr dr y*sing dg dp r’sin°y 00 
This equation can be = as 
2 
i d 2 dR : 1 fangs JIsin? p+/2? sin? p =-+ £® - 1 
Radr dr @sing dp dp © dé 
(9.213) 
where wis a separation constant. The last part of (9.212) can be expressed as 
2 
= +u@=0 (9.214) 
do’ 
The solution of © is 
© = Acos u0+ Bsin uO (9.215) 


There must be periodicity in @ and this leads to u = m (m = 0,1,2,3,...). The first 
equation of (9.212) can be rearranged as 


1d (24k ue 1 
Radr dr sin? g sing dep 


cae sing“ © ace (9.216) 
dp 


where the separation constant is written as v(v+1). Equation (9.215) leads to two 
ODEs: 
2 aR 


“¢ pre —v(v +1)]R =0 (9.217) 
ae: 
“(sing were —>—]® = 0 (9.218) 
sing dp dp sin’ @ 

For the differential equation of ©, we can introduce a change of variables 

x =cos@ (9.219) 
Using the chain rule of differentiation, we get 
d® d. 

dD _ d® dx -i- 2 d® (9.220) 


dp & do 
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With this result, we can rewrite (9.217) as 
2 


2 
(14 ay 2x52 +[v(v +1) =]®=0 (9.221) 
IX 


The general solution is the Associated Legendre functions: 
® = EP” (cos ¢~) + FO!" (cos @) (9.222) 
For solid spheres, we must have boundedness at ~ = 0 and z and this leads to F = 0 
and v =n (where n = 0,1,2,3...). 
Equation (9.216) can be reduced to a standard ODE by using the following 
substitution: 


R=r?y (9.223) 
2 
ieee [r°k? —n(n eg =0 (9.224) 
dr” dr 4 
This can further be rewritten as: 
2 
pe if ede [r?k? —(n+1/2) WV =0 (9.225) 
dr dr 
This is the Bessel equation and the solution for v is 
Ve = CS 41/2 (Ar) + DY 41/2 (Ar) (9.226) 


Finally, combining all these results the general solution for Helmholtz equation in 
spherical coordinates is: 


u=r '*{AcosmO + BsinmO\ {CT y.41/2(kr) + DY,41/2(kr)} Br" (cosp) (9.227) 


This can be shown to be the same as the one given in Section 7.4.3 of Chapter 7. 
Actually, the Bessel function together with 7!” can be written as new functions 


called the spherical Bessel functions (Abramowitz and Stegun, 1964): 


In(kr) = [ana (9.228) 


y, (kr) = Foun) (9.229) 


which are called spherical Bessel functions of the first kind and second kind, 
respectively. In terms of these new functions, the solution of (9.209) can be 
expressed as: 


u={Acosm0+ BsinmO}\{C j, (kr) + Dy, (kr)| P.” (cos@) (9.230) 


9.5 TELEGRAPH EQUATION 


In this section, the telegraph equation is considered. First, starting from the 
Maxwell equations, we derive the telegraph equation. The hyperbolic equation is 
then transformed to the canonical form before a proper change of variables is 
applied. Eventually, it is shown that the solution can be expressed in terms of the 
Bessel function. 
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9.5.1 Formulation 


Recall from Chapter 2 that the Maxwell equations can be written as: 


vV-E=£, V-B=0, 
« ae ie (9.231) 
Vx Ff =-—, VxB=u(Jt+e—) 
Ot Ot 

where B and E are the magnetic field vector and electric field vector. In addition, ¢ 
is the permittivity of the material, p1 is the permeability of the material, and p is the 
charge density. Let us introduce a constitutive relation for the conduction part of 
the electric current J: 

J =oEF (9.232) 
where o is the electric conductivity. Note that o = 0 for perfect dielectric. For the 
case of no electric charge p, we can rewrite the Maxwell equation as: 

V-£=0, V-B=0, 


oB 


9.233 
Vx £=-—, Vidaneees ( ) 
Ot Ot 


On the right hand side of the fourth equation of (9.232), the general current density 
comprises two terms, the first being the conduction current density and the second 
being the displacement current density. Maxwell was the first to realize this 
decomposition. 

Taking the curl of the third equation of (9.230) and taking the time derivative 
of the fourth equation of (9.230), we obtain 


2 
Viveweuvx. Vx Be yo E+, (9.234) 
Ot Ot Ot ot 
Recall from (1.362) of Chapter | the following vector identity: 
Vx(VxE)=V(VeE)-V°E (9.235) 


Equating the two equations in (9.233) and applying the first equation of (9.232) 
and (9.234), we find 
OB oF 
E —— = WE - po — 9.236 
fen p Be (9.236) 
This is Maxwell’s equation for the electric intensity vector E. Note that for a good 
conductor (¢ ~ 0), we have 
VE= ie (9.237) 
Ot 
The Maxwell equation becomes a three-dimensional heat equation. For perfect 
dielectric (o ~ 0), we have 


——=°WE (9.238) 


where c is the light speed. Thus, the Maxwell equation reduces to a three- 
dimensional wave equation. 
For the case of one-dimensional space, we have 
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Of OB OF 
ae ac ° o 
where E becomes the scalar electric field for the 1-D case. This can be used to 
model the telegraph problem. 


HE (9.239) 


9.5.2 Solution 


To solve (9.238), we first remove the first order derivative term by assuming 


E(x,t) =W(x,He" (9.240) 
Differentiation of (9.239) gives 
er OW ge +Ke"W (9.241) 
ot Ot 
2 2 
o ae . 208 OM. ext (9.242) 
Ot Ot Ot 
2 2 
aes aot 4 (9.243) 
Ox Ox 
Substitution of (9.240) to (9.242) into (9.238) leads to 
aw aw ow 
E = Oo + 2KUE K(uo + KuE)W 9.244 
a ee (Mt ary (Mo + KE) ( ) 
To remove the first derivative term, we set 
fa. (9.245) 
2¢ 
Substitution of this value of « into (9.243) gives 
2 2 
OW ST oy (9.246) 


ore Ox? — ie 
To put this into the standard hyperbolic form, we can absorb 1/(e) of the first term 
on the right to give 
2 3 
ewe err (9.247) 
Ot Os 2¢ 


where 
S=./ MEX (9.248) 
Applying the standard change of variables for the hyperbolic equation, we 
introduce 
€=st+t, n=s-t (9.249) 
Or, equivalently, we can write 
s=5E4m), t= 56-0) (9.250) 


Using this change of variables, we get 
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OW OW .aw ow 


- (9.251 
at? aE? — NOE =O?” 

2 2 2 2 
a yg ae (9.252) 

Os 0g Onog On 

Therefore, the canonical form (9.245) becomes 
2 
OW (pq (9.253) 
0g0n 4eé 


One major observation that we can make on (9.252) is that the differential equation 
is symmetric with respect to the two variables € and n. This observation suggests 
the following single variable 


5 =(5—S0)7—M0) (9.254) 


where 
So =VHEX) +l, Myo = V HEX — ho (9.255) 


with xo and fo are the initial point and time that telegraph signals were sent. In terms 
of this new variable, we have 


2 
-— C2! sap =0 (9.256) 
dé dé 
where 
oY 
eae 9.257 
--(2) (9.257) 


This is a second order ODE with a non-constant coefficient. It can be transformed 
into a Bessel equation by introducing 


v= 4ce (9.258) 
Differentiation of (9.257) gives 
a= |e (9.259) 
dg Ve 
Using the chain rule, we get 
aw ef aw (9.260) 
dg \o dy 


d°’w 1 [c dW cd’w 


+ 9.261 
dl? WNC dy Gay’ rl 
Substitution of (9.259) and (9.260) into (9.255) gives 
dw dw 
2 2 
+ +y°W=0 (9.262 
y dp ‘i cf ) 


This is the Bessel equation of zero order. The electric field must be finite at the 
starting point. Thus, the solution becomes 


B= Aly [nex %) Ut to)’ Je 28 (9.263) 
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This is the solution for the telegraph equation. 


9.6 DIFFUSION EQUATION 


As discussed in Chapter 7, second order PDEs can be classified into hyperbolic, 
elliptic and parabolic. It has been shown in Chapter 7 that all parabolic PDE types 
can be converted to nonhomogeneous diffusion equations. The solution for the 
nonhomogeneous diffusion equation has been considered in Section 7.4.2. In this 
section, we focus on the homogeneous diffusion equation. 


9.6.1 Heat Conduction 


Consider the heat conduction problem of a 1-D bar of length ZL modelled by the 
diffusion equation: 
2 
a cea O<x<L, t>0 (9.264) 
Ox” = Ot 

where u is the temperature field in the bar and a? is the coefficient of diffusion. In 
fact, this is one of the very first PDEs considered by scientists and mathematicians. 
We will derive this equation from the fundamental principle of heat conduction, or 
the so-called Fourier’s law of heat conduction. According to the Fourier law of heat 
conduction, the heat flow at any cross section can be estimated by the temperature 
gradient at the section: 


u(x +Ax,t)—u(x,t) _ Act (9.265) 
Ax ox 

where « is the coefficient of thermal conductivity and the negative sign indicates 

the decreasing nature of temperature along the direction of diffusion. The 

temperature change in this incremental element can be estimated as 


1QAt QAt 

s Am  spAAx 
where s is the specific heat of the material, p is the density of the bar, A is the 
cross-section area of the bar, Am is the mass of this segment of cross-section, and 


At is the change in time. As illustrated in Figure 9.16, the net heat flow rate QO is 
related to H as: 


A(x,t)=— lim cA 
Ax>0 


Au = 


(9.266) 


O = H(x,t)— H(x+ Ax,t) = k| Au,(x+ Ax,t) - Au, (x,0) |= me Sax (9.267) 
IX Xx 


Figure 9.16 Fourier law of heat conduction 
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Substitution of (9.266) into (9.265) gives 
phage ty —) (9.268) 


In fact, the formulation of this relates to the well-known Sturm-Liouville problem 
of eigenfunction expansion (see discussion in the next chapter on eigenfunction 
expansion). If the bar is prismatic or uniform in cross-section, we find 


2 
eee (9.269) 
Ot ax? 
where the thermal diffusivity is defined as 
=> (9.270) 
ps 


This completes the proof of the diffusion equation. In fact, diffusion of pollutants 
or chemicals in fluids can also be modelled by a similar equation: 


Hp, (9.271) 
Ot 
where the spatial derivative term is replaced by the Laplacian operator. 


Let us assume the following separation of variables 


u= X(x)T(t) (9.272) 
Substitution of (9.271) into (9.268) leads to 
oxX'T =xXT' (9.273) 


By dividing through by X7, this equation can be simplified as 
XAT __ (9.274) 
A oT 
Since X is only a function of x whereas 7 is only a function of ¢, the only possibility 
is that the left hand side and the right hand side are both constant. Note that we 
have assumed a negative value of this constant in the last part of (9.273). This 
choice is very important, and it is required by physical consideration of the 
problem. For initial boundary value problems with some initial non-uniform 
distribution of temperature along x and with conducting ends, we expect the 
temperature field is a function of time. More importantly, it must decay within 
time, and this fact leads to the choice of negative sign in (9.273). 
The two ODEs that result from (9.268) are 


X44 X =0 (9.275) 
T'+@°A°T =0 (9.276) 
The solutions of these equations are readily obtained as: 
X = Asindx+BcosAx (9.277) 
2,2 
T=ce*™'! (9.278) 


It is now clear that the negative sign chosen in (9.273) indeed results in a decaying 
temperature field for the case of an imposing initial temperature field in a 
conducting bar. Thus, the general solution becomes 


DD 
y =(AsinAx+BcosAxje* *' (9.279) 
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The value of 2 must be determined by boundary condition. The following example 
illustrates the procedure for obtaining the eigenvalue of A. For more comprehensive 
coverage of heat conduction problems, we refer to the classic book by Carslaw and 
Jaeger (1959). 


Example 9.3 Solve the following diffusion equation with prescribed boundary 
conditions: 


Cup, =U, O<x<L, t>0 (9.280) 
u(0,t)=0, u(L,t)=0, 1t>0 (9.281) 
u(x,0)= f(x), O<x<L (9.282) 


Solution: Using the separation of variables given in (9.271), we have the 
formulation for X as 


X"+2V7X =0 (9.283) 
X(0)=X(L)=0, t>0 (9.284) 
Substitution of (9.276) into the first equation of (9.280) gives 
B=0 (9.285) 
The second equation of (9.280) leads to 
sin JAL =0 (9.286) 
That is, we require 
VAL =n (9.287) 
Thus, there are infinite discrete eigenvalues 
A, awn lD, n=1,2,3,+ (9.288) 
The eigenfunction that corresponds to this eigenvalue is 
X,, (x) =sin(nxx/L), N=1;2; 3:00 (9.289) 
Substitution of this eigenvalue into (9.277) gives 
T, =k,e mally (9.290) 


The fundamental solution is 
2 
iactyae ro sin (nea LE), WA 123 uae (9.291) 


The general solution that can be used to fit any boundary condition can be 
expressed as: 


foe) 00 2 
u(x,t) = > oe (x,t) = yee ‘sin(nzx/L) (9.292) 
n=l n=l 
Application of the initial condition given in (9.281) results in 
u(x,0) = f(x) = }°e, sin(nzx/L) (9.293) 
n=l 


Multiplying both sides of (9.292) by the sine function of argument mzx/L and 
integrating it from 0 to L gives: 
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L = L 
I, f(x)sin (ma x/L) dx = Den] sin (mz x/L)sin(nzx/L)dx (9.294) 
0 
n=l 
Recall the following orthogonal property of circular functions (e.g., See Section 
10.5 for proof): 


L =0, x 
I i ais OO es nn (9.295) 
0 L L =L/2, m=n 
In view of (9.294), we have 
L 
i = I, f(x)sin(nzx/L)dx (9.296) 


Combining (9.295) and (9.291) gives the final solution of the problem. 


9.6.2 Terzaghi 1-D Consolidation Theory 


In the 1-D consolidation theory in soil mechanics, Terzaghi derived the following 
diffusion equation: 
2 
Ole 6 SMe gczc2d, 1>0 (9.297) 
ot 2? 
where ue is the excess pore water pressure in the soil as a function of depth z and 
time ¢. The coefficient of consolidation c, is defined as: 


ae! (9.298) 


Oe Fie 
where k, m,, and %, are the coefficient of permeability in Darcy’s law (which has a 
similar physical meaning as the Fourier’s law in heat conduction), coefficient of 
volume compressibility, and unit weight of water. The excess pore water pressure is 
defined as the difference between the pore water pressure in the soil and the long 
term steady state pore water pressure at the same point (typically hydrostatic 
pressure): 


u, =Uu(x,t)—Uu(x,%) (9.299) 

The consolidation theory describes the process of driving water from the clay due 
to the non-zero excess pore water pressure. We now look at its derivation. 

The flow velocity through the soil element shown in Figure 9.17 is governed 
by Darcy’s law as: 
oh__ ik %, 
Oz Vy OZ 
where k is the coefficient of permeability and is a function of soil type, i: is the 
hydraulic gradient in the z direction and is defined as total head loss per flow 
distance, and h is the total head in the soil (total energy measured in length). This 
is because the total head change is due to the change in pore water only. The 
continuity condition is 


(9.300) 


v, =ki, =—k 


ae dxdydz = ld 
Oz dt 


(9.301) 
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Combining (9.299) and (9.300) we obtain 


C2 ' 
< Me dxdydz = < =m, 2 dxdydz (9.302) 


Vw OZ 


Figure 9.17 1-D consolidation theory of soil due to Terzaghi 


where m, is the coefficient of volume compressibility (a parameter indicating the 
compressibility of soil in the unit of inverse of stress). However, the increase of the 
effective stress o’ is due to the decrease in the excess pore pressure. In other 
words, the loading taken up temporarily by pore water pressure is transferred to the 
soil skeleton in terms of effective stress increment. Thus, we have 


OF fe (9.303) 
Ot Ot 
Consequently, substitution of (9.302) into (9.301) gives 
2 
Me te (9.304) 


ot YW, az? 
This is evidently equivalent to (9.296). 

In an oedometer test in a soil laboratory, once a loading is suddenly applied 
to the clay of thickness 2d, an excess pore water pressure u; will build up at time 
zero. In this kind of test, porous stone is put at the bottom and the top of the clay so 
that drained conditions are created at the boundaries. Mathematically, the initial 
and boundary conditions are given as: 

u,(z,0)=u;(z), OS z<2d (9.305) 

u,(0,t)=0, u.(2d,t)=0, t>0 (9.306) 

We see that the solution for head conduction obtained in the last section equally 
applies here with the following identifications: 


u(x,t) <-u,(z,t), L < 2d, a? <c,,x<z, f(x) <u,(z) (9.307) 
In addition, if we assume the initial excess pore water pressure is constant, we have 
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du. = 2m+1)az_ _ 2.2 
u,=—! > : sint! 2 ye Ome a Tt (9.308) 
a <(2m+1) 2d 
where the time factor 7, is defined as 
Gt 
= ae (9.309) 
In obtaining (9.308), we have used the following identity: 
2d | nnz 
I ae =1l-cosnz 
0 
(2.310) 


_|2, n=odd 
0, n=even 
This solution is found very useful in devising the root time method as well as the 


logarithmic time method in estimating the coefficient of consolidation c, defined in 
(9.297) in the laboratory. 


9.6.3 Living Underground 


One of the main reasons to live underground is that rock and soil can act as a 
thermal insulator for underground structures. Due to seasonal changes of 
temperature, the ground surface is subject to periodic heating and cooling. The 
problem can be formulated as heat diffusion for temperature field u as: 


u, = YU,, (9.311) 
u(t,0) = acos at (9.312) 

The circular frequency for a yearly cycle is 
aid =2.0x107 57 (9.313) 


ae 
365.5 x 24x 60x 60 
The temperature field must decay with the depth as 


u(t,z) >0, z>0 (9.314) 
To solve the problem efficiently, we can rewrite the boundary condition as 
u(t,0) = ae’, lim u(t,z) =0 (9.315) 
We seek a solution in the form: 
u(t,z) = v(z)e"™ (9.316) 
Substitution of (9.315) into (9.310) gives the following ODE for v 
yv" —iav =0 (9.317) 
v(0) =a, im vW(z)=0. (9.318) 


The two independent solutions are 
v(z) = Ce" 4 Ge NO? (9.319) 


We note by Euler’s formula that 


T 


i=e2, (9.320) 
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thus, we have 


Vi =e* = cos + isin 4 = ging (9.321) 


I 
V2 


u(t, 0) 


Underground structure 


Figure 9.18 Temperature variations in underground structure due to seasonal temperature 
change on the ground surface 


Substitution of (9.320) into (9.318) gives 


v(z) = CNP ONOM 4 Ce Volante: (9.322) 
Applying the boundary and decay conditions to (9.321) gives 
v(z) = ae VOIR): (9.323) 
Back substitution of (9.322) into (9.315) gives 
u(t,z) = ae VO" Orz or foi) (9.324) 


For the boundary condition given in (9.311), we can take the real part of the 
solution given in (9.323) to give 
u(t,z) =ae VO cost at - fol 2y)21 (9.325) 
where the phase lag is 
0 =Jo/(2y)z (9.326) 
For the case that the phase lag is an integer multiple of 2, the temperature at the 


ground will be completely out of phase with that of the point of consideration. The 
first out-of-phase nodal point underneath the ground is at the depth 


Z=n\2y/@ (9.327) 
For soil underneath the ground, a typical value of heat conduction is 
y =10°m/s (9.328) 
Substitution of (9.327) and (9.312) into (9.326) gives 
Z=9.9m (9.329) 
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and at this depth the surface value will decay by a factor of 

e * =0.043214 (9.330) 
In Hong Kong, the maximum temperature is about 34°C and the lowest temperature 
is about 7°C (or a ~ 13.5°C). This gives a maximum temperature variation of about 
1.167°C at a depth of 9.9 m. In conclusion, we find that at a depth of about 10 m 
below the ground, the seasonal year-round temperature variation is from 19.9°C to 
21.1°C. This makes the underground structure ideal for human usage in terms of 
the roughly constant temperature environment. It also provides an ideal condition 
for storage of goods and supplies. In fact, rock caverns and subsurface structures 
have been used for thousands of years in countries near the Arctic Circle, like 
Finland and Sweden. This provides a scientific judgment for using underground 
structures. 


9.7 LAPLACE EQUATION 


The Laplace equation is one of the very first second order PDEs studied 
extensively by mathematicians. Distribution of electrostatic potential, streamline 
and potential function of incompressible potential flow of fluid, deflection of 
membranes, and torsion of prismatic bars are phenomena governed by the Laplace 
equation. It was named after French mathematician Laplace. It is defined as 


V-u =0 (9.331) 
where the Laplacian differential operator in 2-D and 3-D Cartesian coordinates are 
defined as: 

Ou Oru 2 Ou Ou Ou 
a aa? Y emg are 

ox” oy Ox” Oy” az 

Let us derive the Laplace equation from the seepage problem from a soil mechanics 


point of view here. In particular, consider an incompressible potential flow in 2-D 
space as shown in Figure 9.19. The net inflow to the element is 


V-u= 


(9.332) 


v,dydz + v,dxdy (9.333) 
The net outflow from the element is 
(vy, + ns dx)dydz +(v, + Os dz)dxdy (9.334) 
OK Oz 
Subtracting the outflow from the inflow, we have 
OV 5 ny (9.335) 
Ox OZ 
More generally, we can recast this continuity equation as 
V-v=0 (9.336) 
Using Darcy’s law, we have 
ea ee (9.337) 
a “Ox Ox Oz 
pote, BY (9.338) 


"& & Ox 
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where the coefficient of permeability along the x and z directions are defined as ky 
and k,, and hydraulic gradients along the x and z directions are i, and i,. Darcy’s 
law has been discussed in the last section when we discussed Terzaghi’s theory of 
1-D soil consolidation. There are two additional functions that we define in (9.336) 
and (9.337). The first one is the potential function @, and the second one is the 
stream function wy. If we assume isotropic flow (i.e., the coefficients of permeability 
along the x- and z-directions are the same), we can integrate both (9.336) and 
(9.337) to get 


x 


Figure 9.19 Continuity of fluid flow 


~(x,Z) =—kh(x,z)+C (9.339) 
We can see that the contour plot of the potential function @ is similar to that of total 
head or total energy. In seepage theory, they are referred as the equi-potential lines 
of the flow. If we substitute the definition of @ given in (9.336) and (9.337) into 
(9.334), we obtain 


pall ay ae (9.340) 


Therefore, the flow potential function satisfies the Laplace equation. If the flow is 
irrotational, we have 
Vxv=0 (9.341) 

This implies that there is no vorticity in the fluid and physically also implies there 
is no viscosity effect in the fluid. For the present 2-D case shown in Figure 9.19, we 
have the irrotational condition as: 

Ov, Ov, _ 

Ox OZ 
If we substitute the definition of y given in (9.336) and (9.337) into (9.341), we 
obtain 


(9.342) 


2 2: 
ate -0 (9.343) 
IX IZ 
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This is again the Laplace equation. Thus, both the potential function and stream 
function satisfy the Laplace equation. Physically the stream function indicates the 
flow line in the fluid. To see this, we can take the total differential of the function 
equal to a constant value, i.e., y= c: 


bp ee ep 8 (9.344) 
Ox OZ ; 
The direction of this stream function plot is 
e) we (9.345) 
dx vay Va 


This shows that the slope of the plots of y = c always equals the flow direction, 
regardless of the value of c. On the other hand, to study of the slope of plots of @ = 
c we can consider the total differential of @ = c: 


hp tg? ae gobi do=i8 (9.346) 
ox Oz : 
Rearranging this equation gives 
dz oe (9.347) 
dx ae V, 


For two straight lines intercepting at a point, the angle of interception can be 
calculated as 
m,—-m 
tan 9=—2—1 (9.348) 
1+mm, 
where m and mz are the slopes of the two lines. If the two lines are perpendicular, 


we have 9 = n/2 or 


m =-— (9.349) 
mM 
Comparing the slope of stream function given in (9.344) and slope of the potential 
function given in (9.346), and in view of the result given in (9.348), we can 
conclude that plots of the potential function and stream function are always 
perpendicular. 
Referring to Figure 9.20, we can also see that the value between different plots 
of stream function equal to the flow rate between these streamlines: 


Wo Wo YW ay ay Wo 
Ag = | v,dn = | (-v,dx + v,dz) = } (Sax +E de) = } dy =w.-v, (9.350) 
WY WY WY WY, 


Considering the flow continuity between the two flowlines in element ABCD given 
in Figure 9.20, we have 


Aq = Ay =v,An= 2 An (9.351) 
iS 


The last part of (9.350) is a consequence of the definition of @ given in (9.336) and 
(9.337). In particular, we can replace x or z in these equations by s and 
subsequently v, or v- by vs. Replacing the incremental change in (9.350) by 
differentiation, we have 
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ay = oe (9.352) 
On Os 
This relation between yw and @ gives the final condition for the potential function 
and stream function. Based upon them, the flownet technique can be developed. 
The details are, however, out of the scope of the present section. 


Figure 9.20 Flownet form by potential function and stream function in potential flow problems 


9.7.1 Dirichlet Problem 


This problem has been considered by many others (like Poisson, Green, and Gauss) 
long before German mathematician Dirichlet did. But because of Dirichlet’s 
contribution to the analysis of the problem, it was named after him. Dirichlet was a 
German mathematician, who was born to a French family in Germany. 
Mathematically, it is formulated as 
V-u =0 (9.353) 
u=Uy, onS (9.354) 
where S denotes the surface of the domain of the problem. It is also referred to as 
the first boundary value problem in potential theory. For vibrations of a string, it 
gives a fixed end condition. For equilibrium problems of soap film, the Dirichlet 
problem is like a closed wire loop with prescribed deflection of the soap film. This 
problem has been studied by many well-known mathematicians, including 
Poincare, Lyapunov, Gauss, Lord Kelvin, Weierstrass, Neumann, Wiener, 
Lebesgue, and Kellogg regarding its uniqueness and existence of the solution for 
different domains. In finite element formulation using calculus of variations, the 
Dirichlet boundary condition is normally referred to as the essential boundary 
condition (see Chapters 13 and 14). We will see in Section 9.7.8 that uniqueness of 
the solution of the Dirichlet problem can be guaranteed. 
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9.7.2 Neumann Problem 


Instead of imposing the unknown solution on the boundary as shown in (9.353), the 
Neumann problem prescribed the normal derivative of the unknown on the 
boundary. This formulation was named after German mathematician Carl Neumann 
(don’t confuse with Nobel Prize winner von Neumann): 


V-u=0 (9.355) 


“ =g, onS (9.356) 
in 


It is also referred to as the second boundary value problem in potential theory. For 
example, for electrostatic problems with an insulating boundary, the Neumann 
problem is formulated. For acoustic problems, the Neumann boundary with g = 0 
corresponds to a solid wall. For vibrations of string, the Neumann boundary 
condition corresponds to a “freely rotating end.” For a nonhomogeneous Laplace 
equation or Poisson equation, we cannot arbitrarily impose the function g. There is 
a compatibility condition that g must satisfy. For the fluid flow problems, if there is 
an internal source (nonhomogeneous term in the PDE), then the outflow condition 
on the boundary (modelled by the Neumann boundary condition in (9.355)) must 
satisfy the continuity of the flow (i.e., fluid that comes in from the source must go 
out from the boundary). For the case of electrostatic problems, if there are internal 
charges (i.e., nonhomogeneous terms in the PDE), the net electric flux felt on the 
boundary as a whole must reflect the effect of the internal charges. For the case of 
heat flow, if there is an internal heat source, the total heat flux passing the boundary 
must equal that of the internal sources for conservation of energy. In finite element 
formulation using calculus of variations, the Neumann boundary condition 
corresponds to the natural boundary condition (see Chapters 13 and 14). 

We will see in Section 9.7.8 that there is an integrability condition of the 
Neumann problem. 


9.7.3 Robin Problem 


Other than the Dirichlet or Neumann boundary condition, a more general type of 
boundary condition has been proposed. It is known as the Robin boundary 
condition or Robin problem. Many authors also simply refer to it as the third 
boundary condition (the first and second ones are referred to as Dirichlet and 
Neumann problems). Mathematically, it is formulated as 


V-u =0 (9.357) 
Ou 

au+b—=f, onS (9.358) 
on 


Note that the boundary condition involves both the unknown w and its normal 
derivative. For the special case a = 0, we recover the Neumann problem; and for 
the special case b = 0, we recover the Dirichlet problem. For a partially absorbing 
boundary, the Robin type of boundary condition can be used. For heat conduction 
problems, the Robin boundary condition corresponds to the heat conduction rate at 
the boundary being proportional to the temperature there. The problem is named 
after Gustave Robin, whose PhD advisor is the eminent mathematician Emil Picard 
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and his thesis committees consisted of Hermite and Darboux. However, according 
to Gustafson and Abe (1998a,b), Robin never used this type of boundary condition 
himself. Nevertheless, they thought that it was related to his PhD thesis on potential 
theory. In potential theory, both the single- and double-layer potential methods for 
solving boundary value problems in electrostatics are attributed to Robin. 
Gustafson and Abe (1998a) speculated that it was Bergman in 1948 who called the 
third boundary condition Robin’s boundary condition. But, it seems to be a mistake 
made by Bergman. 

The solution form of the Laplace equation depends on the coordinate system 
that we employ. In general, we can use separation of variables to solve the Laplace 
equation. However, it is natural to ask whether the Laplace equation can always be 
solved by separation of variables. So far, it is known that the Laplace equation is 
separable in the following thirteen coordinate systems only: Cartesian, circular 
cylindrical, spherical, oblate spheroidal, prolate spheroidal, elliptic cylindrical, 
conical, paraboloidal, parabolic, parabolic cylindrical, ellipsoidal, bispherical, and 
toroidal. For most mechanics and engineering problems, Cartesian, circular 
cylindrical, and spherical coordinates are, however, found sufficient. 

In particular, in Cartesian coordinate the general solutions are expressible in 
the product of circular functions and hyperbolic functions, in cylindrical 
coordinates the general solutions are expressible in the product of circular 
functions and Bessel functions, and in spherical coordinates the general solutions 
are expressible in the product of circular functions and Legendre polynomials. 
They are considered separately next. 


9.7.4 Spherical Coordinate 


In spherical coordinates, the Laplace equation can be expressed as (Chau, 2013): 


2 
Vi? i iy Eg yo sy) 
r? or or _’sing 0g 0p r’sin*p 067 
Applying the following separation of variables, we obtain 
u = R(r)O(A)@(¢g) (9.360) 


Similar to the discussion for the Helmholtz equation, this separation of variables 
leads to 


1 dR lL @. dd... 1a 
[ as r +— (sing Pie gee ay (9.361) 
Rar dr @sing dg dp © do’ 
where is a separation constant. The last part of (9.360) can be expressed as 
2 
F® | 7@=0 (9.362) 
do 


There must be periodicity in 0 leading to u = m (m = 0,1,2,3,...). The solution of © 
is 

© = Acosm0@ + BsinmO (9.363) 
The first part of (9.360) can be rearranged as 
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ld 2 dR Ww 1 od 
Rdr dr’ sin?g @®sing dg 


Gael) =n(n+1) (9.364) 
dp 


where the separation constant is written as n (n+1) with n = 1,2,3... The reason for 
n being an integer has been given in Section 9.4 for the Helmholtz equation. 
Equation (9.214) leads to two ODEs, and the first one is: 

2 
Lae re ao ee ee 
? dp 


5] =0 (9.365) 
sin” @ 
For the differential equation of ©, we can introduce a change of variables of x = 
cosg and subsequently obtain the Associated Legendre equation: 


sing d 


d’® d@® m 
(1+ x7) a 2x cs +[n(n +1) =]®=0 (9.366) 


The general solution is the Associated Legendre functions: 

® = EP" (cos¢) (9.367) 
Note that we have only retained the Associated Legendre polynomials of the first 
kind in (9.366) due to the boundedness of u at @ = 0 and x (see Section 9.4). The 
second ODE resulting from (9.256) is 

2 
pa = 2 FR _nin4DR=0 (9.368) 
dr dr 

This ODE is of Euler type. Thus, we can use the standard technique for solving the 
Euler type equation and the solution is 

R=Cr"+Dr"" (9.369) 
If the domain includes the origin of the spherical coordinate, we must have D = 0 
due to boundedness. Finally, the general solution for solid spheres can be expressed 
as: 


u=r"P."(cos~){Acos m0 + B sin m0} (9.370) 
For the case of rotational symmetry, the general solution can be reduced to: 
R= A,r"P, (cos@) (9.371) 


where P,, is Legendre polynomials of the first kind. 


9.7.5 Cylindrical Coordinate 


In cylindrical coordinate, the Laplace equation can be written as: 
Ou Ou u_ u lou. 1 Ou, Ou_ 


V-u= ae oe + a as + ee + carr + ag 0 (9.372) 
To prove this, we can use the following change of variables 
x=rcos0, y=rsind,r2 =x +, tan@=~ (9.373) 
Applying the chain rule of partial differentiation we have 7 
Ou Ou or Ou 00 _ ge sin@ Ou (9.374) 


+ cos 
Ox Or ox 00 Ox or r 006 
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Ou _ Ou Or ou 00 = sing cos @ Ou (9.375) 
oy or oy 060 Oy or r 00 
Reapplying the chain rule of partial differentiation again to (9.373) and (9.374), we 
obtain 


Ou > ,0°u 2sinOcosO du sin? @ 0°u sin? @ du 2sin@cosO du 
=cos’ 0 + 
ax’ er’ r orode r 067 r Oo r 00 
(9.376) 
Ou.» ,0°u 2sinOcos@ 0?u _cos*@ Gu _cos?O Gu 2sinOcosO Cu 
=sin’ 0 + + 
éy* or? r roo? Gor, r? 00 
(9.377) 


Adding (9.375) and (9.376), we finally get 
2 2 2 
Ou lou lou ou _y (9.378) 
er? or Or? BO az? 
By separation of variables, the solution can be assumed as: 
u = R(r)Z(z)O(0) (9.379) 
Substitution of (9.378) into (9.377) gives 
1 @R Wak Lge Laz _ 
R dr’ rdr Or? dQ Z az’ 
where A is a constant of separation of variables. We introduce an additional 
constant as: 


V-u= 


= (9.380) 


2 
Sum =- (9.381) 


where u is also a constant of separation of variables. Consequently, we have the 
following ODEs: 


ad’ 


2 
“—+W@=0 (9.382) 
do” 
2. 
d aan 2=0 (9.383) 
dz 
2 2 
Sees ¢ Vee ey (9.384) 
dr? rdr r 


Note that the periodicity of © requires that pp = m (i.e., m = 1,2,3,...). 
The corresponding solutions for these ODEs are 


© = Acosm0 + BsinmO (9.385) 
Z =Ccosh Az + Dsinh Az (9.386) 
= EJ, (Ar) +FY,, (Ar) (9.387) 


where u is another constant of separation of variables, and J,, and Y,, are Bessel 
functions of the first and second kinds. Finally, the solution becomes 
u ={Acosm@ + Bsin m9} {CcoshAz+ Dsinh Az} {EJ,, (Ar) +FY,,(Ar)} (9.388) 


In axisymmetric solid cylinder problems, we must set F' to zero because Y,, > 0 as 
r — 0. Thus, the solution becomes 
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u = J (Ar) {C cosh Az + Dsinh Az} (9.389) 


The value of A needs to be determined by boundary conditions. For the case of 
homogeneous boundary condition, the problem becomes eigenvalue problem and 1 
becomes eigenvalue, which will be discrete but infinite in number. 


9.7.6 Poisson Integral 


In this section, we consider the case of a two-dimensional unit circular disk. The 
two-dimensional Laplace equation is 

Ou 1 ou | 1 Ou 
a? rér 7 ae 
The boundary condition, condition of periodicity, and boundedness condition are 
respectively 


Vu (9.390) 


u(1,0) = h(0) (9.391) 
u(r,0 +2) =u(r,0) (9.392) 
lim u(r, 0) = finite (9.393) 


Again, application of separation of variables leads to 
u=v(r)w(0) (9.394) 
Substitution of (9.393) into (9.389) results in 


v"wt 1 +S vw"=0 (9.395) 
r r 
Grouping functions of r and 0 onto different sides of (9.394) gives 
2." ' " 
ee (9.396) 
v w 


where A is a constant of separation of variables. This gives two ODEs: 
2 d’y dv 
- 


2 
ae +r a A*v=0 (9.397) 
2 
Ow w=0 (9.398) 
do 
The solution of (9.397) is 
w= Acosj0+ Bsinrd (9.399) 
The condition of periodicity requires that 
Az=n, n=0,1,2,3,... (9.400) 


which is the eigenvalue of the problem. Using this value of A, (9.399) can be 
written as 


2 
fe YP nv=0 (9.401) 
dr dr 
This is again recognized as the Euler type of ODE, and thus the solution becomes 


v=Cr"+Dr" (9.402) 
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However, at the center (7 > 0) v needs to be bounded as required by (9.392), and 
we have D = 0. For the special case that n = 0, we have 


pay, dv _4 (9.403) 
dr? dr 
For this special case, we can let 
t(r) = ed (9.404) 
dr 
ae =0 (9.405) 
dr 
This is a separable first order ODE and direct integration gives 
t=C,Inr+C, (9.406) 


However, due to boundedness we have to set C; = 0. Finally, the solution can be 
expressed as: 


u(r,0) = +} (a,r" cosnO +b, 1" sinnd) (9.407) 
2 n=l 
Application of (9.390) gives 
a ~ . 
u(1,0)= + oe cosn0 +b, sinnd) = h(6) (9.408) 
n=l 
This is exactly the Fourier series expansion of (6) and thus we have 
a= ={" h() cos n6d0 (9.409) 
We-at 
b, = —{* h(O)sin n0d0 (9.410) 
nN d-x 


Thus, (9.406) can be written as 


u(r,0) = — =. ie Ce h($)cosnodg 
n=l (9.411) 


h(g)sin ngd¢) 


r" cosnO al 


r" sinnO [" 
+ ———— 
1 
This can be further simplified as 


u(r, 0) = <{" nots + a (cosnOcosng+sinnOsinnd)'dg (9.412) 


n=l 


Using the sum rule of cosine functions, (9.304) is reduced to 


1 
u(r,0) = = A= Ly cosn(O—¢)]d¢ (9.413) 
n=l 
The bracket term in (9.412) can be summed exactly and this was done by Poisson. 
More specifically, the bracket term is 


LLY rcosn(a- ) =Re(— Ltd): my (9.414) 


n=l n=l 
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where 
z" = {re? 9" =r"[cosn(@—¢) +isinn(O—¢)] (9.415) 
Since we are considering a unit disk, we have 
\z| <1 (9.416) 
Applying Taylor series expansion, we find 
Zz D5 33 . n 
— =2(l+7z+27° +7 +..)=) Zz 9.417 
tH ) 2 (9.417) 
With this summation formula, (9.413) is rewritten 
i 1 Z l+z 
—+) r"cosn(@—¢) = Re(=+ =Re 
; 2, (0-#)=Re5 +7) =Relzg 


: (9.418) 
(l+z)-z),_ (-|z|) 

= Ref —————-] = 
2-2? 2-2) 


Note that 
li—z|° =[l-rcos(@—¢)] +r? sin?(O-—¢) =1-2rcos(9-¢) +r? (9.419) 


lz” =r? (9.420) 
Back substitution of these values into (9.413) and (9.412) gives 


- 32 
u(r,0)=—— [i nor _ 

27 J-x 1+r? —2rcos(0—-¢) 
This is called the Poisson integral formula, and is a solution of the Dirichlet type of 
boundary value problem. This formula shows that the solution at any interior point 
is a weighted average of its boundary potentials. The weighting function is given in 
the square bracket of (9.420). In fact, for 7 + 1-6 (with 6 — 0), the Poisson kernel 
behaves like a Dirac delta function (see Section 8.11) of 6(6-¢) as r > 1-. Jesse 
Douglas’s celebrated paper in proving the existence of the minimal surface of soap 
film in the Plateau problem was based on the Poisson integral given in (9.420) (he 
was awarded the Fields medal because of this theory). 

The physical meaning of this weighting function can be understood better by 
referring to Figures 9.21 and 9.22. 

The denominator and numerator of the weighting functions are shown in 
Figure 9.21. In particular, the square root of the denominator of the weighting 
function is actually the distance between the observation point (7,0) and the 
boundary point at (1,¢). The square root of the numerator of the weighting function 
is also shown in the figure as the vertical height of the right angle triangle with the 
horizontal side being r and the hypotenuse being 1. If we consider the boundary 
potential at point B; with 0 = 9, the denominator approaches its minimum. 
Consequently, the weighting function approaches its maximum. That is, the 
boundary point with the closest distance to (7, 0) has the biggest effect on the 
solution. Conversely, for boundary point B2, the distance between the observation 
and boundary point is the farthest or the weighting function is the smallest. Figure 
9.22 illustrates the angular variation of the denominator and the weighting function 


ld¢ (9.421) 
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versus > for 0 = 37/4 and r = 0.6. It is clear that the minimum value of the 
denominator and the maximum weighting function occur at = 37/4. 


1+r° —2rcos(0-¢) 


| 


u(1,o) = h() 


ier 


t=?" 


1+r° —2rcos(@-¢) 


Weighting function = 


1+r? —2rcos(O- ~) 


v, 


Figure 9.22 The denominator and weighting function versus ¢ for 0 =37/4 and r = 0.6 
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9.7.7 Extremal Principle 


Physically, (9.420) has a very significant implication. Considering the solution at 
the center of the disk (r = 0) we have 


u(0,0) = =] moue (9.422) 


This shows that the value at the center of the disk is the average value of u on the 
boundary of the disk. For any non-constant function uw, there cannot be a local 
maximum or a local minimum at any interior point of the unit circle. Physically, 
when a membrane is under equilibrium (Laplace equation is the governing equation 
for this kind of steady equilibrium type of problems), there cannot be any internal 
bump in the membrane. For problems of thermal equilibrium, the Poisson integral 
formula implies that a body can achieve its maximum and minimum temperature on 
the boundary of the domain only. Conversely, if a body contains a local maximum 
or minimum in its internal temperature, it could not be in thermal equilibrium. 
Although we interpret this extremal principle from the solution for a unit circular 
disk, it actually applies to equilibrium problems of domains other than circular 
shapes. Alternatively, for problems governed by the Laplace equation, conformal 
mapping can be applied to any non-circular domain (e.g., Section 3.13 of Chau, 
2013) and to map it to a unit circle. In this way, we can also apply the extremal 
principle to other domains. In the next section, we will show a different approach in 
proving the extremal principle. 


9.7.8 Properties of Harmonic Functions 
In this section, we recall some of our observations for harmonic functions: 
(i) Integrability of the Neumann problem: 


Recall the definition of the Neumann problem given in Section 9.7.2. Note that for 
a harmonic u and setting v = | in Green’s second identity (see (8.41)), we have 


{| Sas =0 (9.423) 


Since the Laplace equation is for steady equilibrium problems, (9.422) implies that 
the fluid flow in and out of a control volume is the same in the case of potential 
flow, and the heat flow in and out of a volume is the same in the case of heat 
conduction. For a Neumann problem defined as: 


V7u=0, a = f(x, y,2), (9.424) 
Mp 


substitution of (9.423) into (9.422) gives the integrability of the Neumann problem 
as 


[[ /eo».2as =0 (9.425) 


(i1) Mean Value Theorem 
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Let us recall the integral formula from (8.52) and specify on the surface I’, a small 
sphere of radius a embracing the ae Mo as: 


O by 1 Ou 
u(M) ah. bu -— as (9.426) 


With the integrability given in re we have 


Ih. oe Sh. = dS =0 (9.427) 


where a is the radius of a small sphere around the point Mo. On the other hand, note 
that 


1 | 1 
2 2 ==“ 3 (9.428) 
On r r, or r r, a 
Substitution of (9.426) and (9.427) into (9.425) leads to 
1 
u(My) =——> } I udS (9.429) 
4ma* 2T, 


This result shows that the value of u at point Mo equals the mean value of wu on the 
surface of the sphere embracing the point. This is the mean value theorem for 
harmonic functions. 


(iii) Extremal Principle 


We can further use the mean value theorem to show the extreme principle. Suppose 
that uw reaches a maximum value at a point M inside the domain. Then, it must be 
larger than the average value on the surface of a sphere with an arbitrary radius R 
with point MM as the center, because there must be some point within less than the 
maximum value at ,. That is, 


ral 
—— udS <u(M (9.430 
Art R? 9S, uty) 
However, the mean value — in (ii) shows that we must have 
udS =u(M 9.431 
ell, wis = mn (9.431) 


This is in contradiction with (9.429). Therefore, we cannot have a maximum inside 
the domain. In other words, we can only have the maximum value on the boundary 
of the domain. Following a similar procedure, we can also show that the minimum 
can only occur on the boundary. 


(iv) Uniqueness of the Dirichlet problem 


Recall the definition of the Dirichlet problem defined in Section 9.7.1. Assume 
both uw; and uz are solutions and they are distinct for the following Dirichlet 
boundary value problem for the Laplace equation: 


V7u=0, ul. = f(x, y,Z) (9.432) 


Consider the function defined as 
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V=Uy,—-Uy (9.433) 
Since both w; and wz are solutions of the Laplace equation, substitution of (9.432) 
into (9.431) shows that v is also a harmonic function and v is identically zero on the 
boundary. The maximum principle in (111) shows instantly that the only possibility 
is that v is zero. Thus, u; and uz must be equal or the solution of the Dirichlet 
problem must be unique. 


9.8 PHYSICAL IMPLICATION OF LAPLACE EQUATION (SADDLE) 


Recall the Laplace equation for two-dimensional space 


2 2 
a = =0 (9.434) 
Ox 


Since the sum of two second-derivative terms is zero, one must be negative while 
the other one must be positive. Figure 9.23 shows that a circular domain in which 
both of the second derivatives in (9.433) are positive and both are negative, 
respectively. It is clear that there is a local minimum if both derivatives are 
positive, and there is a local maximum if both derivatives are negative. Therefore, 
as equilibrium solution in the forms shown in Figure 9.23 cannot satisfy the 
Laplace equation. 


V-u= 


2 2 
Eh EE og = 
ar ay Ou 9 Ou. 


jo>) 


Figure 9.23 The distribution of u for both derivatives are positive and negative respectively 


a 


No local extremum 


Figure 9.24 The illustration of a saddle plot for u satisfying the Laplace equation 
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Physically, for potential theory there is no maximum or minimum within the 
domain, or as expected from the extremal principle (see property (iii) in the last 
section) that maximum and minimum value can only appear on the boundary. 
Figure 9.24 illustrates a typical saddle plot of wu satisfying the Laplace equation. 
The shape of the saddle also resembles a piece of potato chip. Note there is no 
maximum or minimum within the saddle in the circular domain. 


9.9 LAPLACE EQUATION IN RECTANGULAR DOMAIN 


9.9.1 Prescribed Function along y-axis 


In this section, we consider the boundary value problem of a rectangular domain 
governed by the Laplace equation. In particular, consider the case that a non-zero 
boundary condition is imposed on the side at x =a only (see Figure 9.25) 
2 2 
a ee Gg, O0<x<a, 0<y<b 
ox” oy 
u(x,0)=0, u(x,b) =0, O0<x<a (9.435) 


u(0,y)=0, u(a,y)= f(y), OS ys<b 


u(x,b) =0 


u(a,y)= f(y) 


0 nx,0)=0 a 


Figure 9.25 Laplace equation for a rectangular domain with prescribed function on vertical side 


x=a 


This can be considered as a Dirichlet problem. Before we continue to solve this 
problem, it is instructive to note that for 2-D Cartesian coordinates our solution 
along the x- and y-axis will be either circular functions or hyperbolic functions (see 
below). It is well known from Fourier’s work related to heat conduction that any 
arbitrary function can be expanded in a Fourier series expansion in terms of sine 
and cosine, but never be expandable in hyperbolic sine and hyperbolic cosine. For 
the problem given in (9.434), in order to satisfy the boundary condition we clearly 
need to expand the given function fin the y-direction in the sine or cosine. Thus, it 
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is obvious that the fundamental solution in the y-axis must be sine and cosine, not 
hyperbolic functions. 
Let us now consider the separation of variables: 


u(x, y) = X(x)Y(y) (9.436) 
Substitution of (9.435) into the Laplace equation given in (9.434) gives 
X"Y+XY"=0 (9.437) 
Dividing through (9.436) by XY, we get 
D Gs y" 
=-—=j) (9.438) 
xX Y 


Since we have assumed_X is only a function, whereas Y is only a function of y, we 
must set both functions of x and y to a constant. We have picked a positive constant 
A, and we will see that this leads to our desired functions of sine and cosine for Y. 
In particular, we have two ODEs resulting from (9.437): 

X"-AX =0 (9.439) 


Y"+AY=0 (9.440) 
The homogeneous boundary conditions given in (9.434) leads to the corresponding 
boundary conditions for functions XY and Y as 
u(0, v) = X(0)Y(v) =0, O<y<sb => X(0)=0, 
u(x,0) = X(x)Y(0)=0, O<x<a => Y(0)=0, (9.441) 
u(x,b) = X(x)Y(b)=0, O<x<a => Y(b)=0. 
Note that we need not consider the nonhomogeneous boundary condition given in 


(9.434) at this moment. The governing equation and boundary conditions for 
function Y are 


Y"+AY=0, Y(O)=0, Y(b)=0 (9.442) 
As expected, the solution for Y is 
Y=csinJAy+c,cosVAy (9.443) 
The first boundary condition given in (9.441) leads to 
Y(0)=c, =0 (9.444) 
The second boundary condition given in (9.441) leads to 
¥(b) =¢, sinVAb =0 (9.445) 


Since we cannot set c; to zero, we must have the sine function be zero and this 
leads to 


Jab = na (9.446) 
This is the eigenvalue of the homogeneous boundary value problem. Thus, the 
eigenvalues and eigenfunctions are 


A,=n’/b?, Y,(y)=sin(nry/b), n=1,2,3,.. (9.447) 
For the function X, we have 
X"-AX=0, X(0)=0 (9.448) 
The solution of (9.447) is expressible in terms of hyperbolic functions: 
X(x) =k, cosh(nz x/b)+k, sinh (nz x/b) (9.449) 
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Note that we have already substituted the eigenvalues of 7 found in (9.446) into 
(9.448). The boundary condition given in (9.447) requires k; to be zero. Thus, we 
have 


X (x) =k, sinh (nz x/b) (9.450) 
We now combine solutions in (9.446) and (9.449) to get the fundamental solution 
u, (x,y) =sinh(nzx/b)sin(nty/b), n=1,2,3,.-., (9.451) 


The general solution of the problem is the sum of all these eigenfunctions with 
unknown constants: 


u(x, y) = ye wie = ye sinh (nz x/b)sin(nz y/b) (9.452) 


n=l n=l 
We are now ready to consider the nonhomogeneous boundary condition given in 
(9.434). More specifically, setting x = a in (9.451) gives 


u(a,y)= f(y) = >. sinh ( (nza/b)sin(nz y/b) (9.453) 


n=l 


Multiplying both sides by sin(mzy/b) and integrating from 0 to b, we get 


[, f()sin(mzy/b) dy =) c, sinh (na /)||'sin (ma y/b)sin nx y/ By 


n=l 
(9.454) 
In view of the orthogonality for the sine given in (9.294), we have 
b b 
I, f(y) sin(nz y/b)dy = sinh “ce, 2 sin? (nay /b) dy =sinh een . 
(9.455) 


Finally, we find the unknown constant c, in terms of the given function f on the 
boundary as: 


c, = sin( 2) i. f(y)sin(nz y/b)dy (9.456) 


mS 
oo fesin( "8 Jag 
u(x, y) = ash 2 sinh 2 Jsin{ "2 


b Al sinh{ 24) b b 


We know that the hyperbolic sine goes to infinity as 1 approaches infinity. For 
large n, we have the following ratio: 


sinh (nx x/ b) ett 
sinh(nza / b) gee 
We can see that the sum converges very fast as long as (a—x) is not too small. 


(9.457) 


= gon (a-x)/b (9.458) 
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9.9.2 Prescribed Function along x-axis 


In this section, we consider the boundary value problem of a rectangular domain 
governed by the Laplace equation with the non-zero boundary condition being on 
side y = b (see Figure 9.26): 
2 2 
Vga Se a9 0<x<a, 0<y<b 
Ox” oy 
u(x,0)=0, u(x,b) = g(x), O<x<a (9.459) 
u(0,v)=0, u(a,y)=0, O<y<b 
Recall from our discussion in the last section that we need to expand the given 
function g(x) in the Fourier series expansion of sine or cosine, we have to expand XY 
as sine or cosine functions. 
Following the analysis in the last section, we assume separation of variables 


as: 
u(x, y) = X(x)Y(y) (9.460) 
Substitution of (9.459) into the Laplace equation given in (9.458) gives 
X"VY+XY"=0 (9.461) 
Dividing through (9.460) by XY, we get 
ae gs —A (9.462) 
X Y 


u(a, y) =0 


0 u(x, 0) =0 a 


Figure 9.26 Laplace equation for a rectangular domain with a prescribed function on the 
horizontal side 


Note that we have to use —A instead of +A as the constant of separation of variables 
for the present problem. Consequently, the PDE results in two ODEs: 
X"+2X =0 (9.463) 
Y"-AY =0 (9.464) 
The homogeneous boundary conditions given in (9.458) leads to the corresponding 
boundary conditions for functions X and Y as 
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u(0, vy) = X(0)Y(y) =0, O< y<b => X(0)=0, 
u(a,v) =X(a)Y(y)=0, O<x<a > X(a)=0, (9.465) 
u(x,0) = X(x)¥(0)=0, O<x<a => Y(0)=0. 

The governing equation and boundary conditions for function Y are 


X"+A"X =0, X(0)=0, X(a)=0 (9.466) 
As expected, the solution for X is 
X =c,sinVAx+c, cosVAx (9.467) 


The choice of taking the negative side (9.461) is important because it results in sine 
and cosine functions as the solutions of X. Only with these solutions, can we satisfy 
the nonhomogeneous boundary condition imposed on y = b. The first boundary 
condition given in (9.465) leads to 


X(0)=c, =0 (9.468) 
The second boundary condition given in (9.465) leads to 
X(a) =c,sinVAa =0 (9.469) 


Since we cannot set c; to zero, we must have the sine function be zero and this 
leads to 


VAa=nr (9.470) 
This is the eigenvalue of the homogeneous boundary value problem. Thus, the 
eigenvalues and eigenfunctions are 


Agana ta J,(o=sin(nala), 2=1,2;3,00 (9.471) 
For the function Y, we have 


Y"-A"Y=0, Y(0)=0 (9.472) 
The solution of (9.471) is expressible in terms of hyperbolic functions: 
YO) =4, cosh (nz y/a)+k, sinh (nz y/a) (9.473) 


Note that we have already substituted the eigenvalues of A found in (9.470) into 
(9.472). The boundary condition given in (9.471) requires k; to be zero. Thus, we 
have 


Y(y) =, sinh(nz y/a) (9.474) 
We now combine solutions in (9.470) and (9.473) to get the fundamental solution 
u,, (x,y) =sinh(nz y/a)sin(nzx/a), n=1,2,3,.00, (9.475) 


The general solution of the problem is the sum of all these eigenfunctions with 
unknown constants: 
u(x, Vv) = yaw (x,y) = Ye sinh (nz y / a)sin(nzx/a) (9.476) 
n=l n=l 
We are now ready to satisfy the nonhomogeneous boundary condition given in 
(9.458). More specifically, setting x = 0 in (9.475) gives 
u(x,b) = g(x) = }°c, sinh (nzb/a)sin(nzx/a) (9.477) 
n=l 


Multiplying both sides by sin(mzzv/a) and integrating x from 0 to a, we get 
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[ g(x) sin(mz x/a)dx = s sinh (nz b / a) sin(mz x/a)sin(nzx/a)dx 
n=l 


(9.478) 
In view of the orthogonality for the sine given in (9.294), we have 


[ g@)sin (nz-x/ a)dx = sinh ae cq |. sin? (nz-x/ a) dx = gh ee @ 
0 a 0 a 2 
(9.479) 


Finally, we find the unknown constant in the solution in terms of the given function 
fon the boundary as: 


-1 
Beg nh( 222) [, g@sin(nax/a)ax (9.480) 
a a 0 


The final solution is 


yen), e@sin[=2\ag 
u(x,y)==>> “ sinh” » in * *) (9.481) 
a 


a sinh "5 
a 


We note that the analyses in the last and the present sections are similar. 


9.9.3 Prescribed Functions on All Four Sides 


In this section, we consider the more general boundary value problem of a 
rectangular domain governed by the Laplace equation with non-zero boundary 
conditions on all four sides (see Figure 9.27): 


2 2 
Vue o* oS 0 0<x<a, 0<y<b 
ox” Oy 
u(x, 0) = 8 (x), u(x,b) =i 82(x), O0<x<a (9.482) 


uQ0,y)= f(y), Ula,y)=fr(y), O<y<b 
Since the Laplace equation is a linear PDE, this problem can be solved by the 
method of superposition. As illustrated in Figure 9.28, we can break down the 
problem into four sub-problems, and each one of them has only one non-zero 
boundary condition. Mathematically, the four sub-problems are defined as: 


Problem P): 
2 2 
Vy, = 2M, FH Xo, O<x<a, O<y<b 
ox oy 
u,(x,0)=0, uw (x,b) =0, O<x<a (9.483) 


u,(0,¥) = f(y), u,(a, y) =0, 0<y<b 
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u(x,b) = g,(x) 


u(0,v)= f(y) u(a, yy) = f(y) 


0 u(x,0) = g,(x) a 


Figure 9.27 Laplace equation for a rectangular domain with nonzero conditions on all four sides 


ra 
0 S00) a 0 0 a 0 0 
Original Po Problem P; Problem Pz 
0 S(0) 
b 
+--+ 
0 g(x) @ 0 0 a 
Problem P3 Problem P4 
Figure 9.28 Method of superposition in solving Problem Po given in Figure 9.27 
Problem P2: 

Ou,  o 
V7 uy = enn 2%, O<x<a, O<y<b 

Ox oy 
u>(x,0)=0, u,(x,b) =0, O0<x<a (9.484) 
u7(0,y)=0, un(a,y)=fx(y), O<y<b 

Problem P3: 

au, a 
Vu; = “34 “3 =0, O<x<a, O<y<b 

Ox oy 
ux(x,0) = g(x), u3(x,b) =0, O<x<a (9.485) 


u3(0, y) =0, u3(a, y) =0, O0<y<b 
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Problem P4: 
2 2 
aga ay O<x<a, O<y<b 
ox oy 
ug(x,0)=0, uy(x,b) = g5(X), O0<x<a (9.486) 


u4(0, v) =0, u4(a, y) =0, O0<y<b 


The solution of the original problem is the superposition of each of the solutions of 
the four sub-problems: 

U=Uy +Uy +U3 +Uy (9.487) 
Problem P; is solved in Problem 9.7 in Section 9.11, Problem P>2 is considered in 
Section 9.9.1, Problem P3 is given in Problem 9.9 in Section 9.11, and Problem P,4 
is solved in Section 9.9.2. Therefore, by the method of superposition, the solution 
is obtained in (9.486). 


9.10 SUMMARY AND FURTHER READING 


This chapter considers the most important types of second order PDEs. Nearly all 
textbooks on PDEs focus on the discussion of three second order PDEs. They are 
the wave, diffusion, and potential (Laplace) equations. The solution of these 
equations is the main focus of the present chapter. Inevitably, there is a slight 
overlap between the present chapter and Sections 7.2 to 7.4. The role of 
characteristics in wave type or hyperbolic PDEs has been discussed in detail in 
Chapter 7 and is not repeated in the present chapter. 

For wave equations, we discuss the classic solution of D’Alembert for the  1- 
D wave, the consequence of characteristics in terms of the domain of dependence 
and of influence zone. We then continue to discuss the 2-D wave and the 3-D wave, 
and the 3-D symmetric wave. The classical formula of Kirchhoff or Poisson for 3- 
D wave is discussed and its degeneration to 2-D by Hadamard’s method of descent 
is summarized. The consequence and implication of the Huygen principle is also 
discussed. The case of n-dimensional waves is discussed in detail. The jump 
condition at the wavefront is also presented. In view of its importance in solving 
the Maxwell equation and elastodynamics problems (see Chapter 2), the solution 
method for nonhomogeneous waves is discussed. Related to the wave phenomenon, 
we also discuss the Helmholtz equation and telegraph equation. 

For the diffusion equation, we focus on the solution of the homogeneous 
diffusion equation in the present chapter, as the nonhomogeneous diffusion 
equation has been presented in Chapter 7. The one-dimensional heat conduction 
equation and 1-D consolidation equation are derived and solved. The heat 
conduction problems in underground structures subject to ground seasonal 
temperature variations are considered. The one-dimensional consolidation problem 
is also found expressible as a diffusion equation. 

For the Laplace equation, the Dirichlet, Neumann, and Robin problems are 
defined and discussed. The Laplace equation is solved in both spherical and 
cylindrical coordinates, in terms of Legendre polynomials and Bessel functions, 
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respectively. The classic result of the Poisson integral is considered together with 
the extremal principle. Some consequences of the extremal principle are discussed 
together with other properties of the harmonic functions, including the integrability 
of the Neumann problem, the uniqueness of the Dirichlet problem, and the mean 
value theorem. Graphical presentation of these consequences is demonstrated in 
Section 9.8. Finally, the boundary value problems of potential theory for the 
rectangular domain are considered using superpositions. 

Nearly all textbooks on PDEs cover all wave, diffusion, and Laplace 
equations. The reader can refer any of these textbooks for further reading. 


9.11 PROBLEMS 


Problem 9.1. Derive the following Poisson integral formula for the problem of a 
circular disk of radius R governed by the Laplace equation. Mathematically, we 
have 


Ou Leu , 1 0-u 


V-u= ae gas 2 ag =0, O0<r<R (9.488) 
u(R, 0) = h(0) (9.489) 
u(r,0+ 27) =u(r, 0) (9.490) 
limu(r, O) = finite (9.491) 
Ans: 
R= 


| a 
u(r,0) == [PO dp (9.492) 


+r° —2rR cos(O- dD: 


Problem 9.2. Find the solution of the following problem of a Laplace equation of a 
unit disk: 


Ou lau, 1 Ou 


Vru= AP ee eae (9.493) 
u(1,0) = Acos@ (9.494) 

Ans: 
u(r,@) = Arcos@ (9.495) 


Problem 9.3. Employing the procedure used in deriving the telegraph equation, 
derive the following equation for magnetic intensity H: 


ME aid =V?H - yo — (9.496) 


where H is defined as 
B=wuH (9.497) 
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Problem 9.4. In the 2-D potential flow problem derived in Section 9.7 for the 
Laplace equation, derive the following relation between the potential function and 
stream function: 


OY ee (9.498) 
Os On 


Hint: Ask yourself what these values are physically! 
Problem 9.5. Rederive (9.351) by the following steps: 


(i) Prove the identity 

Op _ Op Ox | Op oz 

Os Ox Os Oz Os 

where a is the angle between the x-axis and the stream line y = constant. 
(ii) Prove the identity 

Oy Ow Ox Oy & 

én ax On Oz On 


=v, cos” atv, sin? a =v, (9.499) 


=~v, sina(-sina) + v, cos” a = v, (9.500) 


(iii) Use the results of parts (i) and (ii) to prove (9.351). 


Problem 9.6. Use separation of variables to find the solution of the following 
diffusion problem: 
2 
Pe: ae _ ou (9.501) 
Ox” = Ot 
u(0,t) =0,u,(L,t)+u(L,t)=0, u(x,0)=f(x),0<x<L (9.502) 


Ans: 
= 2 
u(x,t) = > ce" sin( JZ, x) (9.503) 
n=l 
L 
c, = = : f(x)sin( JA, x) de (9.504) 
where A, satisfies the following eigenvalue equation: 
tan,//,L =—JA, (9.505) 


Problem 9.7. Use separation of variables to find the solution of the following 
Laplace equation for the rectangular domain shown in Figure 9.29: 
2 2 
ou 4 26, O<x<a, O<y<b 
ox" Oy 
u(x,0)=0, u(x,b) =0, O0<x<a (9.506) 


u(0, vy) = f(y), u(a,y)=0, O<y<b 
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u(x,b) =0 


u(0,v) =f) u(a, y) =0 


0 u(x,0) =0 a 
Figure 9.29 Laplace equation for a rectangular domain with nonzero boundary on x = 0 


Ans: 


NIX 
u(x, y) = Yevsin@ *)[sinh( = 7) — tanh “) cosh 1.507) 


n=l 


ein r= ya 2 e (9.508) 


Problem 9.8. Use separation of variables to find the solution of the following 


Neumann problem of the Laplace equation for the rectangular domain shown in 
Figure 9.30: 


2 2 

Lee) O<x<a, O<y<b 

ox” Oy 

u,(x,0)=0, u,(x,b) =0, O0<x<a (9.509) 


u,(0,y)=0, u,(a,y)= f(y), OS <b 
Show that 


u,(x,b) =0 


u,(a,y)= f(y) 


0 u,(x,0) =0 a 


Figure 9.30 Neumann problem of the Laplace equation for a rectangular domain 
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(i) the solution is 


u(x, y) = aes cosh(~).co sae ) (9.510) 
n=l 
where co is an arbitrary constant and 
=m. [. 40 yeos( “22 ay n=1,2,.. (9.511) 
: dad 
sinh(—— r 


(ii) the necessary condition for the problem being solvable is 
b 
[, fov@=0 (9.512) 


Hint: Physically, this condition corresponds to net influx equals net outflux. 


Problem 9.9 Use separation of variables to find the solution of the following 
Laplace equation for rectangular domain shown in Figure 9.31: 


u(x,b) =0 


u(a, y) =0 


0 u(x, 0) = g(x) a 


Figure 9.31 Laplace equation for a rectangular domain with prescribed function on y = 0 


2 2 
ae 5, 0<x<a, 0<y<b 

ox” Oy 

u(x,0) = g(x), u(x,b)=0, O0<x<a (9.513) 


u(0,y)=0, u(a,y)=0, O<y<b 


Ans: 


u(x, y) = Sc, sini inh *)—tanh( = eosn( *y) (9.514) 


n=l 


(— [ eonn( 2) (9.515) 
0 a 
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Problem 9.10 Use separation of variables to find the solution of the following 
one-dimensional wave equation subject to initial velocity: 


nw O<x<L, t>0 
u(0,t)=0, u(L,t)=0, +20 (9.516) 
u(x,0) =0, u,(x,0) = g(x), O<x<L 


2a = 
au,, =U 


Ans: 


u(x,t) =} ¢, sin(nx/L)sin(nzat/L) (9.517) 


n=l 


c 


, an g(x)sin(nax/ L)dx (9.518) 
na 


Problem 9.11 Use the method of superposition to solve the following one- 
dimensional wave equation subject to both initial deflection and initial velocity: 


au, =Uy, O0<x<L, t>0 
u(0,t)=0, u(L,t)=0, +20 (9.519) 
u(x,0) = f(x), u,(x%,0) = g(x), O<x<L 


Ans: 


u(x,t) = ie sin(nzat/L)+d, cos(nzat/L)\sin(nzx/L) (9.520) 


n=l 


Cc 


n 


2" g(x)sin(nzx/L)dx (9.521) 
na 


d, = =I f(x)sin(nxx/L)dx (9.522) 


Problem 9.12 Use separation of variables to find the solution of the following 
Laplace equation for the rectangular domain shown in Figure 9.32: 


Ou Oru 

—z +, =0, V<x<a, 0<y<b 

ox" Oy 

u(x,0) =, u(x,b) = Ax, O0<x<a (9.523) 


u(0,y)=0, u(a,y)= By, Os ysb 
(i) Find the solution. 


(ii) Find the compatibility condition in terms of A and B such that the boundary 
condition is continuous 
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u(x,b) = Ax 


u(a, y) = By 


0 u(x,0) =0 a 


Figure 9.32 Laplace equation for a rectangular domain with prescribed functions on y = b and x 


=a 


Ans: 


ry el) aaa nay 
u(x,y) = eT 1a, in msi nh(—) 
a 


(9.524) 
. 00 2B(- iy 


sin) sinh“) 
n=l nt sinh) b b 


Aa = Bb (9.525) 


Taylor & Francis 
Taylor & Francis Group 
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CHAPTER TEN 
Eigenfunction Expansions 


10.1 INTRODUCTION 


Eigenfunction expansion is a classical and a very powerful mathematical technique 
in solving certain types of boundary value problems of finite domain. When we talk 
about eigenvalues, we normally refer to the eigenvalue of a square matrix. 
However, in the context of the analysis of differential equations with certain 
homogeneous boundary conditions, we find that solutions exist only for certain 
values of the parameter embedded in the ODE. These certain values can be 
interpreted as eigenvalues and are determined from the particular set of boundary 
conditions, and play a similar role as the “normal” eigenvalues for a matrix. The 
corresponding solution form for each eigenvalue is called an eigenfunction, which 
is very much similar to the role of an eigenvector in the case of matrix analysis. In 
matrix analysis the number of eigenvalues depends on the rank of the matrix (e.g., a 
jxj matrix has j eigenvalues). Unlike the matrix analysis, the number of eigenvalues 
in homogeneous boundary value problems is typically infinite. In the case of beam 
vibrations, each eigenvalue is a natural vibration frequency of the beam and each 
vibration mode is an eigenfunction (e.g., with different number of nodes for the 
case of beam vibrations). When the same mathematical system is subject to 
external excitations (external in the sense that the excitations are independent of the 
response of the system), the general solution can be expressed as a summation of 
these eigenfunctions with a different contribution from each of these 
eigenfunctions. In the case of the dynamics of beam vibrations, a beam starts to 
vibrate when it is subject to continuous time-dependent forces (forcing or 
nonhomogeneous term). It turns out that the time-dependent vibrations can always 
be expressed in terms of a summation of the fundamental vibration modes, 
independent of the nature of the forcing terms. The contribution from each mode is 
determined from the eigenfunction expansion of the arbitrary forcing term in terms 
of the vibration modes. 

This mathematical technique of eigenfunction expansion was normally 
associated with the name of Joseph Fourier (or Fourier series expansion). However, 
the first appearance of expanding a function in terms of an infinite sine series was 
given by Leonhard Euler in 1744 (twenty-four years before Fourier was born) when 
Euler wrote a letter to a friend. The formula that we are going to discuss in 
expressing an arbitrary function in terms of Fourier sine and cosine series was 
actually first proposed by Daniel Bernoulli and completed by Leonhard Euler as 
early as 1777. However, it is appropriate at this juncture to mention that there was 
actually a dispute on the correctness of such infinite expansion between Daniel 
Bernoulli and Leonhard Euler. This all starts with the investigation of the vibration 
of a violin string, which was posed by Johann Bernoulli in 1728, and solved by 
Jean D’Alembert and Leonhard Euler. But it was the solution posed by Daniel 
Bernoulli that led to the conclusion that any arbitrary function can be expressed as 
an infinite sum of sine functions. Both Euler and D’Alembert argued that it is 
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impossible to do this, and the conclusion would not be right. However, to his 
credit, Bernoulli stood by his conclusion. Ironically, it was Euler who completed 
the correct analysis for Bernoulli in determining the coefficient of the expansion in 
1777. In 1807 (twenty-five years after the death of Daniel Bernoulli), Fourier 
presented an astonishing paper at the Academy of Science in Paris asserting that an 
arbitrary function can be expressed in such an infinite series. Fourier specifically 
stated that any even function can be expressed as a sum of infinite terms of even 
functions of sine, and any odd function can also be expressed as a sum of infinite 
odd functions of cosine. Fourier analysis was based on his analysis of the heat 
equation, which he was most famous for (the heat diffusion law is also known as 
the Fourier law of heat conduction). Masters at the time, like Lagrange, simply 
thought that Fourier’s claim was impossible. Resistance to his theory is so great 
that his paper was never published. His work on this eventually appeared in his 
book in 1822. The admirers of Fourier’s work include Lord Kelvin, and his first 
paper at the age of fifteen was on Fourier series. The rest is history, and nowadays 
Fourier series expansion is accepted universally in the domains of mathematics, 
engineering and science. Therefore, do not feel embarrassed if you cannot 
understand this eigenfunction series expansion when you first learn it. Clearly, you 
are not alone and this topic is far from obvious. 

Another well-known problem in expanding discontinuous functions in infinite 
series is that there are always wiggles around the point of sharp discontinuity. And 
these wiggles never go away no matter how many terms that we add in the infinite 
series. The following story is adopted from Nahin (2006). In 1898, a letter to the 
journal Nature was submitted by Albert Michelson, a recipient of the Nobel Prize 
in physics in 1907, and he disputed that the Fourier series expansion is not valid at 
the point of discontinuity because of this overshooting and undershooting. The 
well-known British geophysicist A.E.H. Love replied a week later, arguing that 
there was a mistake in Michelson’s reasoning and the series that Michelson 
considered does not even converge. A few weeks later, Michelson replied in a short 
note stating that he was not convinced. In the same issue, J.W. Gibbs joined in by 
criticizing Michelson’s reasoning. There was no reply from Michelson. A few 
months later, Gibbs published the most famous Nature letter that is cited in many 
textbooks. Without giving any details, Gibbs simply stated that the actual 
magnitude at the jump is overshot by 8.9% at one end estimated by the following 
formula: 


I Sine au —= = 0.089 (10.1) 
0 u 2 
Remarkably, this overshooting percentage is a constant and does not improve by 
adding more terms. It was Maxime Bocher in his 1906 paper on the behavior of 
Fourier series at the discontinuity who coined this overshooting “Gibbs 
phenomenon.” This term has been used ever since. The most interesting part of this 
story is that the result obtained by Gibbs in 1899 (given in (10.1)) was actually 
been published by a British mathematician Henry Wilbraham in 1848 (fifty-one 
years before Gibbs). The only difference is that Gibbs’s paper contained no details 
while Wilbraham gave details of derivation. 

We have covered enough history about eigenfunction expansions. In the next 
section, we start with the existence of the eigenvalue and eigenfunction for certain 
boundary value problems. 
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10.2 BOUNDARY VALUE PROBLEMS 


In this section, we restrict our discussion to second order ODEs, but the same idea 
can be extended to the discussion of higher order PDEs. We have seen in an earlier 
chapter that a PDE can be converted to a number of ODEs through the use of 
separation of variables. Let us consider the following ODE subject to three 
different types of initial or boundary conditions: 


y"+ p)y’+aOyv=8), Vv(to) = Yo. (to) = (10.2) 
y"t+ p(x)y'+q(x)y = g(x), WaA)=V, WA)=) (10.3) 
y"+ p(x)y'+q(x)y =0, y(a)=0, y(B) =0 (10.4) 


Although these boundary value or initial value problems have the same ODE 
structure, they are largely different problems. The mathematical techniques 
involved in solving them are vastly different. Note that for this second order ODE 
we need two conditions to fix the two unknown constants in the general solution, 
thus there two conditions to be satisfied in each of the problems defined in (10.2) to 
(10.4). Equation (10.2) is an initial value problem and the two initial conditions 
are given at the same point ¢ = fo. A typical example of this mathematical system 
arises from dynamic problems of mechanical oscillators, and y(f) can be interpreted 
as the magnitude of the oscillations. The initial displacement and velocity were 
imposed at t = to. This type of problem can be solved readily by Laplace transform. 
Equation (10.3) is a boundary value problem with nonhomogeneous terms. The 
third type is the homogeneous boundary value problem, and is the main focus of 
the present chapter. 


10.3 TWO-POINT BOUNDARY VALUE PROBLEMS 


In general, the determination of the solution for (10.4) is not an easy business, 
depending on the exact forms of functions p(x) and g(x). For example, the general 
form (10.4) also embraces the Bessel equation as a special case, and thus finding 
the general solution may not be straightforward as shown in Chapter 4. In this 
section, we will consider the simplest case of (10.4) with p = 0 and qg = 47. The 
solution form for this special case is easily obtained in terms of sine and cosine 
functions. However, the ideas discussed in the subsequent sections can also be 
applied to the more complicated case of (10.4). 

We now illustrate the idea of the existence of eigenvalues and their associated 
eigenfunctions in the following examples. 


Example 10.1 Consider the solution of the following two-point boundary value 
problem: 
y"+2y=0, y(0)=0, y(z)=0 (10.5) 


Solution: The general solution of the ODE is 
y=q cos /2x +c, sin /2x (10.6) 
To find the unknown constants, we have 
y(0)=c, =0 (10.7) 
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y(t) =c, sin 22 =0 (10.8) 
Since sin(V2n) # 0, we must have both constants as zero. Thus, the only solution is 
the trivial solution y = 0. 


Example 10.2 Consider the solution of the following two-point boundary value 
problem: 


y’+y=0, y(0)=0, y(z) =0 (10.9) 


Solution: The general solution of the ODE is 


y=c,cosxt+c,sinx (10.10) 

To find the unknown constants, we have 
y(0) =, =0 (10.11) 
(7) = —-c; +€,0=0 (10.12) 


Both these end-point conditions lead to the same result that c; = 0. Thus, the 
solution is 

y=c,sinx (10.13) 
where c? is arbitrary. Therefore, there are infinitely many nontrivial solutions. 


We see that the ODEs in Examples 10.1 and 10.2 are the same except for the 
coefficient in front of the second term on the left hand side. However, the solutions 
are very different. 
We now consider a much more general form of ODE: 
y"+A’y=0, y(0)=0, y(z) =0 (10.14) 
where A > 0 is an arbitrary constant. We have seen that this equation has infinitely 
many solutions if A? = 1 and has no non-trivial solution if 4? = 2. Thus, 7 = 1 can 
be interpreted as an eigenvalue of the ODE. To see whether there are other 
eigenvalues, we consider the general solution of (10.14) 
y=c, cosAx+c, sinAx (10.15) 
The first boundary condition leads to c; = 0. The second boundary condition results 
in 
sinAz =0 (10.16) 
Thus, there are infinite eigenvalues: 2 = 1, 2, 3, 4,... and the corresponding 
eigenfunctions are 
sinzx, sin2zx, sin3zx, sin4zx,,... (10.17) 
We have considered the simplest type of boundary conditions or the so-called 
Dirichlet type. In general, the boundary conditions can be specified as Neumann, 
Dirichlet, or Robin type, and they will be discussed in the next section. 
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10.4 NEUMANN, DIRICHLET AND ROBIN PROBLEMS 


We have seen from the last section that homogeneous boundary conditions may 
lead to an eigenvalue problem, and the particular form of the eigenvalue is a 
function of the boundary conditions. For the present harmonic equation, the 
Dirichlet type boundary value problem is given as: 


y"ta’y=0, O<x<L (10.18) 
y(0) = y(L) =0 (10.19) 
The Neumann type boundary value problem is given as: 
y tay =0, O<x<L (10.20) 
y'(0) = y'(Z) =0 (10.21) 
The Robin type boundary value problem is defined as: 
yt y=0, O<x<L (10.22) 
-ly'+hy=0, x=0 (10.23) 
Ly'+hy =0, x=L (10.24) 


For hy = A2 = 1 and /; = b= 0 in the Robin type boundary value problem, we 
recover the Dirichlet type boundary value problem. For /; = f2 = 0 and /; = = 1 in 
the Robin type boundary value problem, the Neumann type boundary is recovered. 
For each boundary, we can impose either of a Dirichlet, Neumann, or Robin type of 
boundary condition, and, thus, we have 3 by 3 or 9 combinations of boundary 
conditions. The general solution is 

y = AcosAx+ Bsin Ax (10.25) 


We now consider the eigenvalue and the corresponding eigenfunction for each of 
the nine cases. The boundary conditions for each case are given as: 


Case 1: 
y(0) = (LZ) =0 (10.26) 

Case 2: 
y'(0) = y'(L) =0 (10.27) 

Case 3: 
y(0) = y'(Z) =0 (10.28) 

Case 4: 
y'(0) = y(L) =0 (10.29) 

Case 5: 
—hy'(0) +h y(0) = y'(L) =0 (10.30) 

Case 6: 
—ly'(0) + yO) = y(L) =0 (10.31) 

Case 7: 
y'(0) =-hy'(L)+ hy(L) =0 (10.32) 

Case 8: 
y(0) =—-hy'(L) + y(L) =0 (10.33) 


Case 9: 
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—l,y'(0)+h,y(0) = 0, —Lhy'(L) + y(L) = 0 


(10.34) 


The results are summarized in Table 10.1. The eigenvalues for Cases 5—9 need to 
be solved numerically. Figures 10.1 to 10.3 show the evaluation of the eigenvalues 
for Cases 5, 6, and 9 respectively (open circles in the graphs). 


Table 10.1 Eigenvalues and eigenfunctions for 9 different combinations of boundary conditions. 


Case | BC: x=0 BC: x=L Eigenvalue equation Eigenfunction 
1 1, =0,h, =1 | 1, =0,hA, =1 i a0d: _ sin A,,x 
L 
2 L=1h4=0 | ,=Lh,=0 sn AL= 0% _ at cos /,,x 
L 
lL =0,h,=1 | 4 =1,4, =0 - in A, 
3 1 2 2 cosAL = 0,4, = (ea Le sin A,,x 
2 2L 
L=1,h,=0 | 1 =0,A, =1 - An 
4 1 2 2 en et eee fel I m | cosa,x 
2 62L 
LA, #0 lL=Lh =0 = 
5 poy 2 b) ae _ cos 1, (L x) 
L, cos A, L 
Lh, #0 lL, =0,h, =1 i - 
6 1 2 2 enti _ A Bi dat x) 
Al, sind, L 
7 L=1h4=0 | Lh, #0 tan 1 = 2 cos /,,x 
l, 
8 1, =0,h, =1 | L,h, #0 ei =e sin 1,,x 
2 
9 Lh, #0 1,,h, #0 j pe eee Ba ee 
AGI +—sin/,x 
né1 


3a 
L 2L 


Figure 10.1 Evaluation of eigenvalues for Case 5 
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1 a 37 2n 5a 
2L L OE L 2L 


Ah, [ 1, +p / 1p) 
V —hhy hh 


AA, / 1 +hy/ 1) 
A? —Iyhy Lily 
Figure 10.3 Evaluation of eigenvalues for Case 9 
Cases 7 and 8 are essentially the same as those given in Figures 10.1 and 10.2 (only 
replacing /, by hz). 
10.5 FOURIER SERIES 


In this section, we consider the Fourier series expansion of an arbitrary function 
ix). We assert that the following Fourier expansion exists: 
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a ~< mmx __ max 
x)=—+ Am COS +6,, sin 10.35 
0-249 {ay ae =) (10.35) 
where —L < x < L. The main objective is to find the unknown coefficients as a 
function of f(x). First of all, we recall the so-called orthogonality of sine and cosine 
functions. In particular, we find that 


L 0, m#n, 
| cos ee cos ey = (10.36) 
-L L L L, m=n; 
L 
} os te gin MX ay = 0, all m,n; (10.37) 
-L L L 
L 0 x 
| i ae ee (10.38) 
-L £ £ L, m=n. 


That is, the sine function of order m in the argument is orthogonal (in the sense of 
integration) to the sine function with order n in the argument. The same is also 
observed for the cosine function. Sine and cosine are orthogonal regardless of the 
value of n and m. 

Note that the definition of orthogonality given in (10.36) to (10.38) is also 
known as the inner product between two functions in the theory of functional 
analysis, also called linear operator theory. Functional analysis was a branch of 
rather “abstract” mathematics developed in the late nineteenth century that studies 
the general properties of the product of certain operators acting on a class of 
functions. Such linear operators may embrace the studies of integral equations, 
functional in the sense of the calculus of variations or the differential operator and 
its inversion. We will see in a later section that the definition of orthogonality 
between eigenfunctions (or inner product) will change from one ODE to another 
ODE (e.g., see (10.91)). 

The validity of these formulas can be verified as: 


L L = 
J ag cog ae ~ | Sco” ENO cag Oe 
- 24-1 L L 


L L 
: . L 
_1L cal es race de (10.39) 
27 (m+n) (m—n) Ne 
=0 
In the first step, we have applied the following trigonometry identity: 
cos Acos B = leos(4 +B) +c0s(4~B)) (10.40) 
When n = m, we have the special case 
iz 


ul ule 
2max/L |, (10.41) 


ve L 
[, (cos maya = ; ie {1+.cos(2ma x/ L)}dx = ; {e+ 


=f 
In obtaining the result in (10.41), we used the following double angle identity for the 
cosine: 
cos2A =cos* A—sin” A=1—2sin? 4=2cos* A-1 (10.42) 
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Combining (10.39) and (10.41), we have the validity of (10.36). To prove (10.37), we 
have 


(m—n)ax 


[ in cog 4 -sI, on ee Se iid 
= i, 
1 LZ .cos[(m+n)zx/L] és cos[(m za), i 
20 (m+n) (m—n) 4 
1 LZ cos[(m+n)z]—cos[-(m+n)z] 
. { 
1 (m+n) 


, cosl(m n)az |—cos[—(m—n)z | 


(m—n) 


} 


=0 
(10.43) 
This demonstrates the validity of (10.37). Finally, orthogonality of the sine can be 
proved as: 


ae max a (m+n)ax 
[, sin 7 -5, {co ]-cos[, 7 ]}dx 
: : L 
JIL sote-nyer enue, (10.44) 
20 (m—n) (m—-n) LZ 

=0 

In obtaining (10.44), we used the trigonometry identity 
cos(A — B)-—cos(A+ B) =2sin Asin B (10.45) 


Finally, for the case of m = n and in view of (10.42), we have the special case 


sin(2max/ L), F 
2max/L |, (10.46) 


L DL 
[, (sin my a - : ‘- {1—cos(2mz x/ L)}dx = : {x 


we 
Therefore, (10.44) and (10.46) gives the identity (10.38). With these orthogonal 
properties for the sine and cosine, we can multiply both sides of (10.35) by 
cos(nzx/L) and sin(nzx/L) separately, and integrate with respect to x from —L to L 
to find a, and b, respectively. In particular, the coefficients a, (n = 1, 2, ...) can be 
found as follows: 


n ts ay [0 an, nm xX 
x) cos dx = yet cos * cos dx 
[7 ; oe on 7 
(10.47) 
NIX 
+ 5 bi, [, sin “~~ cos dx 
m=1 L 


In view of (10.36) to (10.38), Pe the second term on the left of (10.47) is 
nonzero. Therefore, we have 


[, F(x)eos “dr = a, cos? “= 


2nNTX 


dx = La, (10.48) 


Rearranging (10.48), we obtain coefficient for ne cosine term 
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n 


a, =f, f(a)eos 


To find the constant coefficient ao, we can integrate both sides of (10.35) to yield 


* “dx, n=1,2,0 (10.49) 


L ay fh = Lomax ph. max 
[, S (x)dx = ok dx + Dn fs cos A dx + Dan if sin z dx = Lay 
(10.50) 
Thus, combining (10.49) and (10.50) we find 
L 
a, al P(x)eos an, n= 0,1,2y0 (10.51) 
= 


Similarly, we multiply both sides of (10.35) by sin(na/L), integrate with respect to 
x from —L to L, and apply the orthogonal property to get 


b, = = [, f(xosin 


The detailed steps of getting (10.52) are left for readers to fill in. The next example 
illustrates the procedure of Fourier series expansion. 


n 


“de, N=1,2ye0 (10.52) 


Example 10.3 Find the Fourier series expansion for the following non-smooth 
periodic function: 
—-x, —2<x<0 


x, O<x<2’ 


f0={ F(x+4) = f(x) (10.53) 


This function is shown in Figure 10.4. 


y 


Figure 10.4 Periodic triangular function 


Solution: The constant coefficient can be obtained by integrating from —2 to 2 as 
0 2 
| ( x)ax+2f xdx=1+1=2 (10.54) 
2J-2 240 
To find the unknown constants a, we can substitute (10.53) into (10.51) to get 


0 5 : : 
a= a ( x) cos MIX | e008 ee = 8/(m)°, modd 
-2 2 2/40 2 0, iietets 


a) = 


10.55 
m 2 ( ) 


Summarizing this result, we have 


7 2 
22, 8/(mz)°, m odd (10.56) 
0, m even. 
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Since f(x) is a symmetric function, we do not need to calculate b,. Therefore, the 
periodic triangular function can be written as 


aX mm x . MX 
x)=—+ a, COS +b sin 
f(s) = 24 {ay cos 2 +b, sin) 


m=1 


8 ( mx 1 3ax 1 5mx 
1 cos +—cos +—cos ture 
2 3? 2 5 2 
© (10.57) 
8 > cos(mz x / 2) 


I = > cos(2n — DES /2 
J (2n-1) 

Figure 10.5 plots the 1-term, 3-term, and 5-term Fourier series expansions. The 

sharpness at the corner increases with more terms. The solutions for 3-term and 5- 

term expansions are indistinguishable in Figure 10.5. 


r( 


m=3,5 


Figure 10.5 Fourier representation of triangular function 


10.6 STURM-LIOUVILLE PROBLEM 


In Sections 10.3 and 10.4, we have focused our discussion on the harmonic 
equation with the solution being the sine and cosine. The main observations 
discussed in previous sections also apply to a wider class of general differential 
equations. In 1836 and 1837, Sturm and Liouville in a series of papers considered 
the eigenvalue problems of a more general class of differential equation that 
includes many important differential equations in physics and mechanics as special 
cases. They showed that for such a system the eigenvalues are all real, the 
eigenfunctions of two distinct eigenvalues are orthogonal in the sense of integration 
or the inner product if a proper weighting function is included in the definition, and 
there are infinitely many eigenvalues. The Sturm-Liouville boundary value problem 
is self-adjoint and this property leads to real eigenvalues. It also bears a close 
resemblance between the so-called Hermitian matrix (or symmetric matrix for the 
case of a real matrix) and the Sturm-Liouville problem. In particular, both of them 
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have real eigenvalues. The eigenvectors of the Hermitian matrix and eigenfunctions 
of the Sturm-Liouville boundary value problem are guaranteed orthogonal. The 
main difference is that the Hermitian matrix has a finite number of eigenvalues but 
the Sturm-Liouville boundary value problem has an infinite number of eigenvalues. 
Although most physical problems lead to self-adjoint PDEs or ODEs like the 
Sturm-Liouville problem, there are also non-conservative systems that result in 
non-self-adjoint PDEs or ODEs. That will be covered briefly in Section 10.10. 
In particular, the Sturm-Liouville problem is formulated as: 


Bp” |-atipearGiy =—l[y]+Ar(x)v=0, O<x<1 (10.58) 
dx dx 


a y(0) + ayy'(0) = 0 (10.59) 


By) + By") =0 (10.60) 
where p(x) > 0 and r(x) > 0 and p, dp/dx, q and r are continuous. All p(x), g(x), 
r(x), @&1, Q, fi, and f» are real. Physically, r(x) can be thought of as coming from 
separation of variables in the original associated PDE problems. The eigenvalue of 
the problem is denoted by 2. The linear differential operator L is defined in 
(10.58). The boundary conditions are separated or unmixed. That is, one boundary 
condition on one end point only. When we set p(x) = r(x) = | and g(x) = 0, the 
eigenvalue problem considered in Section 10.4 is recovered as a special case. 


There are four main properties of this boundary value problem: 
(i) All of the eigenvalues /, and eigenfunctions @¢, of this problem are real. 


(ii) If ¢i(x) and g(x) are two eigenfunctions corresponding to the eigenvalues 11 
and Az, respectively, and if A; # Az, then 


[ s@de Corer =0 (10.61) 


That is, ¢i(x) and ¢&:(x) are orthogonal with respect to the weighting function 
r(x). The integral defined in (10.61) actually provides a new definition for the 
inner product with r(x) being the weighting function in the integrand. 


(iii) All eigenvalues of the Sturm-Liouville problem are all simple and each 
eigenvalue has its own linearly independent eigenfunction. There are infinite 
eigenvalues: 

A, <4, <Ag<r <4, <n, limd, =0 (10.62) 
no 

(iv) The eigenfunctions are complete (i.e., they can be used for eigenfunction 

expansion of arbitrary functions). 


The first two properties will be proved next. However, we will first consider the 
Lagrange identity in the next section. 
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10.6.1 Lagrange Identity 


Because the Lagrange identity is central to the Sturm-Liouville problem, it is 
considered next. We start by considering an integral of operator L defined in 
(10.58) on a function wu multiplying by another function v over the domain: 


[Hava 7 [ Powys quer (10.63) 
Applying integration by parts to the first term on the right of (10.63), we get 
-{ (pu')'vdx = {v(pu')]o +f (pu')v'dx (10.64) 
Next, reapplying integration by parts to the last term of (10.64), we find 
[ pv'du =[v'pu}}, - fic p’Juds (10.65) 


Substituting these results into (10.63), we obtain 
1 1 
I, L[u}vdx =- pP(xu'(x)v(o)], + p(x)u (x\v'@)f, + I, [-(pv’)'u + quv]dx 
: (10.66) 
Ls ' 1 
=—p(x) [u (x)v(x) —u(x)v (x)] \ + J, uL[v |dx 
The last term on the right can be moved to the other side to get 
: f t 1 
J, {Z[u]v —ul[v] } dx =—p(x) [u (x)v(x) —u(x)v (x)] F (10.67) 


This identity just obtained in (10.67) is known as the Lagrange identity. This 
identity closely resembles Green’s theorem: 


{ (uV?v—wu)dV = i 2 -y has (10.68) 


which has been discussed in Chapter 8. Now suppose re both wu and v satisfy the 
boundary condition given in (10.59) and (10.60). That is, we have 
a,v(0)+a,v'(0)=0, A,v(1)+ f,v'(1) =0 (10.69) 
au(0)+a,u'(0)=0, Awu(1) + Bu'() =0 (10.70) 
Rearranging (10.69) and (10.70), we can rewrite the first derivative term on the two 
boundaries as 


vi(l)= oe v(1), v= ud) (10.71) 
v'(0)=-2 Wo), u'(0) = -“-u(0) (10.72) 
a 23) 


Substitution of (10.71) and (10.72) into the right hand side of (10.67) gives 
= p(x)[u')v) -u(ayv'@)] 
= —p(1)[u'(v@) —uv'()]+ p(0)[w'(0)v(0) — u(0)v'(0) | (10.73) 


=P _ oa u(1)v(1) ]— pO) OO) u(0)v(0)]=0 
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It is interesting to note that right hand side of (10.67) is identically zero regardless 
of the values of a1, o2, Bi, and B2. Therefore, for the Sturm-Liouville problem 
defined in (10.58) to (10.60) the following form of the Lagrange identity applies: 


1 
| _{Zluv—ully] } dx = 0 (10.74) 
The Lagrange identity can be rewritten in another slightly different form: 
(L{u],v)—(u, L[v]) =0 (10.75) 
where the bracket is the inner product and defined as 
1 
(u,v) = | uv (10.76) 
0 


This definition of the inner space is the fundamental tool in functional analysis. The 
complex conjugate is denoted by a superimposed bar. However, for the present 
Sturm-Liouville problem both wu and v are real. 


10.6.2 Real Eigenvalues 


We are going to prove the eigenvalues and eigenfunctions are real by contradiction 
(i.e., property (i) in Section 10.6). Suppose that the eigenvalue 2 and its 
eigenfunction ¢(x) are complex as: 

A=utiv, g(x) =U(x)+iV (x) (10.77) 
where u, v, U, and V are real. Substituting the eigenfunction «x) into the 
Lagrange’s identity in (10.74), we have 


(Z[¢1,¢)—-(¢, LI¢]) =0 (10.78) 
Using (10.58), we can write (10.78) as 
(Ard,¢)-(¢, Arg) =0 (10.79) 
a (x) (x)r(x)dx = 7 [ ooo# (x)F (x) dx (10.80) 
Since r(x) is real, we have 
(A -2 )f on# (x)r(x)dx =0 (10.81) 
_,\el . : _ 
(4-4)| [u (x) +V7(x) | r@)dx =0 (10.82) 


The integral in (10.82) is real positive, so we must have 
(2-2) =2Im(A) =0 (10.83) 


Therefore, 4 must be real. Consequently, the corresponding eigenfunction is also 
real, but the details of the proof will not be discussed here. 
10.6.3 Orthogonal Property of Eigenfunctions 


Assume that there are two different eigenvalues 2; and A2 with corresponding 
eigenfunctions (x) and g(x). Obviously, they satisfy 
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L{g]= Ard, (10.84) 
L{g,]= Ard, (10.85) 

Let uandvbe ¢i(x) and ¢2(x), and (10.74) becomes 
(L141, 4) -(4,, L[¢]) =0 (10.86) 


Using the definition in (10.75) and substituting (10.84) and (10.85) into (10.86), 
we get 


A, | A (x) hy (x\r(a)de = A, [.o (x)6, (Fade (10.87) 

Since both A1, A2, r(x), #i(x) and go(x) are real, we can simplify (10.87) as 
(A, -A, fa (x) (x)r(x)dx = 0 (10.88) 

As (1 # A, then we must have the following orthogonal identity satisfied. 
[ A@oe ora =0 (10.89) 


We are going to see that a lot of important differential equations can be recovered 
as special cases of the Sturm-Liouville problem. Thus, once we can identify r(x) 
from the differential equation, we can find the orthogonal identity in the form of the 
integral given in (10.88). Thus, different ODEs require different definition of an 
inner product and a different orthogonal property. Problem 10.5 shows that the 
Bessel equation is a special case of the Sturm-Liouville problem with r(x) = x and, 
thus, Problem 10.6 demonstrates that the orthogonal property of the Bessel function 
is given in (10.281) of Problem 10.6. 

It is often convenient to multiply the eigenfunction by a constant such that the 
following normalization condition is satisfied: 


1 
| $2 (x)r(x)dx =1, n=1,2,..- (10.90) 
0 
Thus, in view of (10.88) and (10.89), the orthogonality relation can be rewritten as 
1 
J nd, Ora) = 5, (10.91) 


This normalization process is also known as Gram-Schmidt normalization. 


Example 10.4 Find the normalized orthogonal eigenfunctions of the following 
problem by Gram-Schmidt normalization 
y"+Ay=0, y(0)=0, (1) =0 (10.92) 


Solution: We have considered this problem before. The weight function is r(x) = 1. 
The eigenvalues and eigenfunctions are 


A, =n'n, y,(x)=sinnax 10.93) 
Wh Vn 
To find the orthonormal eigenfunctions, we choose k,, so that 


1 
) (k, sinnax)de =1, n=1,2,... (10.94) 
0 


This can be integrated to get 
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1 1 
i= val sin? no xdx = k2 2] (1-cos2nrx)dx=K2/2, (10.95) 
0 0 


Therefore, the orthonormal eigenfunctions are 
¢, (x) =V2sinnzx, n=1,2,.-, (10.96) 


10.6.4 Heat Conduction Problem 


Although so far in this chapter we have been focused on the two-point boundary 
value problems for ODEs, actually the discussion here is motivated by its 
applications to problems governed by PDEs. In this section, we will demonstrate 
this by considering the heat conduction problem. Consider the one-dimensional 
heat conduction problem of a rod of finite length Z subject an initial temperature 
distribution and to an isothermal boundary at the two end points, as shown in 
Figure 10.6. 


u(x,0) = f(x) 


Figure 10.6 A one-dimensional conducting rod subject to initial nonhomogeneous temperature 


The heat conduction problem along a rod is governed by a one-dimensional 
diffusion equation: 

up, =U, O<x<L, t>0 (10.97) 
where @? is the thermal diffusivity and it depends on the material of the rod. The 
isothermal boundary condition is given as: 


u(0,t)=0, u(Z,f)=0, ¢>0 (10.98) 
The initial condition of the temperature on the rod is prescribed as 
u(x,0)= f(x), O<x<L (10.99) 


Note that u should be interpreted as the temperature subtracting the constant 
temperature at the boundaries, such that the boundary condition becomes 
homogeneous. The following separation of variables is assumed: 


u(x,t) = X(x)T (0) (10.100) 
Substitution of (10.100) into (10.97) gives 
o? XT = XT, (10.101) 


This equation can be rearranged as 
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XxX T. 
Bat OO a (10.102) 
X aT 


Note that the minus sign chosen in the separation of variable constant is added to 
ensure the time dependence of the temperature to decay with time. This leads to 
two ODEs as 
X,,+AX =0, T,+Aa°T =0 (10.103) 
The boundary condition becomes 
u(0,t) = X(0)7T(t) =0, u(Z,t) = X(L)T(H =0, 1 >0 (10.104) 
The initial condition needs to be applied after the fundamental solution for both x 
and t dependences are obtained. We can see that the x-dependent problem becomes 
X,, taX =0, X(0)=X(L)=0 (10.105) 
This is precisely the problem that we consider in Section 10.4 and is a special case 
of the Sturm-Liouville problem. 

We have only demonstrated that the special case of the harmonic oscillator 
equation of the Sturm-Liouville problem can result from the heat equation. 
Actually, other forms of Sturm-Liouville problems can also result from wave 
equation, Laplace equation, and other PDEs after the application of separation of 
variables. 


10.6.5 Integrating Factors 


In this section, we demonstrate that a general second order ODE can be converted 
to the Sturm-Liouville type problem by multiplying an integrating factor. 

P(x)y"+ O(x)y'+ R(x)y =0 (10.106) 
Suppose that there exists an integrating factor 4x) such that after multiplying it by 
(10.106) we have the following structural form: 


4 ip? \4 nky 0 (10.107) 
dx dx 
This can be expanded as 
uPy" +(u'P + uP')y'+ Ruy =0 (10.108) 
Comparing (10.106) with (10.108), we obtain 
u'P = uW(O-P’') (10.109) 
This can be rearranged to give 
ee ae (10.110) 
uu P P 


Integrating and rearranging, we get 
ese Sm (10.111) 
P(x) x0 P(s) 
where xo is the initial boundary (xo = 0 for the Sturm-Liouville problem). Now, we 
can use this result to convert the following ODE to a Sturm-Liouville problem: 
P(x)y" + O(x)y'+[S(x) + AR(x)]y = 0 (10.112) 
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d dy 

—[uP—]-uSy+AuRy =0 10.113 

tt rs, MSy + AuRy ( ) 
Thus, we have 

P=HP, q=uHS, r=uR (10.114) 
where 

A) 1, 
_ 10.115 
M(x) = a ool a” (10.115) 


A slightly different approach in obtaining this integrating factor is given in Problem 
10.2. This illustrates a way to convert a second order operator to look like a Sturm- 
Liouville type problem. More precisely, we have made the operator formally self- 
adjoint but this change of variables does not help in changing the boundary 
conditions. In other words, the formally self-adjoint operators may not lead to self- 
adjoint problems. 


10.6.6 Eigenfunction Expansion 


Let @i, ¢@, ..., Gn... be the normalized eigenfunctions for the Sturm-Liouville 
problem, and let fand f' be piecewise continuous on 0 < x <1. The eigenfunction 
expansion becomes 


L(x) = Y eG, (2) (10.116) 


n=l 


o> | f(x) g, Xr (xdx (10.117) 


To show this, we can multiply (10.116) by 7(x)@n(x) and integrate over the domain; 
we have 


[, Fed¢_ oredr = a [, £64,094,0)re)ar = 268 (10.118) 
Hence, we have 


= [ir (x) Bp (x)r(ax)adx (10.119) 


This completes the proof of the eigenfunction expansion for the Sturm-Liouville 
problem. 


10.7 EXAMPLES OF STURM-LIOUVILLE PROBLEMS 


In this section, we list a number of important second order ODEs in physics and 
mechanics. All of these equations are special forms of the Sturm-Liouville 
problem. Thus, all the mathematical structures of eigenvalues and eigenfunctions 
listed in Section 10.6 apply equally to all of the following ODEs. The generality of 
this problem is the main reason why the Sturm-Liouville problem has received 
considerable attention in the past. 
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Legendre equation: 


2 
(1 ae 2x nin Du=0 (10.120) 


2 


Associated Legendre equation: 


ne ala are or u=0 (10.121 
Xx X _ 
Bessel equation: 
dud 
x? 2 xT +(x? —n?)u =0 (10.122) 
be 
Harmonic oscillator: 
2 
artolu=0 (10.123) 
Schrodinger equation for the 1-D harmonic oscillator: 
2 
-< Vey = By (10.124) 
X 
Laguerre equation: 
2 
“ +(1 Aran =0 (10.125) 
be xX 
Associated Laguerre equation: 
2 
d 
H(k-+1 ye H(a—ku=0 (10.126) 
he 
Hypergeometric equation: 
d°u du 
x(1—~x) a2 +[c-—(at+b+1)x] 7 abu =0 (10.127) 
Chebyshev equation: 
2 
al ae a nu =0 (10.128) 
he 
Hermite equation: 
2 
s 2x +2au =0 (10.129) 


The proofs for some of them are set as problems at the end of the chapter. 


10.8 NONHOMOGENEOUS BVP 
10.8.1 Nonhomogeneous Differential Equation 


Let us consider the following nonhomogeneous problem: 


L[b]=AP)9'T + gb = Mr()Oa) + f (2) (10.130) 
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a (0) + an y'(0) =0 (10.131) 


ByQ)+ By") =0 (10.132) 
Note that we have considered this problem before in Section 10.6 if f(x) = 0. We 
first consider the eigenvalue problem of the corresponding homogeneous problem: 

L[ y]=Ar(x)y(x) (10.133) 
Note that A is, in general, different from wu given in the corresponding 
nonhomogeneous equation. Let the eigenvalues and eigenfunctions for the 
following be found and are 


n> P(X), N=1,2,....n, No (10.134) 
Let us expand the one ¢ of the othe mosereons problem: 
“ 1 
P(x) = Yb,$: (x), 5, = I, A(x), (x)r(x)dx (10.135) 
n=l 


To find the coefficient b,, we need to satisfy the governing equation given in 
(10.130). Thus, we have 


L[¢]@) = Dont (24, (x) = eer (x(x) + f(x) (10.136) 
n=l 
Since the eigenfunctions automatically satisfy the boundary conditions given in 
(10.131) and (10.132), we do not need to consider them here. Clearly, we want to 
expand the nonhomogeneous function f(x) in terms of the eigenfunctions of the 
homogeneous problem as: 


Te = 
ae => 6,4,(2) (10.137) 


Note that we are expanding f(x) scaled with respect to r(x). Using the formula 
derived in Section 10.6.6, we obtain 


on =], Fe ascortoae=f, see,cae (10.138) 
Since we assume that the ie problem can be solved and f(x) is given in the 
problem, thus (10.138) can be evaluated to give c,. Substitution of (10.137) and 
(10.138) into (10.136) gives 


ys Ant (2) (%) = pry> byt ()+70D caf (x) (10.139) 
n=l n=l n=l 


Putting all terms on the same side, we get 


n=l 


r(x) [(A, — 2), — ey ] 8,2) = 0 (10.140) 
n=l 
Since both r(x) and ¢,(x) are not zero, we must have 
c 
ae (10.141) 
(A, —) 


Now, we have the solution as 
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(10.142) 


(=>) — 
n=l A, = 
This solution is valid only for An # u. 

If A, = p and c, # 0, there is no solution for the problem since there is no b, 
that can satisfy (10.140). However, if A, = » and c, = 0, then 5, is arbitrary and 
there are infinite solutions with an arbitrary constant (non-unique solution). Note 
that c, = 0 also implies from (10.138) that 


[, 099,C)4e=0 (10.143) 


Therefore, this will happen if and only if fis orthogonal to the eigenfunctions. 


Example 10.5 Find the solution of the following nonhomogeneous problem with 
homogeneous boundary conditions using eigenfunction expansion: 
-y"=3y+x, y(0)=0,y()+y() =0 (10.144) 


Solution: First, we consider the associated homogeneous problem: 
y"+Ay=0, y(0)=0, (1) + (1) =0 (10.145) 
The Sturm-Liouville problem has been considered in Section 10.4 (Case 8) and the 
orthonormal eigenfunctions have been given in Problem 10.11: 
eat. /A, = aus (10.146) 


A 


Wl 


$,(x) = V2sin Jy x 21 Dives (10.147) 


n 


(1+cos” fA ala We , 


Using these results in (10.139) and (10.142), we have 


y(x) = ys “bul a= xd, (xd (10.148) 


ma 


Integrating the second equation of a 148) by integration by parts, we find 


22 sin. fA, 
&= v2sin Jay cM Daves (10.149) 


A, 1+ cos” VA, 


Finally, the answer is written as 


sin JA, 
4 A, 10.150 
y(x) = ae 73)(I-+ eo Tm) sin J/,,x ( ) 


This completes the solution. 
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10.8.2 Nonhomogeneous Initial Value Problem 


In the last section, we considered the case of a nonhomogeneous differential equation. 
Now we return to the initial value problem of heat conduction given in (10.97) to 
(10.99). We consider ps ak case - the Sturm-Liouville problem: 


roe <tpc95 = q(xju, O<x<l, ¢t>0 (10.151) 
au(0,t) +a (040) =0 (10.152) 
ful, + pS ~( t)=0 (10.153) 
ice O<x<l (10.154) 


This is a generalized heat conduction with variable material property p(x) in the 
presence of heat source g(x). Heat equation is recovered if p(x) = a” and g(x) =0 and 
r(x) =1 (see (10.97) to (10.99)). Use the normally presumed separation of variables 

u(x,t) = X(x)T(t) (10.155) 
Substitution of (10.155) into (10.15 vi et 


r(x)AT, = “tex a q(x)XT (10.156) 
This equation can be aks as 
1 
or eg POF sat qx) X} == tan (10.157) 


where 4 is the constant of separation of variables. i. we have the following 
ODEs 


£ tx) sal —Q(x)X =—Ar(x)X (10.158) 
dx dx 
T, + AT =0 (10.159) 
The boundary condition becomes 
a,X(0)+a,X'(0) =0 (10.160) 
BX (+B) X"(I) =0 (10.161) 


We have already shown that there are infinite eigenvalues and orthogonal 
eigenfunctions. For the initial value problem given in the second equation of 
(10.159), the solution is 


T(t)=Ce™ (10.162) 
The basic eigenfunction becomes 


u(x,t) = Deng, (xe (10.163) 
n=l 
where the eigenvalues and ara are 
ns M(x), 2 =1,2,.. n> (10.164) 
The initial condition requires 
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u(x,0) =D ¢,6, (x) = £2) (10.165) 
n=l 
We now expand the initial temperature distribution by eigenfunction expansion 
1 
oy = J A, 0) So)ax (10.166) 
The final solution is 
4 aaet* 
ux = Diare | r(O9,(OL Oa (10.167) 


n=l 


10.9 SINGULAR STURM-LIOUVILLE PROBLEM 


So far, we have assumed that the problem is regular. We have also quoted in 
Section 10.7 that the Bessel equation is considered as a special case of the Sturm- 
Liouville problem. However, the Bessel equation is not regular at the origin (i.e., 
singular at the origin). We will demonstrate the singular Sturm-Liouville problem 
using the Bessel equation. Physically, this problem arises from the vibrations of an 
elastic membrane (e.g., Figure 10.7, which is generated by the software 
“Mathematica”. If we restrict vibrations to be axisymmetric, such a problem can 
be formulated as 


Ou lou. Ou 


2 
a = , O<r<l, t=0 10.168 
GP por ot? 
u(t) =0, £20 (10.169) 

u(r,0)= f(r), O<r<l (10.170) 
(1,0) =0, O<r<l (10.171) 


At r = 0, the boundary condition needs not be u(0,f) = 0 (this will lead to a zero 
solution). For this singular Sturm-Liouville problem, we need to modify the 
boundary condition at the singular point (in this case, it is = 0). The boundedness 
condition at the origin can be formulated as: 

u,u’ bounded, asr—>0 (10.172) 
That is, the membrane is supported at the circular edge and is subject to an initial 
deflection with zero initial velocity. Let us assume the following separation of 
variables: 

u(r,t) = R(r)T(t) (10.173) 

Application of (10.172) into (10.168) gives 

ie (hs ar 


ae 10.174 
> oF ( ) 


This results in two ODEs 
rR" +rR'+A°r?R=0 (10.175) 
T'+A’a°T =0 (10.176) 
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The general solutions for (10.176) are 
T =k, sin Aat + k, cos Aat (10.177) 


Figure 10.7 A typical vibration mode of circular membrane support at the circular edge 


Introducing a change of variables € = Ar, we can convert (10.175) to 
2 d’R 
dé? 
As mentioned in Section 10.7, this is a Sturm-Liouville problem. The solution of 
this Bessel function of zero order is 
Rr) = Gy Jo (Ar) + eX (Ar) (10.179) 
The solution must be bounded as r —> 0, and this leads to co = 0. The boundary 
condition (10.169) gives 


+e Fk, R= 0 (10.178) 
dx 


Jy)(4,) =9 (10.180) 
There are infinite roots for (10.180) and the general solution of the problem 
becomes 


u(r.) = > Jo Ant lk, Sin A,at + ¢, cos A, at] (10.181) 
n=l 
The initial condition given in (10.171) gives 
k, =0 (10.182) 


The initial condition prescribed in (10.170) requires 
u(r,0) = f(r) = >So (Ayr) (10.183) 
n=l 
To find the unknown constant c,, we multiply (10.183) by rJo(Anr) and integrate 
from 0 to | to get 


[ feso(ayrar 
c, = , n=1,2,... (10.184) 
I Jo (Aryl ar 


Note that we have employed the orthogonal relation given in (10.281). Substitution 
of (10.184) into (10.181) with (10.182) gives the final solution as 
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1 
ope [of 0A ar 
u(r,t) = 


Jo(A,r) cos A, at] (10.185) 
a I, r[Jo(A,r)P dr 


10.10 EIGENFUNCTION EXPANSION FOR NON-SELF-ADJOINT DE 


So far in this chapter, we have implicitly assumed that the PDE or ODE is self- 
adjoint. In fact, the Sturm-Liouville problem is self-adjoint. In this section, we 
consider the eigenfunction expansion of the following non-self-adjoint ODE: 


L[uj+Au=0, +B. (10.186) 
Let the eigenvalue and eigenfunction for this system be 
Ays Uy (10.187) 
Let the adjoint problem of (10.186) be 
L*[v]t+uv=0, +BC. (10.188) 


The corresponding eigenvalue and eigenfunction for the adjoint problem are 
(Stakgold, 1979) 
Mn =Ans Vy (10.189) 
Note that for differential operators, it can be shown that the eigenvalue of the 
original problem and the eigenvalue of the adjoint problem are related by the first 
equation of (10.189) (see p.357 of Stakgold, 1979) 
For a non-self-adjoint operator, we have 
LreL (10.190) 
Note, however, that L* = L is called formally self-adjoint, but it may not imply that 
the problem is self-adjoint. In particular, a formally self-adjoint problem may not 
be self-adjoint if the boundary conditions are not set properly. 
Let u be the eigenfunction of the eigenvalue problem given in (10.187) and 
we apply the Lagrange identity as 


((L+Au,v) =(u,[L*+A]v) (10.191) 
Thus, in view of (10.186) we have 
([L+Alu,v) = (u,[L*+A]v) =0 (10.192) 


For the solvability of any nonhomogeneous problem, for a particular value of tp we 
have either a solution for 
[L*+u]v = g(x) (10.193) 
or 
[L*+y]v =0 (10.194) 
has an eigenfunction and p is the eigenvalue. 
To prove (10.194), we now suppose that [1 is not an eigenvalue so that 
[L*+p]v #0 (10.195) 
for any v. Then, (10.193) always has a solution for any arbitrary g. This, in turn, 
requires the solvability condition for any function g being satisfied: 
(u,g)=0 (10.196) 
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This is clearly impossible unless u is identically zero. For example, g is any 
function and thus we can as well substitute g = u into (10.196) to give 


(u,u) = |u|? =0 (10.197) 
Using the terminology of vector space, we have the double vertical bar being the 
norm of the function u. Thus, u is identically zero. This means that our assumption 
is false and thus u= A is the eigenvalue and (10.194) must be satisfied. 
If the boundary conditions of the adjoint problem L* in (10.188) is chosen 
properly, the following Lagrange identity remains valid: 
(v,,,Lu,,)—-(L*v,,,u,,) =0 (10.198) 
Now if we subtract the scalar product of (10.188) with wu, from the scalar product 
of (10.186) with v,, we find 


(V,,.LUy,) —(L* Vv, .Um) = (Am —An Vn >Um) = 9 (10.199) 

Therefore, if 
Ay, 2A (10.200) 
(Vj.Um) =9 (10.201) 


That is, we have established the orthogonal relation between the eigenfunction of L 
and the eigenfunction of L*. This is referred to as the biorthogonal relation and this 
differs from the orthogonal relation between eigenfunctions of the same set of Z or 
L* (e.g., p.201 of Friedman, 1956). This is the main difference between self-adjoint 
ODEs and non-self-adjoint ODEs. 

Now, we assume that a function g(x) is Lebesgue square integrable: 


l 
I, g(x)dx <0 (10.202) 
Then, we can expand g in the eigenfunctions of problem (10.186): 


B(x) = Yay tty () (10.203) 


n=l 


Multiplying both sides of (10.203) by v(x) and integrating over the domain gives 


(pps 8) = Gy (Voy) = x Vrs an) (10.204) 
n=l 
The last term of (10.203) is obtained in view of the biorthogonal relation derived in 
(10.201). Thus, we have 


m= BULLI (10.205) 
(n> Um) 
Finally, we get the eigenfunction for g(x) 
5 8) 
x)=) Ou, (x 10.206 
g(x) aa 0) (10.206) 


For the special case of a self-adjoint problem, the eigenfunction expansion 
simplifies to 


g(x) = Se (x) (10.207) 


n=l (u, Uy 
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Example 10.6 Find the eigenfunction expansion for f(x) in the following problem: 
u"+Au= f(x), u(0)=0,u'(0) =u(1) (10.208) 


Solution: Note that the second boundary condition mixes the two end points. This 
mixed boundary condition will lead to a non-self-adjoint problem. The 
homogeneous problem is 


u"+Au=0, u(0)=0, u’(0) =u(1) (10.209) 
The general solution is 
u=csinVax+dcos Jax (10.210) 


The first boundary condition leads to d = 0. The second boundary condition leads 
to 

A, =sin,/A, (10.211) 
Apparently, there is only one root 29 = 0 with a corresponding eigenfunction of uo 
= x. However, we are going to see that this mixed boundary value problem is not 
self-adjoint, and thus there is no guarantee for the eigenvalue being real. Let us 
consider the adjoint problem: 


I 1 
(Lu,v) = } u"vdx =[u'v —uv'|o +f uv"dx 
0 0 


(10.212) 
=[u'd —uv']) +(u, LV) 
Again the superimposed bar denotes the complex conjugate. Therefore, we find 
that L* = L. Thus, the operator is formally self-adjoint. The boundary terms in 
(10.212) is 
[u'v — uv’ Jp =u'(l)v (1) —u()v'() —u'(0)v (0) + u(0)v"(0) 
=u'(Iv()—u()[vO)+v')] 
The last line of (10.213) is obtained by using the boundary conditions in (10.209). 
To set the result in (10.213) to zero, we require 
v(1)=0, v(0)+v‘(1) =0 (10.214) 
Thus, the adjoint problem is 
v"’+uv=0, v(I)=0, w(0)+v'(1)=0 (10.215) 
Since the boundary conditions are not the same as the original problem, it is not 
self-adjoint although the operator is formally self-adjoint. In view of the second 


boundary conditions given in (20.215), we see that the solution of the first equation 
of (10.215) can be written as 


(10.213) 


v=csin fu(1—x) (10.216) 
Substitution of (10.216) into the second boundary condition in (20.215) results in 
the following eigenvalue equation 


VH, =sin Jy, (10.217) 
which is identical to (10.211). Recall from (10.189) that the eigenvalues of the 
original operator of its adjoint operator are related by 

ed, (10.218) 


a 
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For [to = 0, we can show that the corresponding eigenfunction is vo = 1—x. For n > 
0, the eigenfunction of (10.215) becomes 

v, =esin JZ, (Ix) (10.219) 
Since the problem is not self-adjoint, we shall also look for complex eigenvalues. 
To do that, let us assume 


VA =a+iB (10.220) 
Substitution of (10.220) into (10.211) gives 
VA =sinJA = sin(@ +if) =sinacosh 6 +icosa@sinh 2 (10.221) 
Thus, we have 
a=sinacoshf, £=cosasinh Z (10.222) 


Let us look for asymptotic behavior of the complex eigenvalues for large A (since 
we want to see whether there are infinite roots). Suppose a — ©; from the first of 
(10.222) we also have 8 — o. The second equation of (10.222) gives 


- a —>0, cosa—>0 (10.223) 
sinh £ 
Therefore, we can set o as: 
a=(2m+ a + Ey (10.224) 


such that cosa — 0, and €2» is a small number. Substitution of this result into the 
first equation of (10.222) gives 


1 . 1 
a=(2m+ a + &,, = sin[(2m + a” + &,,|cosh B 


. 1 ; 1 
=[sin(2m + oo COS Ey, +SIN E>, COS(2m + al cosh £ (10.225) 


= cosh £ 
For large a and 8, we have the approximation 


cosh B = (2m + a (10.226) 
This can further be simplified as 
cosh f = ste +e?)x se = (2m+ a (10.227) 


The last part of (10.227) is from (10.226). This can be solved for B as: 
Br tln[(4m+1)z] (10.228) 


The plus and minus signs result from the fact that the cosh is an even function. To 
find an approximation for a, we note that (10.220) can be rewritten in the form of 


Euler’s formula as 
VA =atip=Ja7?+p° {cos 6 +isin 0} (10.229) 


where 
6 = tan"! 2) (10.230) 
a 


The following approximation is made: 
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Ja +B” ={[(2m+ a + Eqn) + {in[(4m + 1)z]}7}7 


; j (10.231) 
=(2m +5) + 55° +,..+0(€,,)} 
where 
He In{(4m+1)z] (10.232) 
1 


In arriving at (10.231), we have assumed that €2,, << 6 and it is obvious that 6 < 1. 
Similarly, we approximate cos0 as 


1 
a at ET Say 


ar +B m+ atl +58 +...+O(E>_)} (10.233) 


cos 0 = 


1 
as oe Ht Osa) 
We now back-substitute (10.231) and (10.233) into (10.229) to get the following 


approximation: 
a= yo +B cosO 


= 2m+S)a{l +55" bah fl 5e Bat (10.234) 


I 1 a4 
=(2m+—)a{l-—6' +... 
Cie a ary } 
Plugging in the definition of (10.232) into (10.234), we find 


4 
v=One5p-2 ave on (10.235) 
2 {(4m+l)x} 
Substitution of (10.235) into (10.229) gives 
4 
aCe oe—2 ene +...£iIn[(4m+1)z] (10.236) 
2 {(4m+l)x} 


Note that this result does not agree with the result given on p. 204 of Friedman 
(1956). The powers 3 and 4 in the second term on the right are missing in Friedman 
(1956). Strictly speaking, this result is good for large values of m. Nevertheless, 
this approximation is illustrated in Figure 10.8. 

For these eigenfunctions, we can check the bi-orthogonality as: 


(4%) =f sin( YZ, sin[ 7, 2) 
7 f. sin(/Z,x)sin[JZ,, (1 x)]dx 


Note the following trigonometry identity: 


(10.237) 


sin Asin B = sleos(4~ B) cost + B)] (10.238) 
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Using this (10.238), we get 


Im{Vv4} 
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eee 
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e 
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Figure 10.8 The complex roots of (10.236) for non-self-adjoint problem are plotted 


Uy Wn) = 5 J, (08L ay [ay (1=2)] cost 2x + Jy = 2) 


at sin J2,x+ Van*—VFnl_ sinlJA,x- Jane + JFnl|| 

(2 Van Vn Van + Vn 

i sin /Z, +sin Jf, sin fA, -sin fA, 

2( Vn tlm Van Van 
In view of the eigenvalue equation (10.217), we can simplify (10.239) as 
Pati fl 
(Uy >Yin) (10.240) 

aie Van Vn - Vn 


When n = m, we can redo the integration to get 


1- 
(u,,.V,) =————"— — (10.241) 


(10.239) 


0 


For the case n = m = 0, we have 
1 
| Ya (10.242) 
0 6 
Applying (10.206), we can write the expansion as 
f(x) =a) + Da, sin JA, x (10.243) 


n=l 


where 


a, = 6 {, f(xl—xdx (10.244) 
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a ee 
ae Ne J, /@)sin JA, -x)dx (10.245) 


As we have seen from this example, the most difficult part of the analysis is not 
getting the eigenfunction expansions for the function f(x) for non-self-adjoint 
problems but finding the complex eigenvalues and corresponding eigenfunctions in 
the first place. 


Another problem associated with non-self-adjoint eigenfunction expansion is that 
sometimes the eigenfunctions that we find may not be complete. As a consequence, 
we need an extra function (but not the actual eigenfunctions) to make it complete. 
Such eigenfunctions are known as “generalized eigenfunctions” (see Friedman, 
1956). If such generalized functions exist, we must add them. Otherwise, the 
answer is not correct. Therefore, we have to be more careful in dealing with non- 
self-adjoint problems. That is, eigenvalues may be complex and generalized 
eigenfunctions may be necessary in ensuring completeness. In the next example, we 
will illustrate the existence of “generalized eigenfunctions.” 


Example 10.7 Find the eigenfunction expansion for f(x) in the following problem: 
u"t+Au= f(x), u(0)=0, u'(0) =-w'(1) (10.246) 


Solution: The associated eigenvalue problem is 

u"+Au=0, u(0)=0, u'(0)=-w'() (10.247) 
The second boundary condition in (10.247) is mixed, including both boundary 
points in a single condition. The adjoint problem of (10.247) is found to be 


v"+Av=0, v'(0)=0, v(0) =-v(1) (10.248) 
Therefore, the problem is not self-adjoint. The solution for (10.247) is 
u = AcosVAx+BsinVAx (10.249) 


The first boundary condition leads to A = 0, and the second boundary condition in 
(10.247) leads to the following eigenvalue equation: 


Ja =—Vacos VA (10.250) 


The only possible solution is 


cosV¥A =-1 (10.251) 
Note that for this case A is not zero. Therefore, we have 
VA =(2n+)a (10.252) 
Thus, the eigenvalues and eigenfunctions become 
A, =(2ntl)’x7, n=0,1,2,... (10.253) 
u, =sin(2n+l)ax, n=0,1,2... (10.254) 


Luckily, the eigenvalues are all real (unlike the last example of the non-self-adjoint 
problem). For the adjoint problem, we also have the eigenvalue being 
LH, =A, =(2n+1)?x’, n=0,1,2,... (10.255) 
The corresponding eigenfunctions are 
u, =cos[(2n+l)z0--x)], n=0,1,2,.. (10.256) 
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The biorthogonal condition for these two eigenfunctions is 
1 
(u,.v,) = ( sin[(2n + l)rx]cos[(2n +1)a(1 — x) dx (10.257) 
0 


Using the sum rule for the cosine function, we find 
cos[(2n +1)z(1— x)] = cos(2n + 1)z cos(2n + l)zx +sin(2n+1)zsin(2n+1l)zx 


=—cos(2n + l)zx 
(10.258) 
Substitution of (10.258) into (10.257) gives 
1 
(i.e) 22 {. sin[(2n +l) rx] cos[(2n + l)ax]dx 
1 
= ; I, sin[(4n +2)arx]dx (10.259) 


L cos[(4n+2)zx]| _ 

2 (4n+2)x |, 
Thus, we have demonstrated that the bi-orthogonal relation is satisfied. Apparently, 
the eigenfunction expansion for f(x) is 


f= Sy Le PD sinton +)rx] (10.260) 
n=l Vy oUn) 

But, this result is wrong because the eigenfunction is only for all the odd terms in 

the sine series and it is an incomplete expansion. The correct eigenfunction for this 

case should be: 


f(o= ye sin J, x+b,xcos /Z, (1-x)] (10.261) 
where " 
a, =—4 [, f(x\—x)sin JZ, (1—x)dx (10.262) 
b,=4 {, f(x)cos yA, (I x)dx (10.263) 
JA, = (n+) (10.264) 


Note that this is no longer a Fourier sine series expansion. Without going into detail 
in obtaining this, we should simply say that we need an additional function for 
completeness. The function in this case is 


i, =xcosfA,,x (10.265) 


This function satisfies the boundary conditions, but, however, does not satisfy the 
original eigenvalue problem. That is, 


(L+A,)ii, =—2,fA, sin fA, x #0 (10.266) 
Instead, it satisfies the following ODE 
(L+A,)°u, =0 (10.267) 


This is also equivalent to solving the following problem 
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(L+A, it, =u, (10.268) 


Actually, (10.267) or (10.268) provides the definition of the generalized 
eigenfunctions. In a sense, this formulation is similar to Case 2.2 of Chapter 5 for a 
system of first order ODEs when the matrix is non-Hermitian. Recall the similarity 
between the properties of eigenvalues of a Hermitian matrix and the properties of 
eigenvalues of self-adjoint ODEs. 

The main purpose of this example is to illustrate that generalized 
eigenfunctions may be needed for non-self-adjoint problems. Eigenfunction 
expansion for a non-self-adjoint problem is by no means straightforward and is 
very difficult. Their detailed discussion is, however, out of the scope of the present 
chapter. 


10.11 SUMMARY AND FURTHER READING 


In this chapter, we have summarized the essential ideas of eigenfunction 
expansions. We started with two point boundary value problems. Dirichlet, 
Neumann, and Robin problems are presented in the context of heat conduction. 
Fourier series expansion is introduced before we considered the general Sturm- 
Liouville problem, including derivation of the Lagrange identity, proof of the 
realness of eigenvalues, and the orthogonal property of eigenfunctions. We also 
summarized the list of classical equations that the eigenvalue problem can be 
studied under the framework of Sturm-Liouville problem. The use of eigenfunction 
expansion in solving the nonhomogeneous boundary value problem is 
demonstrated. A singular Sturm-Liouville problem is demonstrated by considering 
the vibration problems of circular membrane. Finally, the more advanced topic of 
eigenfunction expansion for non-self-adjoint ODEs is considered. 

There are many good references for eigenfunction expansions, including 
Stakgold (1979) and Friedman (1956). The interested reader can consult these and 
other references. 


10.12 PROBLEMS 


Problem 10.1 A linear second order ODE is given by: 
2 


ag (x) +a, aan (x) + Aa,(x)Ju = 0 (10.269) 
dx dx 
Show that it can be as a Sturm-Liouville problem if the following is satisfied: 
a,(x) = day(®) (10.270) 
dx 


We can ignore the boundary condition for this problem. 


Problem 10.2 A linear second order ODE is given as: 
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2 
a(x) + (8) Laas) + Aas (a) =0 (10.271) 


(1) Compare this equation with the Sturm-Liouville problem and show that 
PO) _ PI) _ 9) _ 1) _ ay eae 
Ay(x) a(x) a(x) a(x) 

(ii) Take the first two terms from the ‘aia obtained in (i) to show that 


I(x) = xpl|“ ae) (10.273) 
ME x) Ay (x 2 
(111) Show that p, g, and r in the Sturm-Liouville problem are 
a (x) 


pix) = expt 2 yeh q(x) =1(x)az (x), (x) = 43(x) (x) (10.274) 


Therefore, (10.271) can - converted to the Sturm-Liouville problem of 
multiplying /(x) obtained in Part (ii) if ai(x) # a(x). 


Problem 10.3 Use the result of Problem 10.2 to convert the following ODE to a 
Sturm-Liouville problem: 


2 
OAS er (10.275) 
dx? dx 
Ans: 
x2 x 
ae 2 Dia ge 2 y =0 (10.276) 
dx dx 


Problem 10.4 Use the result of Problem 10.2 to convert the following ODE to a 
Sturm-Liouville problem: 


2 
£Y 2 jy=0 (10.277) 
dx dx 
Ans: 
d_.d 
Se ®)+Ae*y =0 10.278 
rm ie y ( ) 


Problem 10.5 Show that the Bessel equation given below can be converted to a 
Sturm-Liouville problem: 


d? d 
2a y Ly 2 2 
x tx —+4+(Ax* -—v~)y=0 10.279 
a2 tie yy (10.279) 
Ans: 
2 
gD cies” yi) (10.280) 
dx dx x 


Problem 10.6 Show that the Bessel functions satisfy the following orthogonal 
relation: 
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[we x), (2 x)dx=0, n#m (10.281) 
0 a a 


where J(anx/a) satisfy (10.132). 
Hint: Use the result of the Sturm-Liouville problem. 


Problem 10.7 Show that the Laguerre equation can be converted to the Sturm- 
Liouville problem: 


d*y dy 

er den + ay=0 10.282 

a0 (l-x) no ( ) 
Ans: 

lt Oe ee re (10.283) 

dx dx 


Problem 10.8 The Legendre equation is defined as 


d°y_, dy 
1-x? 2x +n(n+)y=0 10.284 
(nx?) F 2a n(n Dy (10.284) 
Recast the Legendre equation in the form of the Sturm-Liouville problem. Find the 
eigenvalue A of the corresponding Sturm-Liouville problem. 

Ans: 


Hie 2 
i ll- tan ty =0 (10.285) 


Problem 10.9 Suppose that the boundary conditions given in (10.59) and (10.60) 
are replaced by the following periodic boundary conditions 


P(O)= pl), yO)=y0), y'(O)=y'A) (10.286) 
Show that the following special form of the Lagrange identity remains valid 
1 
| {L[ulv—ul[v] } de = 0 (10.287) 
0 


Problem 10.10 Show that the following special form of the Lagrange identity 
remains valid 


| {L[ulv—ul[v]} dx = 0 (10.288) 
for the following singular boundary conditions: 
(i) p(0)=0, ay(0)+ By'(1)=0 (10.289) 
(ii) pd)=90, ay(0)+ By'() =0 (10.290) 
(iii) p(0) = p(a) =0 (10.291) 


Problem 10.11 Find the orthonormal eigenfunctions of the following problem 
y"+Ay=0, y(0)=0,y(1)+ y'(I) =0 (10.292) 
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Ans: 
V2 sin VA, x 
(1+ cos” hs a , 


9, (x) = = 1,200 (10.293) 


CHAPTER ELEVEN 


Integral and Integro-Differential 
Equations 


11.1 INTRODUCTION 


In the first part of this chapter, we will summarize the idea of integral transforms 
versus eigenfunction expansion that we discussed in the last chapter. Various types of 
integral transforms will be introduced, including the Fourier transform, Hankel 
transform, Mellin transform, Hilbert transform, and Laplace transform. When integral 
transforms are applied to ordinary differential equations, ODEs will become algebraic 
equations that can be solved readily most of the time. If integral transforms are 
applied to partial differential equations of two variables, the PDE will become an 
ODE. The resulting ODE is, of course, much easier to solve than the original PDE. 
For PDEs with n variables, each time an integral transform is applied, the resulting 
PDE will only involve n—1 variables. Therefore, when an integral transform is applied 
repeatedly, the PDE will eventually become an algebraic equation. However, due to 
space limitations we only cover the basics of integral transform and its introduction is 
primarily for setting the scene for the more difficult problems of integral equations 
and of integro-differential equations. 

The second part deals with problems that need to be modeled by integral 
equations (governing equations involving integral of the unknown) and _ integro- 
differential equations (governing equations consisting of both differentiation and 
integrations of the unknown function). The word “integral equation” was coined by 
Du Bois-Reymond. Integral equations that we discuss include the Abel integral 
equation, Hilbert integral equation, Fredholm integral equation, and Volterra integral 
equation. Integro-differential equations will be considered using the Laplace 
transform. 

Historically, the integral of the Laplace transform type and of the Mellin 
transform type was first considered by Laplace 1782. The Fourier transform was 
developed by Fourier in 1811 when the theory of heat conduction was considered. 
Formulating the mechanics problem of a sliding bead along a frictional wire, Abel 
formulated the Abel type of integral in 1823. In particular, Abel considered the 
tautochrone problem that for a given slide time what would be the shape of the wire, 
in contrast to the brachistochrone problem of the Bernoulli brothers. The Fredholm 
type of integral equation was first considered by Liouville in 1837 and 1838 based on 
the idea of successive substitutions of Neumann. However, the work of Neumann was 
not published until 1870. In 1890, French mathematician Picard formulated the 
successive approximation in a more general and a widely applicable form. In more 
recent literatures, the method of successive approximation has been mostly associated 
with Picard, instead of associated with Liouville and Neumann. This procedure was 
reconsidered by Volterra in 1896 and 1897 to solve the so-called Volterra integral 
equations. Volterra recognized that integral equations could be interpreted as a 
limiting form of a system of 7 linear algebraic equations for n unknowns as n > . 
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The eigenvalue problem of integral equations was considered by Poincare in 1896. In 
1900 and 1903, Fredholm solved the more difficult problem of Fredholm integral 
equations and of asymmetric kernel functions in the integral. Following up the idea of 
Volterra, Fredholm again divided the interval into equal parts and used linear 
algebraic equations to consider when the solution of a nonhomogeneous integral 
equation exists. He had been able to solve the Fredholm integral equation in terms of 
a determinant of the kernel K and the so-called first minor of the kernel K. This leads 
to a very general theorem called the Fredholm alternative theorem, which is also 
applicable to differential equations. Hilbert wrote a series of six papers from 1904 to 
1910, and made major progress in the analysis and showed that the solution can be 
expressed in orthogonal eigenfunctions of the eigenvalue of kernel K. The analysis 
somewhat resemblances Fourier’s eigenfunction expansion. This is called the Hilbert- 
Schmidt theorem of integral equations, as Schmidt in 1907 extended Hilbert’s work to 
non-symmetric kernels. This eigenfunction expansion for an integral leads to the 
concept of the abstract functional analysis and functional space (like the Hilbert 
space). Hilbert was able to show that the Fredholm integral equation is equivalent to 
the solution of differential equation. The eigenvalue problem of ODEs is also 
equivalent to the eigenvalue problem of an integral equation considered by Poincare 
and Fredholm. The major concept of the spectrum theory of the eigenvalue of the 
differential equation was also formulated. In particular, when an eigenvalue of an 
ODE with associated boundary conditions is in the form of a discrete spectrum, the 
eigenfunction expansion should be used to obtain the solution. However, when there 
is a continuous spectrum of eigenvalues for the associated problem, typically for the 
infinite domain, the problem has to be solved by integral transform. 


11.2 INTEGRAL TRANSFORMS 


There exists different techniques of integral transforms, which are motivated by 
solving different types of linear differential equations. Eigenfunction expansion 
discussed in the last chapter is suitable for solving problems of finite domains, for 
which eigenvalues and eigenfunctions can be found, but they are useless in solving 
problems involving the infinite domain. For finite domains, the eigenvalues are 
discrete even though there are an infinite number of them; however, for the infinite 
domain, eigenvalues are continuous. We said that there is a continuous spectrum of 
eigenvalues. For problems with a spectrum of eigenvalues, we need to apply an 
integral transform instead of applying eigenfunction expansions. 

In fact, the integral transform technique is developed for solving linear 
differential equations. We will discuss briefly the most commonly encountered 
integral transforms, including the Fourier transform, Hankel transform, Mellin 
transform, Hilbert transform, and Laplace transform. All of them can be cast into the 
following general transform and its inversion as: 


F()= J" Ks, fd, -w<a<fpso (11.1) 


f= { H(s,t)F(s)ds, 0S a<B<oo (11.2) 
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Different kernel functions K(s,f) lead to different types of integral transforms. The 
limits of integration also vary depending on the type of transform. 


11.2.1 Laplace Transform 


One of the most popular integral transforms is the Laplace transform, which is 
applicable to problems in the semi-infinite domain. It can be shown that the Fourier 
transform to be presented next is equivalent to the Laplace transform for the 
infinite domain (Sneddon, 1951). The Laplace transform was originally proposed 
by Euler and developed to the present form by Laplace. It had been known as 
operational calculus and its application in engineering is mainly publicized by 
Oliver Heaviside in the 1980s. The kernel of the Laplace transform is 


K(s,t)=e™" (11.3) 
The resulting integral transform is 
F(s)=UsOl= fe" fdr, (11.4) 
PQ) = LR = [et Fs) ds (115) 
271 4 c-ix 


Its application has been mainly for the time variable in differential equations. A 
brief summary of the Laplace transform can be found in the appendix of Chau 
(2013) and in Spiegel (1965). Since the solutions of linear differential equations 
with constant coefficients are based on the exponential function, the Laplace 
transform is particularly useful for such differential equations. 

In view of its importance in application to differential equations, we will 
discuss the Laplace transform in more detail. Formula (11.5) is called Bromwich’s 
contour integral for the inversion of the Laplace transform. This formula is now the 
standard form of the inverse of the Laplace transform. Bromwich learned 
Heaviside’s operational calculus (i.e., the Laplace transform of today) and derived 
this important result. Bromwich died at the age of 44 and his result was unnoticed 
until J.R. Carson and H. Jeffrey made it well known. 

Suppose that we want to evaluate the following integral (1.e., definition of 
Laplace transform): 


F(s)=UsOl= fe" fdr, (11.6) 


To solve for f(t) in (11.6) or to derive (11.5), we can consider the following 
complex Fourier transform: 
; eee eee ee 
e* f(x) =| em ee" £(t)dt\d@ (11.7) 
TT ¢-c0 —0o 
The transform will be discussed more in the next section. If we impose the 
condition that 
f(x)=0, for x<0, (11.8) 


using (11.8), we have (11.23) becoming 
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eo F(x) = fie ee F(t)dt]da 


ce) 
0 


(11.9) 


Now, c is an arbitrary constant and we want to choose its value such that c will give 
the limit of convergence of the solution or c will be larger than the real part of all 
singularities of the function F(s). Mathematically, we want to choose c such that the 


following integral exists: 
is el f(@)\dt 
Multiplying (11.9) by e“, we get 
f= = [. cage e "}' Fa) dt do 


Now we can apply the following change of variables: 
s=ct+ia, or ds=ida 
The limits of the outer integral will be shifted by 
Oa>n, sS=ctin; @O>-0, S=C—I0 
With this change of variables, (11.11) can be rewritten as 
792 emtf eo f(Odt|ds = — | e™ F(s) ds 
ni c—ico 0 2ni c—ico 
This is the Bromwich contour integral, and the proof is completed. 
We will look at a number of special cases. 


Example 11.1 Find the Laplace transform of a constant C. 


Solution: The Laplace transform of a constant is 
L{C}=[ "cea 
This can be solved easily by direct integration 
L{C}= cf evar 


(oe) 


Ce 


Ss 


0 


Example 11.2 Find the Laplace transform of 
Lg = [oemetat 
i, 


Solution: This can be solved directly by integration: 


(11.10) 


(11.11) 


(11.12) 


(11.13) 


(11.14) 


(11.15) 


(11.16) 


(11.17) 
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—(s—a)t ca 


L{ a \ = [oe “etae =u 


0 s-a 


0 (11.18) 


1 
= , s>a 
sa 
Thus, the formula can be obtained in a straightforward manner. 


Example 11.3 Find the Laplace transform of the cosine 


L{ cos(at) } = J, e ” cos(at)dt (11.19) 


Solution: We have already seen from Example 11.2 that an exponential function 
can be integrated readily. We can first express the cosine function by Euler’s 
formula as 
wo (plat —iat 1 1 1 
L[cos(at)]= | i) pop 
0 2 2|s-ia stia 


(11.20) 


Ss 


a ara 
S +@ 
Note that the order of s is higher in the denominator than in the numerator, and this 
observation also applies to the former two examples. 


Example 11.4 Find the Laplace transform of the sine 


F(s) =L{ sin(at)} = Le sin atdt (11.21) 


Solution: Although we have shown in Example 11.3 that the Laplace transform of 
the cosine can be evaluated using an exponential function via Euler’s formula. In 
general, we can also use this technique for (11.21), but we will proceed following 
integration by parts here 


F(s)= ae" cos at) / al. a. | e cos a 
0 a 0 


(11.22) 
1_s ee: 
=a | e  cosat 
a a\40 
Reapplying integration by parts again, we have 
1 : 0 oY ep 
F(s)= Lisle sin at)/a| +2/ ce sinat 
aa 0 ado 
: (11.23) 
1 
=—--5 F(s) 


aa 
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We observe that the unknown appears on the right hand side as well. We can put all 
unknowns on the left hand side to obtain 

a 
a 


F(s)= s>0 (11.24) 
Ss 


This is the required formula. 


Example 11.5 Find the Laplace transform of the Heaviside step function: 
0, t<c 
H(t-c)= ' (11.25) 


t2c 


This is the Heaviside step function. 


Solution: Substitution of (11.25) into (11.6) gives 


L{H(t-o)} = Joe eH (t—c)dt =| edt (11.26) 
This can be integrated directly as 
L{H(t-c)} = | "etda—te#| <2 (11.27) 
c s Ps s 


One of the main applications of the Laplace transform is to solve ODEs. Similar to 
the Fourier transform, the Laplace transform can convert differentiation to an 
algebraic equation. In particular, we can find the following transform of 
differentiation of a function: 


|£ } rioe"| [sol -se™ Jar 


(11.28) 
=0- f(0)+5 } “¢(Ne“dt 
0 
Therefore, we get 
| £0 oO}. sF(s)— f(0) (11.29) 
The process of integration by parts can be repeated to get 
2 
| £20 |-2r)-s10)- f'(0) (11.30) 
| eno Zz o). S3F(s)—s? f(0)—sf (0)- £0) (1.31) 
fee o =5"F(s)—s" “f0)—8" "fF O)—.-f" 0) = 1.1.32) 
t 
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Using these formulas, we can easily convert linear ODEs to algebraic equations. 
Without going into detail, we also record the following formulas: 


[fo ]--v" 2) 


22} i F(u)du (11.34) 


These formulas can convert ODEs with non-constant coefficients to ODEs with 
constant coefficients. 
Before we apply to these formulas to solve ODEs, we note that 


Ll f+eg(t)} =<, Le f(dt +c, | em g(t)dt 
=o L{fO} +eLl{g@o} 


This is the property of a linear operator. Similarly, we can also show that the 
inverse Laplace transform is also a linear operator. If we define F(s) as a sum of a 
number of transformed functions: 

F(s)=F,(s)+F,(s)+' +F,(s), (11.36) 


=(-1)" F(s) (11.33) 


(11.35) 


we have 
fOH=L"{F(S) HL {A(s)} +L {F,(5)} . (11.37) 
There is also a shifting property of the Laplace transform: 


Lies} = [eet sat = fe Pde = F(s—c). (11.38) 


where 
L{fO}=FG). (11.39) 
We now consider a very powerful theorem called the convolution theorem. Consider 


the following inversion of the Laplace transform: 


1 y+ioo of i y+ico r i) wis 
— F(s)G(s)ds = —| e F(s){_ g(t)e “‘dtds 
2ni 277i 4 y-ic 


ytio 
=(, oo | Fle dsde (11.40) 
2nidy 


=|, e@)fe-nar 
In obtaining (11.40), we have reversed the order of integration. Therefore, we have 
E'{F(s)G()} = | ee) fe-2de (11.41) 
This is the convolution theorem and its application is demonstrated below. 
Example 11.6 Find the inverse Laplace transform of the following function in the 
transformed space: 


His" — 11.42 
(s) PEs?) ( ) 
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Solution: We can interpret F and G as 


1 
F(s)=—=, Gs)==", (11.43) 
s s +a 
To find the inversion, we first note that 
L{r"} =| ea =—+ pre" = nt” edt] (11.44) 
0 S 0 0 


By applying L’Ho6pital’s rule repeatedly, the boundary term for ¢ > 2% is found to 
be identically zero: 
n n-l 
t 


! 
im) = int eS in (11.45) 
tow oe" — tow ge™ too gp! 
Thus, we have 
ie} = t"e “dt - “| “16-5 (11.46) 
0 s70 
Repeating the integration by parts n times, we obtain 
cel ! po ! 
L{t"} =( te“ dt = =| edt = (11.47) 
0 gs" 0 Vial 
Applying of formula (11.47), we have 
1! 
L'{y=t (11.48) 
s 


In view of this result and the result of Example 11.4, we can apply the convolution 
theorem to get 


CW@} =F) 
Ss s(s 


t 
a= i L'{F(s)G(s)} = | (t—t)sin(at)drt (11.49) 
+a’) 0 
This is the required inverse of H(s) in terms of integral. 


Example 11.7 Solve the following ODE with prescribed initial conditions: 
y"+y=sin2t, y(0)=2, y'(0)=1 (11.50) 
Solution: Applying the Laplace transform to both sides of the ODE, we have 
2 
“ED + Ly} = L{sin 21} (11.51) 
t 


Using the results from (11.24) and (11.32), we get 
[s°Lty} —sy(0)— y| + L{y} = L{sin 2t} = 2/(s* +4) (11.52) 
Letting L {vy} = Y(s) and grouping terms, we get 


(s° +1)¥(s)-sy(0)- y'(0) =2/(s? +4) (11.53) 
Substitution of the boundary conditions gives 
(s? +1)¥(s)-2s-1=2/(s? +4) (11.54) 


Solving for Y we find 
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- Is? +57 +85+6 


Y(s) (11.55) 
(s? +1)(s? +4) 
Applying a partial fraction, we can divide the right hand side of (11.55) as 
3 22 
j= ee —— (11.56) 
(s* +1)(s~ +4) sot]  s° +4 
Balancing terms of the numerators on both sides of (11.56), we have 
28° + 5° +8594+6=(As+B)(s?+4)+(Cs+D)(s? 41 
( | ) ( | ) (11.57) 
=(A+C)s? +(B+D)s* +(444+C)s+(4B+D) 
Solving for the constants, we get 
A=2, B=5/3, C=0, D=-2/3 (11.58) 
Thus, we have 
¥(s)= + a a (11.59) 
so+1 s°4+1 s*+4 
Taking the inverse of the Laplace transform, we find 
Cy@ier{ 2 oe ia Pr ria ze (11.60) 
s° +1 s° +1 so +4 


Applying the results from Examples 11.3 and 11.4, we obtain the final result 


y(t) =2oost-+2sint sin (11.61) 


11.2.2 Fourier Transform 


The Fourier transform is one of the most important integral transform techniques. It 
can be used to solve linear differential equations in Cartesian coordinates, and has 
been widely used in many engineering problems. For example, FFT (Fast Fourier 
Transform) is a standard technique in analyzing vibration data. There is a close 
relation between Fourier series expansion and Fourier transform. Mathematically, 
for differential equations defined for finite domain eigenvalues and eigenfunctions 
can be found and used to solve boundary value problems for such finite domain. 
These eigenvalues are discrete and not continuous. Fourier series expansion is the 
most fundamental series expansion (see Chapter 10). For the infinite domain, the 
spectrum of eigenvalues is continuous, and integral transform needs to be used. 
Fourier transform is the most important transform technique for Cartesian 
coordinates. 
The kernel for the Fourier transform is 


K(s,t)=e™" (11.62) 
For this kernel, the pair of Fourier and inverse Fourier transforms are 


F(s)=31f1= ze [ie roar, (11.63) 
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Lf? 3 
fO=T'FO)l=F= } e™ F(s)ds, (11.64) 
ih —00 
Be cautious that there are many slightly different definitions adopted in the 
literature, mainly regarding the constant (27)!7. Here are the most popular 
definitions of Fourier transform: 


F(s)=If@]= a0 Ps" F(t) dt, (11.65) 
A 4-0 
where a and f are 
a= J2n, B=HI (11.66) 
a=1 B=1272 (11.67) 
a=1, B=Hl (11.68) 


There are six popular choices, and we are using o = (27)! and B = —1. Look at the 
definition of the Fourier transform carefully before using any table of transforms 
prepared in textbooks or derived by others. For real functions, we can take the real 
part and imaginary part of the above transform to give the following Fourier cosine 
and sine transforms: 


F)=3.L/)]= 2 [, costes @de, (11.69) 
fa) =5,'F(O)1= 2 [, coer ede (11.70) 
F(Q)=3,L/@)]= 2 [, sien fear, (11.71) 
fa)=5,'FO|= 2 J, smnenr@ae (11.72) 


They are useful in solving one-dimensional wave problems in a semi-infinite line 
subject to different boundary conditions. More detailed discussions on these 
transforms are found in Sneddon (1951), who reported their applications to solve 
vibration problems, heat conduction problems, hydrodynamic problems, nuclear 
physics, and elastic stress of 2-D or axisymmetric solids. For a table of Fourier 
transforms, readers should consult Erdelyi (1953) (1.e., Bateman Manuscript 
Project). 

Here we will only demonstrate the technique in solving a simple PDE 
problem. Figure 11.1 shows a half-plane with a prescribed function f(x) on the 
surface on y = 0. 

Mathematically, the problem is defined as: 

2 2 
Vu(sy) = S545 =0, 0<y<0,-w<x<0 (11.73) 
x 
with boundary conditions 


u(x,0) = f(x), u>0, asr= x? +y? ao (11.74) 
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The surface function f(x) and its derivative f(x) must approach zero as r > o. 
Multiplying the Laplace equation given in (11.73) by e‘* and integrating with 
respect to x from —co to co (or equivalently taking the Fourier transform), we get 


Oe. au, d? Ou, d’U(s,y) 
j=3— )+ {= 4A. + ; 
ax dy” ax? : 
The first term can be found by integration by parts as: 


ax?” J» ax ax ~~ Ox (11.76) 


= —is[ue’* |? + (is)? [ ue dx =(-is)’U 


=0 (11.75) 


All boundary terms vanish in view of the boundedness of the solution and its 
derivative at infinity given in (11.74). Thus, the PDE becomes an ODE as: 


Figure 11.1 A half-plane satisfying the Laplace equation with a prescribed function f(x) on the 


surface 
2 
use ey) =0 (11.77) 
dy 
Applying the Fourier transform to the boundary condition gives 
S{u(x,0)} =U(s,0) = 3{f(x)} = F(s) (11.78) 
The solution of (11.77) is 
U = Ae + Bet (11.79) 
The boundary condition requires A = 0, and thus we have 
U = Be b (11.80) 


Enforcing the boundary condition given in (11.78), we obtain 
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U = F(s)e (11.81) 
To find the solution, we are going to see that the Fourier transform of an integral 
can result in a very powerful theorem called the convolution theorem or Faltung 
theorem (such theorems also exist for other transforms discussed in this chapter): 


Sf * g(x)} = sf fe —u)g(u)dut = sel [re —u)g(u)due™*dx 


(11.82) 
Here the asterisk represents the integral involving the two functions. We then apply 
the following change of variables: 
VvV=xX-W, W=u (11.83) 
It is straightforward to see that the Jacobian is —1, thus we have 


1 a ia sua 
Sf * ee} ==] f SO)gOne 8 Pdu a 

aes (11.84) 
—— | "fe "dv | ” g(wye dw = 2 F(s)G(s) 

27 2-20 —00 
Therefore, we get 
f * 2(x) =V2a3 | {F(s)G(s)} (11.85) 

Using this idea, we can identify G(s) from (11.81). The remaining job is to find the 
following inversion: 


- —|s|y 1 ° —isx bi —isx —sy 
a =| e ese |. ee Yds 
1/5 —0O 


.. 40 J 
1 5(y-ix) —s(ytix) 1 1 1 
_ ; : — | c = = 5 = + Wl (11.86) 
i 1 y-ix - yrix 5 | in ae yVrix 


Oe es am 
=p 


Using (11.85) and (11.86), we can express the inversion of (11.81) as 


yes! ne ae eee 9s ee 
u(x,y)= 3" {F(s)e =e] oye ame 


~2 [Dac 
lage te 
Therefore, the problem is solved in terms of the given function f In conclusion, we 
also find the following formulas as by-products: 


(11.87) 


ey = Cisy"U(5) (11.88) 
dx" 


X'LF(s)G(s)] = ze [. f(x—u)g(u)du (11.89) 
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Hey =F Ss) (11.90) 
H y +x 


The first one is the main reason, and it is the reason why we can convert 
differentiation to algebraic manipulation and thus solve PDEs. The second one is 
called the convolution theorem and is a very powerful technique. For the given 
example, we have yet provided the exact form of the given function f/ but we have 
found a general formula for the result in terms of its integral. If the given function f 
is so complicated that we cannot integrate analytically, we can always integrate the 
resulting integral numerically. We can also generalize (without proof) here that the 
Fourier transform is applicable to PDEs formulated in the Cartesian coordinate 
system, and the variable of the problem is of infinite extent (see the range of x: —oo 
<x <o). Therefore, the Fourier transform is suitable for solving linear PDEs with 
a domain of half-plane, half-space, full-plane, or half-space and formulated in 
Cartesian coordinates. 


11.2.3. Hankel Transform 


For potential problems that can be formulated in cylindrical coordinates, the 
Hankel transform can be applied to convey the radial dependency to algebraic 
equations. The kernel in the Hankel transform can be expressed as: 


K(s,t) =J,(st)t (11.91) 

With this kernel, the resulting Hankel transform and its inversion are 
FE)=HUP@= [af )J, Ende, (11.92) 
FQ)=HF@N= | gF@OI, Ende (11.93) 


Many problems in cylindrical coordinates can be solved by Hankel transform. It 
can be shown that the Hankel transform of order zero (v = 0) is equivalent to a 
double Fourier transform for axisymmetric problems in Cartesian coordinates. 
Let us consider the application of Hankel transform to the following PDE 
(e.g., Section 4.9.2 of Chau, 2013): 
2 2 2 
ci gee md o Wn =0, O<r<a (11.94) 
or ror r 
where m is an integer. 
This is the Laplace equation in cylindrical coordinates for m-harmonics in 
circumferential coordinates (i.e., 0) (see Figure 11.2). In particular, the Hankel 
transform pair of y;, are defined as: 


¥ (6,2) = [, TW (152) (ST ar (11.95) 


Vnlts2)= | By (E.2n (EAE (11.96) 
Substituting (11.96) into (11.94), that is, we have 
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2 2 
StS SO eG alEnde=0 (1199 


oor ar 


Y =y,, cosmO 


Cc | > 
aN 


Figure 11.2 Cylindrical coordinates 


Reversing the order . differentiation and integration, we obtain 


fred “ale 1 dale) mJ ayy, (gas 


(11.98) 
o da W (6,2) 
+) 6 a (End =0 
0 dz 
Recall that the sbi eee of Jin(7é) is 
2 
ZnlS0) LOSI) 4 (42 97, (Er) =0 (11.99) 
dr? r dr r 
Substitution of (11.99) into (11.98) yields 
0 aw (Ez 
[, FED uEE=0 (11.100) 
Iz 
Therefore, the bracket within the integral must be zero 
he PS 
OP ale) _ gy, (6,2) =0 (11.101) 
Iz 
The solution of this ODE is easily obtained as: 
Wn (G2) = Ane” + Bye? (11.102) 
Finally, we have the solution of the PDE in (11.90) is 
WV (1s2) = i} : El A,,c8 +B,e 8 WI, (Erde (11.103) 


The constants, of course, need to be fixed by boundary conditions. In this example, 
we are content to get the general solution for (11.94). We observe that the Hankel 
transform is applied to the Laplace equation in cylindrical coordinates of infinite 
extent. In fact, one of the main differences between eigenfunction expansion 
discussed in Chapter 10 and integral transform is that eigenfunction expansion is 
used for the finite domain whereas integral transform is applicable to the infinite 
domain. In Muki’s formulism for elastic solids in cylindrical coordinates is based 
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on the Hankel transform (e.g., Chau, 2013). In fact, many physical problems in 
cylindrical coordinates are expressible in Bessel functions via Hankel transform. 


11.2.4 Mellin Transform 


For the two-dimensional elastic and potential problem in the shape of a wedge 
formulated in cylindrical coordinates, the so-called Mellin transform is found 
useful. It also finds applications in finding the sum of infinite series, the asymptotic 
value of an integral involving a large parameter, signal analysis, and imaging 
technique. Much of the results in the book of Bleistein and Handelsman on 
asymptotic analysis of integrals is in fact based on the Mellin transform. More 
detailed discussions on asymptotic analysis of integrals will be discussed in 
Chapter 12, but we will avoid the use of the Mellin transform in Chapter 12. In 
probability theory, the Mellin transform is an important tool in studying the 
distributions of products of two random variables. In particular, the Mellin 
transform of the product of two independent random variables equals the product 
of the Mellin transforms of the two variables. The Mellin transform is closely 
related to the two-sided Laplace transform. 

The so-called Mellin transform has been considered by Laplace and used by 
Riemann in his study of the zeta function. It was, however, Mellin who provided a 
systematic formation of the transform and its application to solve ODEs and to 
estimate the value of integrals. Thus, the Mellin transform is named after Finnish 
mathematician Hjalmar Mellin, who was a student of Mittag-Leffler and 
Weierstrass. The kernel for the Mellin transform is 


K(s,) =" (11.104) 
The Mellin transform and its inversion are defined as: 
F(s)= MLf@l= [x faa, (11.105) 
= 1 c+ico = 
fe) =M TF =——[ x F(s)ds (11.106) 
2711 4 cio 


where c is a constant that it lies on the right of all singularities of the kernel 
function. With a proper change of variables, the Mellin transform can be converted 
to a two-sided Laplace transform. In particular, a two-sided Laplace transform can 
be written as: 


L[g(t)|= } e(t)e "dt (11.107) 

Let us consider the following change of variables: 
t=-Inx, ea (11.108) 

x 


Applying (11.108) to (11.107), we get 
0 io) 
Lg l=-[ fox tdr=[° f)xtde=M if} (11.108) 


where we have defined 
g(-Inx) = f(x) (11.110) 


660 Theory of Differential Equations in Engineering and Mechanics 


Thus, we have the following identity: 


L{g(-Inx)} = M{f(a)} = F(s) (11.111) 
Let us recall the inverse Laplace transform as 
c+ic0 
= | Li{e(ieds (11.112) 
2701 I c-ic 
Applying (11.108) in (11.112), we find 
1 c+ico 
g(-Inx) = sal L{e(-Inx)}e*™*ds (11.113) 
TTL ¢c-io 
Substitution of (11.110) and (11.111) into (11.113) gives 
ct+ico 
oo | M6 f(x)}x°ds (11.114) 
271 4 cic 


This gives the inversion of the Mellin transform in (11.106). 
There is a Parseval formula for the Mellin transform. To see this, let us 
assume the existence of two Mellin transforms of two functions as: 


MIf(~)]=F(s), M[g(x)]= G(s) (11.115) 
We now consider the following Mellin transform: 


MUP @)I= | fgx" "ae 
_ 1 ioe) a2 c+ico =5 
= eal x a(x) on F(z)x “dzdx 


oer (11.116) 
=— fe, g(x)x* 7 "dx 
Oni 
1 
vF (z)G(s — z)dz 
~ Oni 
Substituting s = 1, we get Parseval’s formula: 
i) c+ico 
| f(O2(x)ae= =| F(z)G(1—z)dz (11.117) 
0 211 J c—ic 
Example 11.8 Consider the Mellin transform of the function 
f@®=-e" (11.118) 
Solution: The Mellin transform of (2) is 
M[f()]= } * Pts gy (11.119) 
0 


This can be converted to Euler’s gamma function by using the following change of 
variables 

¢=pt, d€ = pdt (11.120) 
Using this new variable we have 


MUOl= =f, eds == 


(11.121) 
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One major application of the Mellin transform is of course to solve differential 
equations. Thus, we consider the following formula for taking the Mellin transform 
of the derivative of function /: 


da‘ f(t 
mpi £0)- yh, FO) (11.122) 
where (s); is the Pochhammer’s ae defined in Chapter 4 as: 
(s), =s(s+1)**(s+k-D (11.123) 


To derive this formula, we can apply integration by parts to the following 
k wo . Jk 
mit tO ft), } FIO ne, t'dt 
dt 0 dt 


_ yes fol —(k+s—D [of Ha} 


(11.124) 
We can drop the boundary terms if f ~1/t* and 0 < Re(s) < f. Repeating the 
integration by parts k times, we obtain 


cd f(t) sol 
Mt <= (Mk +s-D(k +8) sft FOet aris 
=(-1)‘(s), F(s) 


This completes the proof. 


Example 11.9 Find the solution of the following problem of a wedge governed by 
the potential theory (see Figure 11.3): 


Ou lou 1 Ou 
V-u =——+ 4 =0 11.126 
ar? ror rae? 


u(r,ta)= f(r) (11.127) 


u(r,a) = f(r) 


u(r,—a@) = f(r) 
Figure 11.3 Wedge domain of potential theory 


Solution: We are expecting 
r—>0, u(r,@) = bounded (11.128) 
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1 
r>o, u(r,a)~—, B>0 
rh 
We can first rewrite the Laplacian as 
2 Ou cu Ou 
z tr—4 = 
ar? Or 8? 
Applying the Mellin transform to (11.130), we get 
so 2 2 
I pg? Ou Cu Ou 
0 


0 


+r—+ )dr =0 
ar’ =r: 88” 
Substitution of (11.122) into (11.131) gives 
eu 


s(s+lU-sU 4 =0 
do” 


Simplification of (11.132) gives 
2 
cd +s°U =0 
do 
The general solution is 
U(s,0) = A(s) cos(s@) + B(s)sin(s0) 
Boundary condition (11.127) leads to 
U(s,+a@) = F(s) 
Substitution of (11.134) into (11.135) leads to 


U(s,a@) = A(s) cos(sa) + B(s)sin(s@) = F(s) 
U(s,-@) = A(s)cos(sa) — B(s)sin(sa) = F(s) 


Solving A and B we get 


Age. xij =0 
cos(s@) 
Therefore, we have 
U(s,0) = F(s)cos(s@) 
cos(s@) 


Finally, we get 


u(r,0)= bf rs FOOD) 


2711 4 c—ico cos(s@) 


11.2.5 Hilbert Transform 


(11.129) 


(11.130) 


(11.131) 


(11.132) 


(11.133) 


(11.134) 
(11.135) 


(11.136) 
(11.137) 


(11.138) 


(11.139) 


(11.140) 


The Hilbert transform was proposed in 1905 by Hilbert in considering a problem 
posed by Riemann. Subsequently, it was developed by Weyl, Schur, and Riesz. The 


kernel for the Hilbert transform is 


1 1 
K(s,t) = ——— 
T SHE 


(11.141) 
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F()= Af ol= [LO ae, (11.142) 
2 m(x—€) 
FS) 
AF d 11.143 
fo) = FORO =-[ a dé (11.143) 


where the integral is singular at x = € and in the above formulas we have taken the 
Cauchy principal value as: 

Ei tim LO ae f JO) ey 11.144) 

-0 W(x —6) er0 d-0 A(x—¢) S48 M(X—¢) 

The term Hilbert transform was coined by Hardy in 1909. As shown in Mura 
(1987), Weertman (1996) and Chau (2013), the two-dimensional dislocation pile- 
up in elastic bodies can be formulated as Hilbert transform. As demonstrated in 
Broberg (1999) and Chau (2013), the two-dimensional crack problem can also be 
solved using the Hilbert transform in terms of dislocation pile-up. Its application is 
also found in Muskhelishvili’s (1975) formalist for solving two-dimensional elastic 
problems using complex variables. If the limits of the integral are finite numbers, 
we have the finite Hilbert transform (Tricomi, 1957): 


FO=Ffoyl={_ ae (11.145) 


fo=F FO) an f= = ood ies = (11.146) 
a 


The proof of this result is found in Tricomi a a transform arises from the 
analysis of an airfoil using aerodynamics (Tricomi, 1957), analysis of electronics 
(Nahin, 2006), and in the analysis of reflected SV waves (Ben-Menahem and 
Singh, 2000). 


11.2.6 Other Transforms 
The more popular types of integral transforms have been summarized in earlier 
sections. In order to provide a more comprehensive summary of the integral 


transform, a summary of eleven other integral transforms is given below. 


(i) Hartley transform 


F(é) =I f(x] = z } : (cosEx + sin Ex) f(x)dx (11.147) 


1 ips : 
f(x) =h"'[F(6)] = ara (coséx+sinéx)F(E)dé (11.148) 
5 i) 
This was proposed by Hartley in 1942 as an alternative to the Fourier transform. It 


deals with real functions exclusively, compared to the traditional Fourier transform 
using complex functions. 


(11) K-transform 
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F()=KUf@]= } : K,(xé)afxé f (dx (11.149) 
fQ)=KWPO=— J" Ly aE PCa (11.150) 


where Ky and Jy are the modified Bessel functions. Table of K-transform can be 
found in Erdelyi (1954). 


(111) Kontorovich-Lebedev transform 


« K,.(x) 
F(@)=K, [f= | 2 sear (11.151) 


/0)=KP@=—y J) Esinlxe)K FEMME (11.152) 


where Kiv is the modified Bessel function of the second kind of imaginary order. 
This was proposed by Kontorovich and Lebedev in 1938 for solving diffraction 
problems, and further detail can be found in Lebedev et al. (1965). 


(iv) Mehler-Fock transform 


F(Q)=M,;Uf (l= } 7 sinh xP! 1/(cosh x) f(x)dx (11.153) 


P(x) = Mp'LP@I= | Etanh(we)PE2(coshxyF (dE (11.154) 


where the kernel is the associated Legendre polynomial of the first kind of 
imaginary order. This was proposed by Mehler in 1881 and its basic theorems were 
proved by Fock in 1943. Detail can be found in Nasim (1984) and Sneddon (1972). 


(v) Weber-Orr transform 
FE)= Wolf = [Vt E)¥,(ag)-¥, a) (aI f@de (11.155) 


f= Welt EY =e” ODM VE Ib) 
Jy (as) + ¥, (as) 

This was proposed by Weber in 1873 for considering problems of infinite regions 

outside a circular cylindrical hole. A similar result was re-discovered by Orr in 

1909, and thus it is also referred to as the Weber-Orr integral transform (e.g., 

Olesiak, 1990). Rigorous justification for the method was done by Watson in 1944 

and Titchmarsh in 1922. 


F(é)dé (11.156) 


(vi) Associated Weber transform 
FQ) =WywlSOM= | y@¥(as)-¥, (8) (as)bf de (11.157) 


el a Y _Y oF 
f@)=WalFEN= | lab Meas) —Y (6) (a8) 
° B(aé)+¥2 (az) 


éF(E)dé (11.158) 
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The term of associated Weber transform was apparently coined by Krajewski and 


Olesiak in 1982 (Nasim, 1989). It has been found useful in thermoelastic problems 
of an infinite domain containing a circular cylindrical hole (Olesiak, 1990). 


(vii) Weierstrass transform 


F(é)=WLf()]= Te (ae fax (11.159) 
f(x) =W"'LF(é)] = Te [eo @4rueae (11.160) 


This transform was also known as Gauss transform, Gauss-Weierstrass transform, 
or Hille transform. It is related to heat or diffusion problem. 


(viii) Y-transform 


FE)=wlS= |S)? (OOY, xed (11.161) 


0) =y'FO= |S)? FOH, Cd (11.162) 


where Ay is the Struve function, which is a particular solution of a 
nonhomogeneous Bessel equation. It is useful for solving problems with singular 
behavior at the axis of symmetry. This is closely related to Hankel transform. 


(ix) H-transform 


FQ)=% (f= |. 8)" (OH, Cav (11.163) 


SQ) = HLF OI= | ey"? FO, (ede (11.164) 


The Y- and H-transforms are complementary pairs of integral transforms. 


(x) R-transform 
F(6)=&[ f(x) = fire {€ cos(xg)+hsin(xé)}dx (11.165) 


get a eh & cos(xé) +hsin(xé) 
f@=R rel= f= J, FOr OR 


This is for heat conduction problems with a boundary as 


F (0,y,2)-W/(0,9.2) = ¥O.2) (11.167) 
x 


where / is the parameter prescribed in the boundary condition. 


}d& (11.166) 


(xi) Abel transform 


F(@)= ALf (a) =2)° A ae (11.168) 
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_ yl 1p? 1 dF 
f@)= ALF) NT mes (11.169) 


The Abel integral transform is useful for axially or spherically symmetry problems. 


11.2.7 Governing Equation of Kernel Functions 


In this section, we will illustrate how to derive the proper integral transform for a 
differential equation. In particular, consider the following second order PDE for a 
semi-finite quarter space and boundary conditions as: 


2, 2 
ata) a HO) telat = 0 0<xy<o (11.170) 
u(0,y)=0, u(x,0) = f(x) (11.171) 
R= x+y 30, u(x,y)>0 (11.172) 


The problem is illustrated in Figure 11.4. Multiplying (11.170) by a kernel function 
K(x, &) and integrating the result from 0 to co with respect to x, we have 


I, ee als) + K(x, Sx) + K(x, &)c(x)u+ K(x, aoe x =0 
(11.173) 
y 
u(0, vy) =0 


u(x,0)= f(x) 7 


Figure 11.4 Domain of the PDE defined in (11.45) 


At this moment, the kernel function is the unknown of the problem. We are going 
to find what condition needs to be satisfied by K(x,é). Using integration by parts, 
the first term (11.173) can be integrated as 
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% Oru 
J, K@.Dae@ at 


= K(x, sacs) “ls HS iK(ng ¢ a(x) dx 


(11.174) 
= Ke 6a) -{uZtKe, acs} + fv St.2,acoi 
X Io 


Gul” fF? 
=K(x2)ax)S) + [uw SK, Saye 
Ox 0 0 Ox 
We can impose an additional constraint on the boundary conditions for K as: 
K(0,5)a(0)=0, S(~,8)=0 (11.175) 
x 
With these constraints, we have (11.174) becoming 
oo) 2 fal 2 
| KG Ou@- “ae | i” ete Dale (11.176) 
0 ax? 0 x? 
Similarly, we have the terms being 


[KG 8)00) Bae = Ke Eby — [ALK DC) 


(11.177) 
=- I uE {K(x )b0) kx 
% Keo? > ean K(x,é)udx (11.178) 
Now we define the integral transform as 
UE») = J Kg), yiax (11.179) 
In view of the decay condition as y 0, we can assume U has the following form: 
U(é,y) = F(é)e (11.180) 


Substitution of (11.180) into the boundary condition given in the second equation 
of (11.171) leads to the following identity 


FQ@)= [Kfar (11.181) 
The differential equation given in (11.170) becomes 
© | Ou a 
I, + Ste ercraent- Zee) «kts +e) =0 (11.182) 
x 
Since u cannot be identically zero for all uv, we must have 


2 
SAK 2a(x)]- <IK(s, E)b(x)] +[e(x) +22]K(x,2)=0 (11.183) 
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Therefore, we have arrived at a differential equation for the kernel function K(x,é). 
Depending on the exact mathematical form of functions a(x), b(x) and c(x), this 
non-constant coefficient PDE is in general not easy to solve. 


Example 11.10 For the special case of a(x) = 1, and b(x) = c(x) = 0, find the kernel 
function K(x, &). 


Solution: For this special case, we have the governing equation for K(x, €) as: 


2 
SKED+EK HE) =0 (11.184) 
x 
The solution is clearly 
K(x, &) = A, sin(€x) + A, cos(€x) (11.185) 
Substitution of (11.185) into (11.175) yields 
K(x, &) = A, sin(€x) (11.186) 
The proper integral transform for this problem is therefore 
UE» =[ _ Sin(xe)u(x, y)dx (11.187) 


Therefore, the Fourier sine transform can be used to solve the following Laplace 
equation: 
2 2 
oe es 0<xy<0 (11.188) 
Ox” Oy 


11.3 ABEL INTEGRAL EQUATION 


The Abel integral equation is formulated as 


pdt) 1 : 
I, n a f(p) (11.189) 


where @ is the unknown to be determined. This integral is known as the Abel 
integral. The solution of it is well known and is given as 


g(s)=2 | piejdp (11.190) 
a dsd0 f.2_ p 

This equation appears naturally in crack problems (e.g., Chapter 5 of Mura, 1987). 
This integral was first considered by Abel in 1923 when he studied a mass sliding 
along a frictionless curved vertical plane under the influence of gravity such that 
the arrival time at the lowest point is independent of its starting point on this curve. 
The solution is called a tautochrone curve, as in Greek fauto has the same meaning 
as iso- or equal and chrone is time. The tautochrone curve is also the 
brachistochrone (shortest time in Greek) curve. That is also the curve that takes the 
shortest time to travel to the lowest point. It is this problem that leads to the latter 
development of calculus of variations by Euler and Lagrange. 
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In particular, Figure 11.5 shows the tautochrone problem of a sliding bead 
along a frictionless wire. 


s a 
a 
ee Ox, y) 


Figure 11.5 Tautochrone problem of a frictionless sliding mass along a curve with the same 


x 


arrival time independent of the starting point 


Using conservation of energy, we have 
1 ds 2 
mgv +0 = mgy +—m(— 11.191 
gv saa ( 7 ze ( ) 


Rearranging (11.191), we get 


& =—Pgo-3) (11.192) 


In obtaining (11.192), we have taken the negative sign in front of the square root in 
view of the fact that s shown in Figure 11.5 is a decreasing function of ¢. Integrating 
both sides, we have 


r=[-a=[ —ds =(" ds 

v f2g(v-y) °9 J2e(v-y) 
Clearly, the sliding distance s is an unknown function of the shape of the unknown 
curve, and we assume 


(11.193) 


ds = F(y)dy (11.194) 


where F(y) is an unknown function to be determined such that the arrival time T is 
a constant. Thus, we have 
r-_ (i F(y)dy 
2g °9 Mv-y) 
This is the Abel type of integral equation. Niels Henrik Abel was a Norwegian 
mathematician and was born in 1802 and passed away at the age of 26. Abel had 
shown that it is impossible to solve the quintic equation (5 order algebraic 
equation) in radical forms. He also discovered elliptic function, which was 
subsequently improved by Jacobi and is now called the Jacobi elliptic function (see 
Chapter 1 and Appendix D ). Independent of Galois, he also developed the group 
theory. He lived in poverty for his whole life, and passed away just two days short 
of receiving the good news from his friend Crelle (editor of Crelle’s journal) that he 
was appointed professor at University of Berlin. The renowned French 


(11.195) 
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mathematician Charles Hermite (the discoverer of Hermite polynomials) remarked 
on Abel’s six years of works as “Abel left mathematicians enough to keep them 
busy for five hundred years.” Another renowned French mathematician Adrien- 
Marie Legendre (the discoverer of Legendre polynomials) commented, “What a 
head the young Norwegian has!” 

We now go to back to the technical side of (11.189) and (11.190). To prove 
the validity of (11.190), we consider the following generalized Abel integral 
equation: 


{ : ee =f(9) (11.196) 
a [h(x)—h(o)] 
where 
O0<a<l, a<x<b (11.197) 
In addition, we consider the case that H(f) is strictly monotonically increasing and 
its first derivative is not zero for all t between a and b. Note that A(¢) = f in 
(11.189). 
To start the proof, we consider the following related integral: 
Coen ee A (11.198) 
@ [h(x)—h(u)] 
Substitution of (11.196) into (11.198) results in 
I(x) =[" h'(u) - i’ g(t)dt : 
a [h(x)—h(u)] “4 [h(w) —h()] 
x x h'(u)du 
[80 2 : 
t [h(x)- hwy“ [A@) - AO] 
The last line of (11.199) is the result of reversing the order of integration. In 


addition, the upper limit for variable ¢ has been set to x (instead of using w). 
Applying the following change of variables, we get 


(11.199) 


A(uy=€, py =h(x), p, =A(t) (11.200) 
i h'(u) fee (ie dé 
t [h(x)-A(w) 7 [hw) - AO) WO (p, -€)'*(E-p,)% 


i? dé (11.201) 
P (py -£)'*(E-p,)" 

This integral cannot be found in most handbooks on tables of integrations, 
including Gradshteyn and Ryzhik (1980). However, the integration given in 
(11.201) can be evaluated through an integral covered in Chapter | and it is also 
reported on p. 118 of Whittaker and Watson (1927), In particular, we recall the 
following result from Chapter 1: 


wo ~a-l 
n= a (11.202) 


0 l+x — sin(az) 
Let us apply a change of variables with the new variable being y as: 


x22 (11.203) 
ys 
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Then, the differential of x is 


i. (11.204) 
a a 
Noting that x = 0 gives y =z, and x = 00 gives y = é, we can convert (11.202) to 
= dy Ss (11.205) 


é(z-y)*(y-&* _ sin(az) 


This is precisely the integral we got in (11.201). Therefore, substitution of this 
result into (11.201) and (11.199) oe 


= _ f° #MOFwdu 11.206 
I(x) = [goat a I TT (11.206) 


Differentiating (11.206) with respect to x and using the Leibniz rule of 
differentiation on integral, we get 
oj) sin(za) d <i" eae (11.207) 
x dda [h(x)—h(u)} 

This is the solution of (11.196) and in addition we now make the following 
identifications: 

hu)=u?, g(p)=9(p), @=1/2, x=p, a=0 (11.208) 
We finally obtain 


“ al |’ pee (11.209) 
Integrating on both sides gives 
ee i (11.210) 


This completes the proof of (11.190). 


11.4 FREDHOLM INTEGRAL EQUATION 


The linear Fredholm integral equation of the first kind is defined as 


F0) = J K(s)9lss (1.211) 


where ¢ is the unknown in this equation. The linear Fredholm integral equation of 
the second kind is defined as 


f(x) = f(x) +Af K(x, spss (11.212) 


where fis a given continuous function, ¢ is the unknown in this equation, and / is a 
parameter of the equation. More generally, if the unknown ¢ does not appear in the 
integrand linearly, we have the nonlinear Fredholm integral equation of the second 
kind as 
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Hx) =f) +A] KL, 9s) (11.213) 


11.4.1 Solution of the Fredholm Integral Equation of the Second Kind 


The following Fredholm formula gives the solution of (11.212) 
x)= f(x)+ D(x, y3A 11.214 
He) =f) aol (Arey (11.214) 


where the D(A) and D(x, y; 2) are defined as 
= 7% (—A)” gi > > > m 
Daal < J -fx(er "ide. dé, (11.215) 


D(x, 3A) = 2 A" ~JI- J« C. é ose dé, (11.216) 


m=0 
The integrands in (11.215) and (11.216) are defined in the form of determinants 
K(x) K(x, ¥2) and K(X15¥m) 
oa dai K(X) oe K(X, ) (11.217) 
V2 V2 9-009 Vm oa one oe mas 
K (Xn) K(Xn>¥2) ene K(Xn>¥m) 
Ky) KQ4G) KOE) 
i ay KEY) KES) ~ KES) ay aig) 
Desi basen 


Kile, 2 y) Ké, ¢1) ele pd (a om) 
Although the formula given in (11.214) is exact, the calculation of the determinant, 
evaluation of the integration, and the sum of the infinite series are very tedious 
procedures. 

To derive Fredholm’s formula, we first divide the limit of the integral as n 
equal intervals: 

a, X,=at+0, X,=a+20,..., x, =atnd=b (11.219) 

Then, we use the fundamental principle of integration by expressing (11.213) in the 
form of an infinite sum: 


b,(x) = f(X) +A) K(x, x))G, (25 (11.220) 
j=l 
This solution form is supposed to be valid for all x within the interval from a to b. 
Thus, we can substitute each x; defined in (11.219) into (11.220) to obtain a system 
of n equations: 
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-ASK(x.))4y 8+ G0) =f(x;), 1=1,2,...,,n (11.221) 
j=l 


This is a system of n nonhomogeneous equations for n unknowns ¢,(x;), where i = 
1,2,...,.2. The solution of this system is unique if the determinant of coefficients in 
(11.221) is not zero. We denote the determinant of (11.221) as: 


1-ASK(x,,x,) —-ASK(x%,x,)  ... ASK (x), x,) 
D,(A)= -AbK(xXy,X;) 1-ASK(X,X_) .. = —-AOK(X>,X,) (11.222) 
-ASK(x,,X;)  —-AOK(x,,X,) .. 1-AdK(x,,x,,) 


Next, we observe that from the theory of determinant expansion, the determinant of 
the following matrix 


I+a, a2 es Qn 
a l+a we 
(12) > i a (11.223) 
An) an2 ae ae Ann 


can be expanded as (see p.88 of Aitken, 1944) 


Fin Gin oe Ane, 
1 ayy, Gan 1 a a, iH a, I; rae a, I, 
det|S,|=1+—) a,, ~)> +... > vias " 
NANG Den n! es a ee 
H Koh | 2% 29 Hatha 
Cae: Sag. ee Gea 
(11.224) 


We should mention a story of Alexander Aitken here. Alexander Aitken was a New 
Zealand—born mathematician, who possessed extraordinary memory. He was a PhD 
student of E.T. Whittaker at Edinburgh University, but was awarded a D.Sc. 
instead because of his outstanding thesis on data smoothing. He was known as one 
of the greatest mental calculators, and he could multiply two nine-digit numbers in 
his head in 30 seconds, and render fractions to 26 decimal places in under five 
seconds. Aitken made significant contributions to statistics and econometrics. 

The formal proof of (11.224) will not be discussed here, however, for cases 
of n = 2 and 3 the author is advised to check its validity (see Problems 11.8 and 
11.9 set at the end of the chapter). Thus, we can write the determinant in (11.222) 
as 


674 Theory of Differential Equations in Engineering and Mechanics 
H ae K(x, .x,) K(%,.%, 
D(A) =1-A YOK 0%, cS 
nA=I-AY ORG )+ TDP ea an) Kiya) 
4 Kd, 2 
K(x, .%,) K(%,.%,) -- K(%,.%,) 
(-A)" y gn [X Ginn) KO) KO) 


K(%, %,) KO, CS. KO, x,) 


(11.225) 

Now, we take the limit of 5 > 0 and n — o, and replace the summation by 
integration. This turns (11.225) into 

b a2 po p|KG.4) K(E.e) 

D(a)=1-al KE.6)dG + ae 

a a|K(Sy551) K(S2552) 

K(S061) K(S1.52) +  K(Si56n) 


cyt (iaeas K(on,62) + K(Ep56,) 


dédéy 


ae dé. é, +... 


Bese) Ko 952) pee Reon) 


(11.226) 
where D(A) is also known as the Fredholm determinant of the kernel K. Using 
Cramer’s rule, we can solve for (11.221) as: 


_ F (%) Dy Hy) + FX) Dy (Kyo) +--+ L(%, Dy Xp, Xn) 
9(X,,) 
D, (A) 


where D,,(x,, xv) is the cofactor of the determinant D,(A) for the term K(xy, xz). It is 
defined as: 


(11.227) 


K(x ,%) K(X o%, 


D, (Gy x)= AK (X,, xy) ey K(x x,) K(x x.) 


KGiam) KGist,) sv. Riga) 
(- Ao n K(x,, xy) K(x,» x,,) sae K(x,,, x,) 
+... —_—_—_——_— ’ 
ae eet 7 
‘ke, X,) KG, 0x *y, K(x, »X,) 
(11.228) 


Take the limiting form of 5 — 0 and n — © such that 8-'D(x,, xv) is redefined as: 
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b|K(Xy, x,) KXyS1)| 
dc, + 

a|K(g),x,) K(¢,41) 

Benen K Gr) KS) KGy$) (11.229) 

+o] KG) KEG) KEE) Ede - 


K(g,,.%,) K(2,6)) K(op.52) 
For D(A) # 0, Fredholm inferred that the solution of (11.212) is 


P(x) =fOree I, Dx, SA) f(S)ds5 (11.230) 


This is Fredholm’s celebrated formula, and it is a breakthrough in the analysis of 
the Fredholm type of integral equation. This is an elegant formula but the actual 
calculation of the determinant and integration may be tedious. Although 
Fredholm’s approach is kind of intuitive, its formal justification was considered by 
Hilbert in 1904 and subsequently led to the development of functional analysis, 
which will be discussed further in a later section. 


D( )=AK(x,,%,)-A? 


XX, ih 


Example 11.11 Solve the following Fredholm integral of the second kind: 
1 
Ox) = x42) soya (11.231) 


Solution: For this special case, we have 
1 
D(A)=1-a] Sdg =1->A, D(x, y34) =Axy (11.232) 
0 


Note for this case that only the first term in (11.229) is nonzero. The solution is 
clearly 


0(x) = rel Axy*dy = = (11.233) 


Example 11.12 Solve the following Fredholm integral of the second kind. 
I 
Hx) = x42) Cort y)OvMy (11.234) 


Solution: For this special case, the determinants are: 


aa ee 
D(A) =1-=A-—a 11.235 
(A) a a5 ( ) 


1 it . te a 
D(x, YA) = AG V+ AG" 3 ee (11.236) 


The solution is clearly 
Ax+18+424x 


TA 


P(x) =x +17 ( ) (11.237) 
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11.4.2 Solution of the Fredholm Integral Equation of the First Kind 


Note from (11.214) that we must have D(A) # 0. For the case D(A) = 0, suppose 
that its roots are called eigenvalues and denoted as: 


N= Ay AgysAns NYO (11.238) 
It corresponds to the following eigenvalue problem: 
b 
v(x) =A, | Kew (dé (11.239) 


where yj (j = 1,2,...,2) is the eigenfunction. This so-called eigenvalue problem 
defined in (11.239) can be considered as a generalization of the traditional 
eigenvalue problem for a matrix. This eigenfunction has been normalized as: 


b 
[ w@yag=1 (11.240) 
It can be shown that these eigenfunctions are orthogonal in the sense that 
b 
[ view @odg = 5, (11.241) 


Note that (11.241) is actually the homogeneous form of the original Fredholm 
equation given in (11.212). We will see in a later section that (11.241) is actually 
the inner product defined for the Hilbert space. 

With this eigenfunction, we are now ready to establish the solution of the 
following Fredholm integral equation of the first kind: 


FO) = J K(xs)9lsds (11.242) 


The solution can be established using the so-called Hilbert-Schmidt theorem. In 
essence, this theorem asserts that both f(x) and (x) can be expanded in a Fourier 
series expansion in the eigenfunction yj: 

foe) 


S)= Yi anv (2) (11.243) 


n=l 


B(x) = D2, Yn) (11.244) 


n=l 


Multiplying both sides of (11.243) by y;,(x) and integrating from a to b gives 


b = b 
[ seov, @de= Ya, [ voy, ode =a, (11.245) 
M n=l a 
The last part of (11.245) is a consequence of the orthogonal property of the 
eigenfunctions given in (11.241). Thus, we have 


b 
= } fv, (de (11.246) 
Substitution of (11.244) and (11.243) into (11.242) gives 
foo} co. b 
Y anVn(X) = >, | K(x,s)y,(s)ds (11.247) 


n=l n=l 


In view of (11.239), we have 
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b b 
adn | K(x. wy(OdE =, | -K(xs)y, (sds (11.248) 
This gives 
b, =a,4, (11.249) 
Therefore, the solution of the Fredholm integral equation of the first kind is 
B(x) = oa, Ann (2) (11.250) 


n=l 


This solution is analogous to the eigenfunction expansion discussed in Chapter 10. 


11.5 FREDHOLM ALTERNATIVE THEOREM 


There is a major by-product of Fredholm’s formula discussed in Section 11.4.1. 
We recall that the solution is valid only if D(A) # 0. For the case D(A) = 0, it is an 
eigenvalue problem of the following problem: 


u(x) = Af Kes Eu(EdE (11.251) 


This is actually the homogeneous equation of (11.212). If the kernel function is a 
square integrable function: 


b eb 2 
| | K(x, 2) dédx <o, (11.252) 


the following theorem can be established. 

In particular, for D(A) # 0 there will be no solution for the eigenvalue 
(11.251). For D(A) = 0, we have an infinite number of roots called eigenvalues: 

A=A Ay Ay, NO (11.253) 

This A is also called the characteristics of kernel K(x,é) and the set of roots of D(A) 
= 0 is called the spectrum of kernel K(x,é). The solutions of (11.251) 
corresponding to the eigenvalues A; are called eigenfunctions or characteristic 
functions: 


U=Uy,Uy,..5U,, NO (11.254) 
The general solution of (11.251) is 
U(x) = Cy (X) + CyUy(X) +...+€,U,(x), n> (11.255) 
Fredholm defined an associated integral equation as: 
b 
u(x) =A K(é,x)u(Eydé (11.256) 


Note that this is also the adjoint problem of the original integral (11.251). 
Corresponding to the eigenvalues given in (11.230), the characteristic functions of 
the adjoint equation in (11.256) are 

W(x), Wo(X), Wy, (x) (11.257) 


Then, the condition of having the solution of the nonhomogeneous form of 
Fredholm integral equation given in (11.242) is: 


} Whe ee=0 (11.258) 
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This is called the Fredholm alternative theorem, and if it is satisfied, the Fredholm 
equation in (11.242) is solvable. This is related to the existence of a solution of a 
nonhomogeneous equation, and thus it is a very important and powerful theorem in 
the theory of integral equations. In fact, the Fredholm alternative theorem can also 
be applied to investigate the existence of nonhomogeneous ODEs (e.g., Boyce and 
DiPrima, 2010). 


11.6 ADJOINT INTEGRAL EQUATION 


Let us formally define the adjoint integral equation. Let us consider an integral 
equation defined symbolically as: 

g(x) = If (x)] (11.259) 
where / is an integral operator. For example, the linear integral operator can be 
defined: 


b 
g=| KA /OdE (11.260) 
The corresponding adjoint problem is defined as: 
_ b 
Tifenl= | Kens Gag (11.261) 


The adjoint problem has to be studied by employing the following definition of the 
inner product defined by Hilbert as: 


b 
Litl=| AOAOA (11.262) 


Note that the concept of the inner product is an important ingredient of the 
functional space concept. Of course, the inner product defined by Hilbert is for the 
so-called Hilbert space (or Hilbert functional space). In fact, the functional analysis 
or sometimes referred to as abstract space analysis in the last hundred years was 
initiated by Hilbert’s studies related to the Fredholm integral equation. More 
discussions on this are presented in a later section. 

Then, the adjoint integral operator of an integral operator / is defined by 


HeAm A bree) (11.263) 


where the superimposed bar indicates the adjoint integral operator. For a self- 
adjoint integral equation, we require: 
U(A)s 21 =f 12)1 (11.264) 


with zero boundary terms. This leads to 


b pb b pb 
[ | K@aiO@hergde=[ [KEK Odgde (11.265) 


Consequently, the conditions of the self-adjoint integral equation leads to 

K(x,é) = K(é,x) (11.266) 
Similar to the self-adjoint differential equation, the corresponding eigenvalue of the 
self-adjoint integral equation has real eigenvalue 2; and eigenfunctions yj. In 
addition, the eigenfunctions are orthogonal. 
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11.7 VOLTERRA INTEGRAL EQUATION 
11.7.1 Volterra Integral Equations of the First and Second Kinds 
In this section, we will consider a special case of the Fredholm equation. If the 


upper limit is not the constant 5, instead the upper limit becomes x. We have the 
linear Volterra integral equation of the first kind 


f(x) = I K (x,s)y(s)ds (11.267) 
Similarly, the linear Volterra integral equation of the second kind is defined as 
y(x) = f(x) +4 } oe (x, 8) y(s)ds (11.268) 


More generally, the nonlinear Volterra integral equation of the second kind can be 
defined as 


va) = fa+Af KLxs, voyds (11.269) 


We observe that the only difference between Fredholm and Volterra integral 
equations is the upper limit. In particular, if the kernel function in (11.245) or 
(11.212) is defined as 
K(x,s)=0, s>x 
#0, x>S, 
when (11.270) is substituted into (11.211) and (11.212), (11.267) and (11.268) are 
recovered as a special case. It is illustrated in Figure 11.5. 

Despite the similarity between the Fredholm integral equation and the 
Volterra integral equation, there is a major difference between them. In particular, 
if fand K in (11.267) and (11.268) are continuous, there is a unique solution for the 
linear nonhomogeneous Volterra integral equation of the second kind. However, 
for a linear nonhomogeneous Fredholm integral equation of the second kind to 
have a unique solution, the Fredholm alternative theorem must be satisfied. 


(11.270) 


11.7.2 Liouville-Neumann or Picard Successive Approximation 


Historically, Volterra’s solution of the linear Volterra-type integral equation of the 
second kind was obtained in 1896 and 1897, and, thus, preceded Fredholm’s 
solution. Let us consider the solution scheme by Volterra, which was obtained by 
the idea of successive iteration (p. 1057 of Kline, 1972). In particular, Volterra 
assumed that the solution can be expressed as an infinite sum as: 


Wx)=f£@)+ >) f,) (11.271) 
p=l 
where every term in the infinite series is an integral defined as: 
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A=] Ke» 


j= [x (x, WA C)dy (i277) 


ful) = |KO) Spay 


K(x, s)u(s) 


[| Fredholm 
Fal Volterra 


Vi 


a x b Ss 


Figure 11.6 Domain of integrand of Fredholm and Volterra type integrals 


The first integral depends on the given function f, whereas the second integral 
depends on the result of the first integral, and so on. Stakgold (1967) called this 
infinite series given in (11.271) the Neumann series, which was used by Neumann 
for many years before it was published in 1877. The first publication of using this 
kind of series was by Liouville in 1832 and 1837. Therefore, some researchers 
called it the Liouville-Neumann series. The same idea was published again in 1890 
by Picard and he established the formulation in a general and widely applicable 
form (Tricomi, 1957). Therefore, it was also called the Picard process of 
successive approximation. Apparently, the most popular adopted name is Picard’s 
method of successive approximation. 

Now, substitution of (11.272) into (11.271) and then the result into the right 
hand side of (11.268) gives 


y(x) = f(x) + [ Re s) f (sds (11.273) 


where R(x,s) is called the resolvent kernel (versus the kernel K(x,s)) and is defined 
as 


b b pb 
R(x, 8) = K(x,s) +( K(x,r)K(r,s)dr +f I K(x,r)K(r, w)K (w, s)drdw +... 


(11.274) 
This result can also be recast into a slightly different form: 
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w(x) = f+ | “SK jsf (9)ds (11.275) 
j=l 
where 
K, (xs) =K(x,s) (11.276) 
K (45) = [ Ke V)K ,4(v,8)dv (11.277) 


In order for this solution to be valid, we must require that the kernel function K(x,s) 
be bounded. 

If the kernel function is of convolution form (i.e., K(x,s) = K(x-s)), the linear 
Volterra integral equation can be solved by using the Laplace transform. This will 
be discussed in Section 11.7.5 and illustrated in Example 11.16. 


Example 11.13 Solve the following integral equation by Picard’s iterative method: 


yt) =1- f, (t—x)y(s)ds, y(0)=1 (11.278) 


Solution: We can start with the boundary condition as the first approximation 
¥ = y(0) =1 (11.279) 
Then, the second approximation is 
t 1 
y= [« syds=1-5° (11.280) 
0 


The third approximation is 


yy =1-[ (@=s)n(s)ds=1 KG s)(1 58?) 


(11.281) 
= 1-f 1a-2s?)ds + [e-S sds 
0 2 0 2 
Carrying out the integration, we find 
1, 14 
=l1-—t*+—1t 11.282 
V2 5) 34 ( ) 
If we continue the process, we find 
1 1 
y, =1-— 0? +—F +...=cost (11.283) 


This is also the exact solution. 


11.7.3 Solution of Volterra Integral Equation of the First Kind 


For the solution of the linear Volterra integral equation of the first kind, we find 
that we can actually convert it to the Volterra integral of the second kind such that 
the above iterative solution scheme proposed by Volterra applies. In particular, 
rewrite the linear Volterra integral equation of the first kind as 
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g(x)= [- 1G9)y(o)ds (11.284) 


Differentiation of both sides with respect to x and application of the Leibniz rule of 
differentiation on integral gives 


g'(x) = H(x,x)y(x) + y(s)ds (11.285) 
If the kernel H(x,x) is not zero, we can rearrange ie equation as 


g(x)  (*_-l_ 0,5) 
x)= S)ds 11.286 
¥@) A(x,x) Ja H(x,x) Ox vet ( ) 
This is, in fact, the Volterra integral equation of the second kind if we make the 
following identifications: 


ik a Ss) 


-l1 OH 
f@Q=27. Kas= ga) (11.287) 
H(x,x)’ A(x,x) ox 
Thus, we know how to solve it. 
Example 11.14 Solve the following Fredholm integral equation 
ax)-Al eA y)dy = f() (11.288) 
Solution: Using (11.275), for this special case we have the first kernel Ki(x, &) as: 
K(x, y) =e"? =ee” (11.289) 
The second kernel given in (11.277) is 
K(,y)= [ete ae = et [de =e*(x—y) (11.290) 
» y 


Substitution of (11.290) into (11.277) yields 
K3(x%,y)= [ete (z-y)dz = el" —y)dz=e*? oo (11.291) 
Substitution of (II. 291) into (11.277) sh gives 
Ky) =— |e ete? (z—y)? de=e x >t Ce oe cy (11.292) 


Obviously, we can generalize the pres from (11.289) to (1 _— to 


Kyu y) =e — = ue (n=0,1,2,...) (11.293) 
Substitution of (11.293) into (11.274) gives 
RW YA =U Ky yay) =e" AAW Lotito) (11.294) 


! 
n=0 n=0 Me 
Using this result, we finally get 


P(x) = f(x) +A | eee y)dy (11.295) 
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Two particular forms of kernels are of particular interest because of their physical 
significance and because of the easiness of solving these particular cases. They are 
the separable kernel (or called Pincherle-Goursat kernel) and the convolution type 
kernel. 


11.7.4 Volterra Integral Equation with Separable Kernel 


The kernel of the Volterra integral equation is called separable if the following 
equation is satisfied: 


K(t,s) = p(q(s) (11.296) 
Substitution of (11.296) into the Volterra integral equation of the second kind 
given in (11.268) leads to 


v(a) = f(2)-+A]” plaa(s)y(s)ds (11.297) 


This can be rewritten as 


yx) = f(x) + Ap(x)Y(x) (11.298) 
where 
Y(x)= | “q(s) y(s)ds (11.299) 
Now, we can differentiate (11.299) with respect to x to give 
aw = q(x)yV(x) (11.300) 
dx 


Substitution of (11.298) into (11.300) leads to the following first order ODE for 
Y(x): 
dY(x 
= g(x) f(a) + 2g) POY) (11301) 
This is the general linear form of the first order ODE and can be solved readily. We 
will illustrate this solution method in the following example. 


Example 11.15 Solve the following Volterra integral equation of the second kind 
with separable kernel: 


t 
y(t) = Vt - } 2Jis y(s)ds (11.302) 
0 
Solution: Let us define: 
t 
Y()= } Js y(s)ds (11.303) 
0 
Substitution of (11.303) into (11.302) gives 
y(t) = Vt -2VtY) (11.304) 
Differentiation of (11.304) with respect to ¢ gives 
Y(t) =Vty =t-21Y(1) (11.305) 


The solution of the corresponding homogeneous ODE of (11.305) is 
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¥,@=Cce? (11.306) 
Using the procedure of variation of parameters, we seek 
YO=C(xe" (11.307) 


Substitution of (11.307) into (11.305) gives an equation for C and the solution of 
(11.305) is 


1 2 _p 
are a )+Ce (11.308) 
From the definition of (11.302), we have 
Y(0)=0 (11.309) 
Using this boundary condition for Y, we obtain 
yiy=50-e") (11.310) 


Finally, back substitution of this Y into (11.303) gives the solution of (11.302): 
y(t) =Vvie™ (11.311) 


More generally, Tricomi (1957) called the following separable kernel as the 
Pincherle-Goursat kernel 


K(x) = XY.) (11.312) 
k=l 
It was shown that for this case the solution of the Volterra integral equation can be 
converted to solving a system of linear algebraic equations. The details are found in 
Tricomi (1957). 


11.7.5 Volterra Integral Equation of Convolution Type 


The second special form of the Volterra integral equation of the second kind that 
can be solved easily is the convolution type kernel. That is, 

K(t,s)=K(t-s) (11.313) 
The argument in the kernel only appears as the difference between ¢ and s. For such 
cases, the Laplace transform can be used to solve the integral equation. In 
particular, we have 


10h = 76044 } ” K(x—s)u(s)ds (11.314) 
If the Laplace transforms of two functions f| and fo exist, we have 
F(s)=L{AD}, KHs)=Lth Od} (11.315) 
The convolution product is defined as: 
(hd) =(* $Me) = [hod float (11.316) 


By the convolution theorem for the Laplace transform, we have 
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LK * Ado} = LA] AC -Of Od} = F)F(s)_—— 11.317) 


It is clear that the integral in (11.314) is of the convolution type, thus (11.317) can 
be applied to (11.314). We will illustrate this method by the following example. 


Example 11.16 Solve the following integral equation of convolution type by using 
the Laplace transform: 


(0) =1+2f cos(t—-w)ruddu (11.318) 


Solution: Applying the Laplace transform and convolution theorem, we get 
t 
Lhy} = Li} + 2L4f cos(t —u) y(u)du} = L{l} + 2L{cost}L{y(t)} (11.319) 
0 


Using the table of Laplace transforms, we obtain 


¥(s)=14% a) (11.320) 
Ss so +l 
Solving for Y(s), we obtain 
2 
¥w=" us (11.321) 
s(s—L 
Application of partial fraction gives 
1 2 
¥(s) =—+—_, (11.322) 
s (s-) 
Taking the inverse Laplace transform, we obtain 
1 2 
th=L'Y(s)p=L 34+"! 
y(t) {Y(s)} th ‘Gop (11.323) 


=1+42te' 


Example 11.17 Solve the following integral equation of convolution type using the 
Laplace transform: 


‘ yW) 
du = Jt (11.324 
J 0 Vt—u 
Solution: Applying the Laplace transform and convolution theorem, we get 
uf 22 day = Lh} (11.325) 
0 Vt-u 
Noting that 
TG/2) Vx 1, vax 
LiW=—Ga 5am Mpls a (11.326) 


substitution of (11.326) into (11.325) gives 
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1 Va Va 


MyOH =Y(s) 2 = 55 (11.327) 
Thus, we have 
Vigo (11.328) 
2s 
Note that the following inversion formulas for the Laplace transform 
1 1 
C'N-t=—=1, pd =L'{YV%(s 11.329 
a Ti y(t) Y(s)} ( ) 
Taking the inverse Laplace transform of (11.328), we obtain 


1 
yO= a (11.330) 


11.8 FUNCTIONAL ANALYSIS AND VECTOR SPACE 


In this section, we continue to discuss the work of Volterra, Fredholm, and Hilbert 
and its subsequent development into functional analysis. The term functional 
analysis was coined by Levy. We recall from the previous section that in our 
discussion on the Fredholm alternative theorem we needed to introduce the concept 
of the inner product (in terms of integration), adjoint integral equation, the 
orthogonality between the nonhomogeneous terms and the eigenfunctions of the 
adjoint problem. In fact, the inner product and orthogonal properties are the main 
assumptions in the analysis of Hilbert, and functions that satisfy such conditions 
(plus some more standard operational requirements of the inner product, definition 
of norm, and convergence requirements) are said to form the Hilbert space. Such 
functional analysis mainly deals with the existence of the solution (similar to the 
role of the Fredholm alternative theorem) and is also referred to as abstract space 
analysis. Some of the major contributors to such development include Volterra, 
Hilbert, Riesz, Schmidt, Gelfand, Wiener, Fischer, Frechet, and Banach. There are 
many functional spaces being proposed, and the more notable ones include Hilbert 
space, Banach space, Schwartz space, Sobolev space, and Holder space. For 
example, the so-called Banach space does not have the orthogonal requirement 
between functions and is less strict compared with Hilbert space. In mechanics, it 
was discovered that Hilbert spaces are applicable to the analysis of quantum 
mechanics. In this context, the eigenvalues are the quantum energy levels. The 
work of Hilbert is also related to the concept of the discrete and continuous 
spectrum of the eigenvalues. It turns out that the Hilbert integral equation is a 
singular type of integral equation, and for such case the eigenvalue becomes 
continuous (at least within some interval). 


11.9 INTEGRAL EQUATION VERSUS DIFFERENTIAL EQUATION 


In 1904 and 1905, Hilbert showed that the eigenvalues and eigenfunctions of the 
following Sturm-Liouville problem 
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Fp) y+ goon + du =0 (11.331) 
dx dx 
subject to the boundary conditions 
u(a)=0, u(b)=0 (11.332) 
are also the eigenvalues and eigenfunctions of the following integral equation: 
Hx)-Al G(x. EO = 0 (11.333) 
The kernel function G(x, €) is the Green’s function of the following ODE: 
£ tp) ae +q(x)u =0 (11.334) 
dx dx 


The derivative of this Green’s function OG(x, 6)/& has a jump of —1/p(é). In other 
words, an integral equation is a way of solving ordinary or partial differential 
equations. 
To further investigate the relation between an ODE and an integral equation, 
we consider the following first order initial value problem: 
B= altult) +) (11.335) 
subject to initial condition 
u(0) = ug (11.336) 
We note that (11.335) can be obtained by differentiating the following Volterra 
integral equation of the second kind: 


u(t) = g(t) + [, Kesw(syas (11.337) 
where 
Ome [boas Kio ras (11.338) 


For this particular case, the kernel K is independent of ¢. 
Secondly, we consider the following initial value problem model by second 
order ODE: 
2 


Ke = atu +0 (11.339) 
t 
subject to initial conditions: 

u(0)=up, u'(0)=Vo (11.340) 


This equation can be obtained by differentiating the following Volterra integral of 
the second kind twice: 


u(t) = g(t) + [, Kesw(syds (11.341) 
where 
Z(t) =Up + Volt + { (t—s)b(s)ds, K(t,s)=(t—s)a(s) (11.342) 


Therefore, the second order ODE with non-constant coefficients given in (11.339) 
can be solved by considering the Volterra integral equation of the second kind in 
(11.341). 
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However, if we start by differentiating a general form of Volterra integral 

equation of the second kind given in (11.341), we have 
Gu) _ BO. Ke du +] = a 5) 
dt dt 

This is not an ODE, but instead it is a Volterra re equation (which 
will be discussed in more detail in a later section). Therefore, an ODE can be 
converted to a Volterra integral equation, whereas a Volterra integral equation of 
the second kind with arbitrary kernel K may not correspond to any ODE. 

Consider the following n-th order ODE with initial conditions 


n nl 


ao uls)ds (11.343) 


 eaeae atta, (xu = F(x) (11.344) 
te dx" 
u(0) = Cp, u'(0) = c,..., u” (0) =c,_, (11.345) 
This is equivalent to the following linear Volterra integral equations of second order: 
0) = $0) J Kydd (11.346) 
where 
n-l 
I(x) = F(x) — ¢,_14(X) — (C1 + C9 dy (x) —...-[e,-4 a +...4+¢)X + C9 Ja, (x) 
(11.347) 
m-1 
K(x,y) = Yan) or Z yi (11.348) 
m=1 . 
Consider the following boundary value — modelled by an ODE: 
d° 
es + cy + D(x)w= F(x) (11.349) 
Ms 
w(ia)=7, w(b)=6 (11.350) 


This is equivalent to the following Fredholm integral equation of the second kind 
(Zwillinger, 1997): 


w(x) = nea [Hes S)w($)dg (11.351) 


nx)=y+[ - 


a ae (a—6\NC(G)-DO), x >> (11.352) 


A(x,6)= 


“IC()- (b-o\C(S)-DS)), x<o 


Therefore, the ee pale problem can be expressed as a Fredholm integral 
equation of the second kind. 

For multi-variable function u, we can also have the following combined or 
mixed Volterra-Fredholm integral equation of the second kind (Zwillinger, 1997): 
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a =eewe i. [ Kesx £)u(s,£)déds (11.353) 


In contrast to the Fredholm integral equation, this mixed Volterra-Fredholm 
integral equation has a unique solution, and it can be expressed in a resolvent 
kernel as 


u(t,x) = e(t,x)+ ik [Re s,x,)2(s, \dEds (11.354) 


Example 11.18 Find an equivalent differential equation for the following integral 
equation: 


y(t) = Seost+ [ (aa (11.355) 


Solution: Differentiation of (11.355) with respect to ¢ once and twice gives 


t 
y(t) =—Ssint+ I y(u)du (11.356) 
0 
y"(t) =—5cost+ y(t) (11.357) 
Rearrangement of (11.357) gives 
y"(t)— y(t) = —Scost (11.358) 


We can substitute ¢ = 0 into (11.355) and (11.356), and we find the following initial 
conditions for the ODE found in (11.340): 
y(0)=5, y'(0)=0 (11.359) 


11.10 INTEGRO-DIFFERENTIAL EQUATION 


Many physical problems need to be modelled by integro-differential equation, such 
as airfoil studies, temperature variation in melted glass under heat conduction and 
heat radiation, Boltzman’s model of the distribution of particles of an ideal gas in 
an enclosure, probability of a customer’s waiting time in a queue, probability of 
brightness of a star being reduced by clouds of interstellar dust, and the theory of 
atomic scattering. Literally, it means that the unknown of the problem appears in 
derivative form as well as inside an integral in the same equation. For the case of 
one variable, a rather general form of integro-differential equation is written 
symbolically as 


y (x) = f(x, v(x), vy’. yr =a [xb 5, ¥(5)y.05 ¥"(s)]ds) (11.360) 


For n = m, (11.360) has a unique solution. The boundary conditions for (11.360) 
are 

yO(a)=y,, k=0,1,....n-1 (11.361) 
A number of more notable integro-differential equations are reported here. Integro- 
differential equations may also appear in a pair of coupled systems. For example, 
the growth of two conflicting populations can be modelled by the following 
Volterra nonlinear integro-differential equation 
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dx t 
oe ax—bxy— x] K,(t—s)y(s)ds (11.362) 
dt 0 

a = -ax+ ry ty] K; (t—s)x(s)ds (11.363) 

dt 0 


This is a prey-predator model proposed by Volterra, taking into account heredity 
factors. The integro-differential equation of Abel is given as 


° (x-y)* 
The Darboux-Picard integro-differential equation is 


Z(X, V) = O(x)+T(y)— Zo + [0 fenvy2.2452y)dudy (11.365) 


Relating to airfoil analysis, Prandtl’s circulation equation is given by 

lr 1 ld 
o(n+—[ (——)Zaé= fe) (11.366) 
mst E-x dé 
where the integral is taken as a Cauchy principal value. For radioactive transfer, we 
have the following partial integro-differential equation: 


On(r, l-w’ dg(r, 1) 
pO) HOD + o(r,4)-2[ olr.edé=0 (11.367) 
or r Ou 24-1 

For modeling multi-electron atoms, the following Thomas-Fermi equation has been 
proposed: 

at ogg | “B22 dt (11.368) 
dx 0 
We now quote the two most popular integro-differential equations, namely the 
Volterra type and Fredholm type of integro-differential equations. Mathematically, 
they are written as 


u(x) = f(x) Al Kut (11.369) 


b 
u(x)= f(x)t4 | K(x,tu(t)dt (11.370) 


Sometimes, the Volterra and Fredholm integral equations may appear 
simultaneously in a single equation as the linear Volterra-Fredholm integro- 
differential equation: 


ul (x) = f(x) +A, } "K,(x,t)u(t)dt +A, [& (x,)u(dt (11.371) 


They may also appear as a combined form in a mixed Volterra-Fredholm integro- 
differential equation as: 


u(x) = f(x+a if } ” K(r,t)u(t)dtdr (11.372) 


There are various techniques that we can use to solve for their solutions, such as the 
variational iterative method, Laplace transform technique, wavelet method, Taylor 
series method, Adomian decomposition method, modified Adomian decomposition 
method, and direct computation method. For details, we refer readers to the 
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excellent coverage by Wazwaz (2011) and more recent publications on this topic. 
Since the linear integro-differential equation is a special case of the nonlinear 
integro-differential equations, in the next section we will demonstrate two 
techniques via nonlinear Volterra and Fredholm integro-differential equations. 
More specifically, they are the Adomian decomposition technique for the nonlinear 
Volterra integro-differential equation and the direct computation technique for the 
nonlinear Fredholm integro-differential equation. 


11.10.1 Nonlinear Volterra Integro-Differential Equation 


If the unknown function does not appear linearly inside the integral of (11.369), it 
becomes a nonlinear Volterra integro-differential equation and is expressed as: 


uM) =f +al "K(x,t)F(u(d))dt (11.373) 


Although there are various techniques that can give approximations to (11.373), we 
will only cover two of them, namely the Adomian decomposition method and the 
modified Adomian decomposition method. This method was discovered by George 
Adomian who was a professor at the University of Georgia and had published a 
number of books on this decomposition method. 


11.10.1.1 Adomian Decomposition Method 


We now introduce a method called the Adomian decomposition method. We can, 
in general, integrate both sides n times with respect to x to get 


u=L'{f(x)}+AaL { } RE t)F (u(t))dt} (11.374) 
If n = 1 and a = 0 (this choice is only for e sake of simplicity), we have 
u= g(x) + ALY K(x) Fw())de} (11.375) 
where 
g(x) = I, f (bat (11.376) 
Next, we assume that the solution wu can be decomposed in an infinite series 
u(x) =S'4,(9) (11.377) 
n=0 


In addition, the nonlinear integrand can be decomposed into an infinite series of 
Adomian polynomials as 


G(u) = K(x,)F(u() = D1 4, (11.378) 
k=0 
where A; are the Adomian polynomials and are defined as 
A, =—)— 1G) Nu; 11.379 
ke | pa a ee 
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Substitution of (11.378) and (11.379) into (11.375) gives 


i.e) 


Doin) = g(x) + AL I, YAc(s)ds} (11.380) 
n=l k=0 

Comparing each term from both sides starting with the zero order term with the 
first term on the right hand side: 


Ug (x) = g(x) (11.381) 
aC ae 1M, coat, n=0 (11.382) 


The explicit forms of these Adomian polynomials are 
Ay = Guy) 
A, =u,G'(u) 


A, =U,G"(ug) + SHI") (11.383) 
' ” 1 m 
Ay = usG' (ug) + 14yG"(uy) + =u G"(Up) 


1 1 1 iy 
Ag =uyG'(ug) + OF Us + UyU3)G" (Uy) + 3 i t2G"(wo) + que )(up) 


To see the physical meaning of these Adomian polynomials, we can substitute 
(11.383) into (11.378) to give 
G(u) = Ay + A, + A, +g +... 


= Guy) + (uy, + Uy +z +...)G'(up) +... 


1 

+5 ur + Digits + Quy bus +...)G" (ug) +... (11.384) 
eee: 2 ” 

re + 3uy Uy +3uj Uz + OujUzUz +...)G"(Ug) +... 


= G(uy) + (u—ug)G'(up) + Stu =4,) GG) 


We see that Adomian polynomials are a Taylor series expansion about a function 
uo instead of the traditional Taylor series expansion about a point. 
Before we consider some examples, it is instructive to consider some 
nonlinear functional forms of G(u): 
Case 1: G(u) = wv? 
2 
Ay = Glu) = 13 
A, = u,G"(Ug) = 2ut9uy 


1 
A, =u,G'(ug) + sui Guo) =2ugu, tu; (11.385) 


1 
A, =U,G' (uy) +uju,G" (ug) + i") = 2ugu, + 2uyu, 
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Case 2: G(u) =u3 
_,3 
Ay = Up 
2 
A, _ 3ug uy; 
A, = 3upuy + 3ugutp 
4 = Buju, + 3uguy 
A; = 3ugu3 + OUglUy + u; 
Case 3: G(u) =u" 
_4 
Ay = Up 
A, = 4ugu 
1 = Ugly 
2 
A, = 4upu, + 6ugu; 
Az = 4ujuy + 4u; Uy + 12upujuy 
Case 4: G(u) = sin u 


Ap = SiN Uy 


A; =U, COSUy 


1). 
A, =Uy COSUy ——Uj SIN Uy 
2! 
; 1 3 
Ax = Uz COSUy — Uy SIN Uy — ail COSUy 


Case 5: G(u) = cos u 
Ay = COS Ug 


A, =—Uy SIN Ug 


‘ 1 , 
A, = Uy SINUg — —Uy COSUy 
2! 
. 13. 
Ax = Uz SINUg —UjUy COSUy + 34 SiN Uy 


Case 6: G(u) = e” 
Ay =e 
A, =ue" 


1 2) u 
A, ge ee ye 


1 3) uu 
A, = (Uz +UyUy aor’ ye"? 


(11.386) 


(11.387) 


(11.388) 


(11.389) 


(11.390) 


Example 11.19 Solve the following nonlinear integro-differential equation by 


Adomian decomposition method 
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aan u>(t)dt, u(0)=0 (11.391) 


Solution: Integration on both sides and in view of (11.378) gives 


eee bs “( 54, (Oat (11.392) 
Substitution of (11.377) into (11.392) gives " 
7 (x) =-x40 rn a (1)dt} (11.393) 
Thus, we have = a 
U(x) = g(x) =—x (11.394) 


Application of (11.385) and (11.394) for the first term 
x AY x Ss x AY 4 
=| [ A@atds =| [ usodids =[ | Patds=— (11.395 
a) J, J, 4 / 0 se 0 J0 7 12 ( ) 


Similarly, for higher order terms we have 


x Ss Bs S 7 

un(x)= | i Ay(dids =—— [ I, Sdids = (11.396) 
3 (x A — 11.397 
=f [40 a CL-397) 

ae x/3 
u(={ [4 Dats =-—— [|e 7 11.398 
4@) ay 80 1008 ~ 157248 

Thus, we have an approximation as 
4 7 10 13 

eee eG ae eee ae (11.399) 


12 252 6048 157248 


Clearly, the Adomian decomposition technique can also be used to solve nonlinear 
Volterra integral equations. This is illustrated in the next example. 


Example 11.20 Solve the following nonlinear Volterra integral equation by 
Adomian decomposition method 


u=x+l" w2(t)dt, u(0)=0 (11.400) 


Solution: Using the Adomian decomposition method to (11.400), we obtain 


34, (0) x+[e 514, (oa (11.401) 
n=l n=0 
Comparison term by term on both sides gives 


Ug (x) =x (11.402) 
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Up 4(X) = } ‘ A,(t)dt, k20 (11.403) 
0 
Direct integration of (11.403) term by term gives 
u,(x) = ( u2(t)dt = 7" (11.404) 
0 
t (x) = [2 pee (11.405) 
2 0 01 15 : 
7 (x)= [120 ee iss (11.406) 
3 0 OF 2, 1 545 - 


Therefore, the Adomian decomposition method gives the following approximation: 
3 
x 2 5, 17 7 


u=x4 +—x" 4 x +... (11.407) 
3 15 315 
In fact, the exact solution for this case can be shown to be 
3 
(eee Oe (11.408) 
3 15 315 


Thus, the Adomian composition does converge to the exact solution. 


11.10.1.2 Modified Adomian Decomposition Method 


For certain nonlinear integro-differential equations with the nonhomogeneous term 
consisting of a series of terms, a modified Adomian decomposition method was 
proposed by Wazwaz (2011). Under certain restrictions, we may arrive at an exact 
solution due to a so-called noise term phenomenon. 

In particular, we will illustrate this using the following nonlinear integral 
equation: 


u(x) = fi(x) + f(x) + Al K(x, t)F (u(t))dt (11.409) 
Using the Adomian decomposition method, we have 
n= ACO+h (+f KANDA, (1)dt (11.410) 
n=0 n=0 
Thus, we have 
ug(x) = f(x) (11.411) 
u(x) = fy(x) +A [, Kesn4y(ndt (11.412) 
Uy (x) = Al K(x, (t)dt, k>1 (11.413) 


We will illustrate this in the next example. 


Example 11.21 Solve the following nonlinear integral equation by modified 
Adomian decomposition method 
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ue ai43x 3 oat [ude (11.414) 
0 


Solution: Using the modified Adomian decomposition method by Wazwaz (2011), 
we can split the nonhomogeneous term into 


f(a) =143x-57, fils) = =x! (11.415) 
Thus, we have 
ug(x) = f,(x) =143x a (11.416) 
u(x) = =" 3 + [ (x —t)Ap (t)dt 
a x8 (11.417) 
=-x3 = + [ (x—t)ue (t)dt 
4 0 : 
Substitution of the result in (11.416) into (11.417) gives 
u(x) ox Sx xf U4 6r48F 30 4 eat 
4 0 4 
~[ e+ 60 48° —314 at (11.418) 
0 
_l a Dea 
2 12 20 120 
Adding the first two terms, we have 
1, 15 14 3.5 1 «6 
U(X) © Uy (Xx) +u, (x) =14+3x x +—x x x + x 11.419 
(x) © Up (x) + uy (x) 5 ; oi 0 on ( ) 


We see that the last term of wo cancels the first term in uw, and these are called noise 
terms. Wazwaz (2011) discovered that when these noise terms appear, the exact 
solution may be the first two terms of uo. However, we must check this by 
substituting it into the original integral equation. 

In particular, we assume that u is 1+3x and, thus, we have 


u(x) © (1+3x)? =14+6x+9x? (11.420) 
i 2 
I (8-04 649 dr == 48 44 (11.421) 
0 


Using these results, we find that (11.414) is satisfied exactly. This is called the 
noise term phenomenon. 


We want to emphasize that only certain types of integral equations will contain 
noise terms and will lead to the exact solution. We see that the exact solution must 
be contained in the first nonhomogeneous term /i(x). Even if there are noise terms 
in the first two terms in the solution series, it does not automatically lead to the 
exact solution. It is necessary to show that the non-canceled terms of uo(x) indeed 
satisfied the original equation. If it is not the exact solution, we have to continue 
our normal calculation for the remaining Adomian polynomials. In general, it has 
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been found that the Adomian decomposition method converges faster than the 
Picard iterative method. It can be used to solve either linear or nonlinear, and either 
an integro-differential equation or integral equation. So, it is highly recommended 
for solving Volterra type integral equations or Volterra integro-differential 
equations. 


11.10.2 Nonlinear Fredholm Integro-Differential Equation 
In this section, we consider another commonly encountered nonlinear Fredholm 
integro-differential equation: 
b 
u(x) = s+ K(x,)F(u(t)dt, u®(0)=h, O<k<n-1 (11.422) 


We will only consider the case that the kernel K(x,f) is separable or a P-G 
(Pincherle-Goursat) kernel. That is, the kernel is written as: 


K(x.) =) gO (11.423) 
k=l 
Substitution of (11.423) into (11.422) gives 


b b 
u(x) = f(2)+ B2)| OF UM)dt + 89(2)] hy (OF WO)dt +... 


+g ()[ OPW oat (11.424) 


= f(x) + QE (x) + a8 (X) +...+@, 85 (x) 
where 


b 
a, = i} h,(t)F(u(t))dt, 1<k<n (11.425) 
We can integrate both sides of (11.424) n times to get 
u(x) = u(0) + xu'(0) + + 2") +...4+ 1 tye) 9) 
2! (n-1)! (11.426) 


HLS (2) +0481 (0) + 0289 (2) +... +E, By ()} 
where L7! is the n-fold integral operator. When (11.426) is substituted into 
(11.425), we will have n equations for all the constants a@% (k = 1,2,...,n). 
This procedure is called the direct computation method by Wazwaz (2011), 
and it is illustrated in the next example. 


Example 11.22 Solve the following nonlinear Fredholm integro-differential 
equation by the direct computation method: 


2 a 
oH <cosx— x4 f xtu(t)dt, u(0)=0 (11.427) 
dx 4 0 


Solution: Clearly, the kernel K is separable and thus we have 
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du nr 
—=cosx+(a@—-——)x 11.428 
r ( re, ( ) 


where 
ee I ” w(t)dt (11.429) 
0 


Integrating (11.428) with respect to x and applying the boundary condition given in 
(11.427) leads to 


2 
u =sinx+(5—“ x (11.430) 


Back substitution of (11.430) into (11.429) gives an equation for @ The two 
solutions for @ are: 


nr 2-967? +5767 +96 


a=—_, 11.431 
4 4n° 
There two corresponding solutions for u(x), and they are 
ge )x? (11.432) 


u(x) =sinx, Sie ee 
a © 


Since the integro-differential is nonlinear, we can have more than one solution. 


11.11 SUMMARY AND FURTHER READING 


In this chapter, we review the concept of the integral transform, including the 
Laplace transform, Fourier transform, Hankel transform, Mellin transform, and 
Hilbert transform. It was shown that for different equations we need a different 
integral transform technique. The Abel integral equation is discussed with its 
connection to the tautochrone problem in mechanics. Fredholm integral equations 
of the first and second kinds are presented, and Fredholm’s solution is summarized. 
We also discuss the adjoint integral equation and its role in the Fredholm 
alternative theorem. The application of the Fredholm alternative theorem to 
investigate the existence of a particular solution for a nonhomogeneous integral 
equation is discussed. Volterra integral equations of the first and second kinds are 
summarized and the classical solution using the Liouville-Neumann series or Picard 
successive approximation method is presented. Two special kinds of Volterra 
integral equations are considered: separable type (that can be solved by direct 
computation) and convolution type (that can be solved by using the Laplace 
transform). The relation between the existence of a solution of the integral equation 
and functional analysis is discussed. We also discuss how to convert a differential 
equation into an integral equation, or vice versa. Finally, both linear and nonlinear 
integro-differential equations are discussed. Nonlinear Volterra integro-differential 
equations are solved using the Adomian decomposition technique, whereas 
nonlinear Fredholm integro-differential equations are solved by using the direct 
computation method for the case of separable kernels. 

The best textbook on integral transform is Sneddon (1972). An introduction 
to integral equations can be found in Davis (1962) and Tricomi (1957). For 
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compilation of the solutions of integral equations, we refer to the comprehensive 
handbook by Polyanin and Manzhirov (2008). For more comprehensive coverage 
of integro-differential equations, we highly recommend the book by Wazwaz 
(2011). There are other types of solution techniques for solving integro-differential 
equations that are not covered in the current chapter. They include the variational 
iterative method, wavelet-Galerkin method, Taylor series method, and least square 
method. In the current chapter, for the case of the nonlinear Volterra integro- 
differential, we cover the Adomian decomposition method and its modified version 
proposed by Wazwaz (2011), and for the case of nonlinear Fredholm integro- 
differential we present the direct computation method if the kernel is separable. 


11.12 PROBLEMS 


Problem 11.1 Consider the following double Fourier transform for three- 
dimensional space: 


u(x, y,Z) = ff. U(E,n, zee ™Mdédn (11.433) 


UEn2)=5-| | u(x, y, ze") dxdy (11.434) 
TT 2-009 —00 


Applying this double Fourier transform to a three-dimensional PDE, we can 
convert simultaneously the differentiation with respect to x and y to algebraic forms 
in € and 7 respectively. Prove the Faltung theorem or convolution theorem for this 
double Fourier transform: 


=. {" F(C392) G(E,n, ze (E"™ dédn 
a Oe we aes 
=5— | [_ stev2)e—u.y—v2)dudv 


where Fis the double Fourier transform of f 
F(é.n, Zz) = =| | : F(x, y, ze) dedy (11.436) 


The function G relates to g in a similar manner. 


Problem 11.2 Solve the following half-space problem satisfying the Laplace 
equation by using the Fourier transform for both x and y axes (see Figure 11.7) 

Ou 4 Ou Oru 
ax? Oy? Gz? 
with boundary conditions 


u(x, y,0)= f(x,y), uO, asr=afx7 ty? +2" > (11.438) 


Vu(x, y,Z) = =0, 0<z<0,-«o<x,yp<o (11.437) 
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Ans: 
1 pe px ee ads 
u(x,7.2) =| | F(é,nye ean Fete dEdn 


Figure 11.7 Half-space subject to boundary condition 


(11.439) 


Problem 11.3 Show that the following double Fourier transform can be converted 


to the Hankel transform of zero order: 


gs I(S)X1 +52X> 
F&&)=5-[ JM xP +x3 el +52 dx, dx, 


That is, 
1 ie} 
F(p)=—| if )Joleryar 
27 40 
where 
paG+8, rasetx 
Hints: 


(1) Use the following change of variables 
6 =pcosy, € =psing x =rcosO@, x, =rsind 
(ii) Show that the area integral can be converted as (use Jacobian): 
dx,dxy = rdrd@ 
(111) Show that the following equality 
GX +$2x, =rpcos(— 9g) 
(iv) Show the following identity: 
FE.)=—— ff [ere doar 
27 40 0 


(v) Because of the periodicity of 27 for 8, we must have: 


(11.440) 


(11.441) 


(11.442) 


(11.443) 


(11.444) 


(11.445) 


(11.446) 
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2a, 2m, 
( efPreos’9-P) 1g = } elPrees8 ag (11.447) 
0 0 
(vi) Note that the integral representation of the Bessel function of zero order is: 


2n . 
i(or)=——| elProos9 gg (11.448) 
2n 40 


Problem 11.4 Consider the Fourier transform of the following function 


e“@(t), t>0 
= 11.449 
S(t) 0, L2G ( ) 
Show that the result is 
af ()]= [, eM @(t)dt = L{@(t)} (11.450) 


where p = xtis. Note that this is a Laplace transform of ®(). Thus, this problem 
relates the Fourier transform to the Laplace transform. 


Problem 11.5 Find the Fourier transform of the following rectangular function 


nw =1> laa (11.451) 
~ 10, \‘|=1/2 
1S 7 ma@=, fef<1/2 
=0, | 21/2 
t 
-| 0 1 
Figure 11.8 Rectangular function 
Ans: 
1 sin(s/2) 
F(s) = — ——— 11.452 
(s) Jin s/2 ( ) 
Problem 11.6 Consider the Mellin transform of the following function 
fjsss (11.453) 
l+¢ 


Hint: 
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(1) Use a change of variables of 


(efe—*= (11.454) 
l-x 
to show that 
1 
MIf(O]= } (x) x at (11.455) 
0 
(i1) Note that the beta function is defined as 
1 
B(x, y) = i, a1 teat (11.456) 
and can be expressed in terms gamma function as (Lebedev, 1972): 
B(x, y) = FOO) (11.457) 
I'(x+y) 
(iii) Recall from Chapter 4 that 
T(s)P-s)=— (11.458) 
sin 7s 
Ans: 
ie a (11.459) 
l+¢ sinzs 
Problem 11.7 Consider the Mellin transform of the following function 
f(t) =H(t=ty)t? (11.460) 
Ans: 
(z*8 
M[f(@)]=-2 (11.461) 
zt+s 


Problem 11.8 Show the validity of (11.224) for the case of n = 2. In particular, 
show the following 


. l+a, ay | 
(i) = 14 a) + Ay) + (4) Ay9 — Ay] 4)2) (11.462) 
Ay 1+ dy) 
a, a a a 
(ii) aie Genin (11.463) 
49, 472, |%2 Ay 


(i11) Use the above results to show that 


l+a a 2 1 
in 12 =1+ 14, +> 
11 2! 

yh 


nn nn 


(11.464) 


a 
a a 


On 6 


a, 1+a)) 


Holy 
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Problem 11.9 Show the validity of (11.224) for the case of n = 3. In particular, 
show the following 


I+a) a2 a3 
Ay, 14a, yx | = 14 yy + Ay +433 + (Gy Agy — Ad A) ) 
; 43] ayy = 1 +43, 
(i) (11.465) 


+( A) 1433 — 413431) + (Ay9433 — A343) 
Fy 1A7933 + Ay 7A7343) + A309 1439 
3.431499 ~ 433491 4)2 — Ay 1473439 
Ay Aq 43) 41 3 AQ} 422 01 93 
49) 42 493) = |431 9330 430) = 1412 3 
(ii) 43, 439 433] [421 423, 4n2] 432, 931 433 (11.466) 
42 493° 421) 1433, 431 932) 433, 432, 431 
= 1437 9330 431) =]%3 0 41 2) =]403 422 1 
42 43 Ai} [423 421 422} [4130 42 AY 
(iii) Use the above results to show that 


I+a, ap a3 3 Gin Fury Gan, 


3 3 
1 Gey Ui 1 
a) 1+ a2 493 |= Le ; ayy, + oy ° a 31 : Diy Dir Din 
as, a3, +433 i‘ cL) ee ae ae 
afi 312 313 
(11.467) 


Problem 11.10 Solve the following integral equation by Laplace transform: 
t 
y(Q) =1+2f] cos(e—u) y(u hau (11.468) 
0 
Ans: 
y(t) =t+24+2(¢-De’ (11.469) 


Problem 11.11 Solve the following Abel integral equation by Laplace transform: 


‘ yu) 2 
du=1+t+t 11.470 
ees (11.470) 
Ans: 
pl? 
= B+ 68+ 807} (11.471) 
TT 


Problem 11.12 Solve the following Abel integral equation by Laplace transform: 


704 Theory of Differential Equations in Engineering and Mechanics 


t 
20) dy ate? (11.472) 
0 (t—-u) 
Hint: Using the following identity 
T(p)Fd- p)=—_, (11.473) 
sin pz 
show that 
2 21 
c= —— (11.474) 
3°) T(1/3)v3 
Ans: 
1/3 
y(t) = eae Bt+2} (11.475) 
TT 


Problem 11.13 Find the differential equation that is equivalent to the following 
integral equation 


y(t) =? -3t+4-3 fe uy y(u)du (11.476) 
Ans: 
y"O+6y() =0, y0)=4, y(O)=-3, y"(0)=2 (11.477) 


Problem 11.14 Find the differential equation that is equivalent to the following 
integral equation 


t 
r+] (7 + 4t—ut —u—2)y(u)du = 0 (11.478) 
0 
Ans: 
y"+B3t—2)y"+(¢+10)y'+3y=0, y(0)=0, y'(0)=0, y"(0)=0 (11.479) 


Problem 11.15 Solve the following integro-differential equation by Laplace 
transform: 
dy ff 
2 =( cos(t—u)y(u)du, y(0)=1 (11.480) 
dx 40 
Ans: 


y(t) =14 (11.481) 


Problem 11.16 Solve the following integro-differential equation by Laplace 
transform: 


[, wore -wdu =248, (0) =0 (11.482) 


Ans: 
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y(t) =+—— 1°” (11.483) 


Problem 11.17 Solve the following nonlinear integro-differential equation by the 
Adomian decomposition method and collect seven terms in the series 


M14 Pat, u(0) =0 (11.484) 
dx 0 

Ans: 

Aca x! e x0 x8 79x'6 1177x" 
12 252 6048 157248 264176640 135522616320 


+... (11.485) 


Problem 11.18 Solve the following nonlinear integro-differential equation by the 
direct computation method 


_e2 1 
a eae I xu?(t)dt, u(0)=1 (11.486) 
dx 2 0 


Ans: 
u(x) =e", e* +(30-10e)x? (11.487) 


Problem 11.19 Prove the validity of (11.385) to (7.390). 


Problem 11.20 Solve the following ODE with non-constant coefficient by the 
Laplace transform: 
d*y dy 
t——+—+4+9ty =0, 0)=3, y'(0)=0 (11.488 
re ty y(0) y(0) ) 


Hint: Refer to (11.33) and note the following result 
1 


Vs? +a? 


where Jo is the Bessel function of the first kind of zero order defined in Chapter 4. 


J ajet.< } (11.489) 


Ans: 
V(t) = 349 (32) (11.490) 


Problem 11.21 Solve the following ODE by the Laplace transform: 
y"—3y'+2y=4e", y(0)=-3, y'(0)=5 (11.491) 


Ans: 
y(t) = —Te! + 4e7! + 4te*" (11.492) 


Problem 11.22 Apply Mellin transform to solve the following wedge problem: 
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2 2, 
Vu =e a = 26 (11.493) 
ir ror r 
u(r,ta) =H(R—-r) (11.494) 


u(r,a@) = H(R-r) 


Figure 11.9 Wedge domain of potential theory subject to finite uniform boundary conditions 


Ans: 


ct+ioo 8 
u(r, 0) = 7 ae ES) ds 


- (11.495) 
2m Jc—iw 5 COS(S@) 


CHAPTER TWELVE 


Asymptotic Expansion and Perturbation 


12.1 INTRODUCTION 


In this chapter, we will discuss asymptotic expansion and perturbation analysis. 
Although these two methods can be studied independently, they are related. In fact, 
asymptotic series lays the fundamentals for perturbation analysis. The method of 
asymptotic expansion has been studied since the time of Euler, and, in fact, it is one 
of major weapons of Euler. However, asymptotic series are non-convergent, and 
thus caution must be taken in using them. Nevertheless, many famous results of 
Euler were, in fact, obtained by manipulating non-convergent asymptotic series. 
Although Euler never found any problems in his analysis with non-convergent 
series, Pringsheim did find counter-examples that non-convergent series led to 
erroneous conclusions. 

The works of Abel and Cauchy on non-convergent series led to the banishing 
of the non-convergent series by mathematicians for more than 25 years. It is the 
works by Poincaré in 1886 and Stieltjes in 1886 that refreshed the interest in 
divergent series and founded the development of asymptotic series analysis. It turns 
out that solutions expressed in terms of asymptotic series are accurate and useful if 
a finite number of terms are used, but its accuracy deteriorates as “too many” terms 
are used (because of its eventual non-convergent nature). The term “asymptotic 
series” was coined by Poincaré whereas it was called “semi-convergent series” by 
Stieltjes and “convergently beginning series” by Emde. Many mathematicians had 
studied asymptotic series, including Cesaro, Borel, Le Roy, Mellin, Mittag-Leffler, 
Van Vleck, Barnes, Hardy and Littlewood. A number of studies considered 
transforming asymptotic expansions into convergent series, including Airey, van 
der Corput, Miller, van Wijngaarden and Watson. Note that Poincaré’s theory of 
asymptotic series considered both summability of divergent series and the 
asymptotic solution of differential equations. These asymptotic solutions are useful 
in calculating planetary positions. In fact, perturbation methods using asymptotic 
series were motivated by its application in celestial mechanics. The perturbation 
theory is related to the three-body problem, which considered the perturbations of 
the motion of two bodies being influenced by the existence of a much smaller third 
body. An exact solution could not be found and series solutions of finite terms were 
formulated to obtain an approximation. The main contributors to the three-body 
problem were Euler, Clairaut, Lagrange and Laplace. 

In considering Prandtl’s boundary layer in viscous fluids, Friedrichs in 1941 
matched the inner and outer solutions of the boundary layer. Friedrichs was a 
student of Courant and a post-doc of von Karman. This “matched asymptotic 
expansion” was popularized by the books of van Dyke in 1964 and Cole in 1968. 
The method of multiple scales was introduced by Swedish astronomer Lindstedt in 
1883 and by Poincaré in 1886 for solving problems with more than one intrinsic 
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time scale. The method of perturbation is particularly useful in obtaining accurate 
approximate solutions for nonlinear differential equations. Such approximate 
solutions provide insight into the physics and mechanics of the problem, which 
computer generated simulations cannot produce. Sometimes, it also happens that a 
different approximate solution expressed in convergent series actually converges to 
the true value much slower than a divergent asymptotic solution (as long as we are 
not too greedy on the number of terms in the divergent series) (e.g., Bleistein and 
Handelsman, 1986). 


12.2 ASYMPTOTIC EXPANSION 
12.2.1 Order Symbols 
We first introduce the so-called order symbols or Bachmann-Landau symbols, 


which were introduced by Bachmann in 1894 and popularized by Landau in 1909. 
In particular, the “big oh” and “small oh” are defined as 


f =O), g=o(9) (12.1) 
where f and g satisfy the following equations respectively 
lim Lee) =finite, lim sé) =0 (12.2) 
é>& P(E) ££ GE) 


where & is typically a small parameter larger than zero. For example, if = ¢” and 
€o= 0, we have 


f =O(e?), then tim =i (12.3) 
60 ¢ 
where k is a constant, and we have 


g=o(e’), then lim-= =0 (12.4) 
esl ¢ 
In other words, f contains a leading order term of s? and g contains a leading order 
term of ¢? or higher. 


12.2.2 Asymptotic Series 


Among all divergent series, a particular type is known as the asymptotic series. 
Despite the fact that this series diverges, the value of the functions can be 
calculated with a high degree of accuracy if we take the sum of a suitable number 
of terms of such series. The asymptotic series have many properties that are 
analogous to those of the convergent series. The asymptotic expansion of a 
function is denoted as 


f(2)~ 42" (12.5) 
n=0 


for z — oo. Let us consider a particular function, which closely resembles 
exponential integral (Abramowitz and Stegun, 1964): 
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ee) e! 
f{@= ze" | oat (12.6) 
z ¢ 


We will now derive an asymptotic series for (12.6). Applying integration by parts, 
we get 


fede’ 7.e. “ 2 et 
f(2)= ze*| ze*{ ( vf at} 
ZL t 4 
z : (12.7) 
0 pt 
=1-z0"| at 
We can apply integration by parts another time to (12.7) to get 
I! oe 
f(@)=1-=+ (C1 ze721[ sit (12.8) 
Zz z ft 
Integration by parts again yields 
I! 2! et 
f@=1-=+ + Cy ze731[ cg (12.9) 
Z 2 Zz t* 
Finally, repeating the process of integration by parts gives 
Ir 2! 3! -1)""(n-1)! oe! 
f@=1-=4+5-S4 _ Me ae 1)" ntze* | <—dt (12.10) 
Z Zz Zz Zz Zz {* 


The last term is the error of the expansion for the first n terms. This series diverges 
for all values of z, and appears useless. However, we are going to show that it does 
give an accuracy solution if a finite number of terms are used. It was given that an 
exact convergent series of f(z) is 
2 2 2 
& Z Zz Z 
P@)=2e(Cy—Inz42~ 5 ta al 
where y is Euler’s constant, which is discussed in Appendix E. Numerical results of 
the sum of these series for z = 10 are compiled in Table 12.1. We see that the sum 
of the first 9 terms for (10) in the asymptotic series given in (12.10) is 0.9158192 
and for the first 10 terms is 0.91545632. In fact, the exact value of (10) is between 
these values. The divergent series actually converges for the first 10 terms before it 
diverges (see the result of 40 terms or more in Table 12.1). We need only 6 terms 
to achieve 3-decimal accuracy. A more accurate calculation shows that the sum is 
about 0.915633339264773. We also evaluate the sum of the exact convergent 
series given in (12.11), and these results are also compiled in Table 12.1 for 
comparison. If we take the first 9 terms, the result is totally out of order. We need 
40 terms to achieve 3-decimal accuracy. This series does converge but it converges 
very slowly compared to the asymptotic series. Therefore, we have illustrated that 
for a large value of z (in this case we use z = 10) the asymptotic series is practically 
much better than the exact series. This is the reason why the asymptotic series, even 
though it diverges, has been found very useful in getting approximate solutions. 
This observation is also true for approximate solution of differential equations in 
terms of asymptotic series. 
In general, the following properties of asymptotic series should be noted: 
1. Operations of addition, substitution, multiplication and rising to a power can 
be performed on asymptotic series just as on absolutely convergent series. 


ae (12.11) 
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2. One asymptotic series can be divided by another asymptotic series provided 
that Ao in (12.5) is not zero. The series obtained as a result of division will also 
be asymptotic. 

3. An asymptotic series can be integrated term by term, and the resultant series 
will also be asymptotic. In contrast, differentiation of an asymptotic series is, 
in general, not permissible. 

4. A single asymptotic expansion can represent more than one function. On the 
other hand, a given function can be expanded in an asymptotic series in only 
one manner. 


Table 12.1 Comparison of convergence asymptotic divergent series and convergent series of f (10) 


f0) 

Term Asymptotic divergent series Exact convergent series 
1 1.000000000 1568328.251 
2 0.900000000 —3938288.198 
3 0.920000000 8298637.243 
4 0.914000000 —14645597.96 
5 0.916400000 22065178.37 
6 0.915200000 —28922010.97 
7 0.915920000 33511282.09 
8 0.915416000 —34775132.2 
9 0.915819200 32668239.94 

10 0.915456320 —28030794.99 
11 0.915819200 22133696.69 
12 0.915420032 —16186401.12 
40 —16252520.109560400 0.915620319 
41 65339008.215229400 0.915636379 
42 —269186257.916409000 0.915632646 
43 1135819859.836470000 0.915633494 
44 —4905706446.5009 10000 0.915633306 
45 21677009301.383600000 0.915633346 
46 —97945211564.096600000 0.915633338 
47 452317004417.112000000 0.915633339 


When two functions are asymptotically equivalent, 
f(é)~ gle) (12.12) 
we mean that as e > 0 we have 
Sle) = g(e)tl +o} (12.13) 
For example, we have 
x+2~x, x70 (12.14) 
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éte~e, 630 (12.15) 
2sinhx-e*, x0 (12.16) 

We normally express an asymptotic sequence as 
f(x) ~ fy(xtefi(~) te’ fy(x) +... éE->0 (12.17) 


Whenever an asymptotic series in the form of (12.17) exists, the perturbation 
method for regular expansion can be used. In general, there are other forms of 
asymptotic sequences ¢, for the perturbation of a function. Here are some examples 
of asymptotic sequences: 


{b,(x)} =(X—- x9)", X > Xp (12.18) 
(G,()}=—, x00 (12.19) 
Xx 
Dn (x)} = oe x> 2, Ans > An (12.20) 
fh 


12.3 REGULAR PERTURBATION METHOD FOR ODE 


In this section, we consider the perturbation method for ODE. In general, three 

main steps are involved in the perturbation method: 

1. Identify a small parameter ¢ in the original problem. 

2. Assume an expression for the solution in the form of a perturbation series and 

find the differential equation for each order of approximation of the 

perturbation series with appropriate boundary conditions. 

The solution of the differential equation for each order is solved accordingly. 

4. The final answer to the original ODE can be found by substituting the solution 
of each order into the assumed perturbation series. 


LoS) 


To illustrate the idea, we consider the following ODE 
y'+2ey'-y=0 (12.21) 
where € is a small parameter of the ODE. We seek the following regular 
perturbation series 
y~ yo(x) +Ey, (x) +E? V(x) Ft... (12.22) 
Substitution of (12.22) into (12.21) yields: 


Ly" (x) ty," (x) $E7y9" (x) +... +2 v9 (x) + Ey! (x) + 87 79 (0) +...] 


, (12.23) 
—[vo(x) + Ey, (X) + €*y2(x) +...]~0 
By collecting coefficients of different orders of ¢, we have 
OI): ¥ (*)—Yo(x) =0 (12.24) 
Ole): yO) — V(X) =—2y9 (®) (12.25) 


O(E7): yo" (x) —y2(x) =-2y/' (x) (12.26) 
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OE")? Yn") Ya) = 29 (12.27) 
This set of ODEs for each level of approximation illustrates that the ODE is the 
same except that the first order does not have the nonhomogeneous term whereas 
all other nonhomogeneous terms depend on the solution of the previous solution 
(e.g., the ODE for y, depends on y,-1 for m = 1). Thus, we have to solve the 
solution order by order. 

The ODE given in (12.21) is of course linear. In general, for nonlinear ODEs 
the situation is much more difficult. For differential equations arising from 
practical problems, the solution normally behaves well at most points of the 
domain, except at isolated singular points (such as the singularity at the origin in 
the Bessel equation). However, solutions of nonlinear differential equations possess 
a richer spectrum of singular behaviors. For linear ODEs, the singularities are fixed 
by the coefficient functions of the ODE and are independent of the choice of the 
initial or boundary conditions. They are called fixed singularities. For nonlinear 
ODEs, in addition to fixed singularities, there are also singularities that move 
around as the initial or boundary conditions vary. These are called movable 
singularities (compare Section 4.14). This makes asymptotic analysis very difficult 
for nonlinear ODEs. 

The perturbation method for nonlinear ODEs is illustrated by the projectile 
problem considered in the next section. 


12.3.1 Projectile Problem 


In this section, we illustrate the perturbation method by considering the nonlinear 
problem of a projectile. Assume that a projectile is shooting up from the surface of 
the Earth, as shown in Figure 12.1. Using Newton’s second law, we obtain the 
equation of motion of the projectile as 


Figure 12.1 Projectile problem on surface of the Earth 
d*x_ GmM 
m—~ =-—— (12.28) 
dt (R+x) 
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for t > 0, where G is the universal gravitational constant, m is the mass of the 
projectile, R and M are the radius and mass of the Earth, and x is the position of the 
projectile measured from the surface of the Earth. The initial conditions are 


0 
x(0) =0, a) Yo (12.29) 
dt 
We note that at the ground surface the weight of the projectile can be expressed as: 
= ee (12.30) 
R 


Therefore, the mass of the earth M and universal gravitational constant G can be 
replaced by 


GM = gR* (12.31) 
Substitution of (12.31) into (12.28) leads to the following differential equation 
2 2 
as gk 5 (12.32) 
dt (R+x) 


For most applications, we have x << R, and (12.32) leads to the usual 
approximation of 


a 
d°Xy _ 


dt’ 
where xo is used to denote the fact that the solution of (12.33) only leads to the first 
approximation of x. We recover the gravitational constant of 9.81 m/s? on the 
surface of the Earth, but for an intercontinental missile or rocket projectile, the 
vertical distance x becomes not negligible compared to Earth’s radius R. The initial 
conditions (12.29) can be used as 


-2 (12.33) 


x9(0) =0, 5) os Yo (12.34) 
dt |,-o 
Integrating (12.33) once and twice, we get 
WT ayes (12.35) 
dt 
X= -Set +at+b (12.36) 


By using the boundary condition given in (12.34), the unknown constants can be 
found as a = vo and b = 0 and the final solution is 


1 
iy == +Vot (12.37) 
The maximum height that the projectile reaches can be found by setting the upward 
velocity to zero in (12.35) and substituting the result in (12.37) as 
2 
ig (12.38) 
2g 
We now return to the nonlinear ODE given in (12.32). We first normalize the ODE 
by using the following normalized parameters: 
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t t X(t X(t 

fu#) y= MO _ 8 (12.39) 
c Yo c Yo 

where f, and x. are chosen such that the initial condition can be simplified. 

Substitution of (12.39) into (12.32) gives 


dy _ 1 


= —-——___ (12.40) 
dt? = (1+ ey)? 
where the small parameter ¢ is defined as 
2 
XxX Vo 
pel eee 12.41 
R Re (12.41) 
For the radius of the Earth being 6300 km, we have 
2 
= ~9 = 1.618x10*v2(s? /m?) (12.42) 
& 


If the initial velocity vo = 7.86 km/s, we have ¢ = |. This is extremely difficult to 
achieve technically, so we can assume ¢€ is a small parameter. Thus, we seek the 
following asymptotic series 
y~ yo(t) +e y,(c) +... (12.43) 
where {> 0 is a parameter to be determined. Substitution of (12.43) into (12.40) 
gives 
1 
l+e[y(t) + €7 y, (2) +." (12.44) 
=—14+2¢eyo(r) +... 


To balance the term of ¢ on both sides, we must have B = 1. The initial conditions 
given in (12.34) lead to 


yo (tr) +6" y,"(r) +... 


y0)=" -0, y= 


c c ec 


t t 
£ (0)=“y =1 (12.45) 


For the f, and x. defined in (12.39), we arrive at the simple form of the second 
condition given in (12.45). Collecting the constant and linear order term of &, we 
get 


OW): yo () =-L 


(12.46) 
¥o(0) =0, yp (0) =1 
Ole): yy, (t)=2 F 
(€): yy @)=2yo(z) (12.47) 
y(0)=0, y, (0)=0 
The first order solution can be integrated directly from (12.46) as: 
yolt)=—5r? +ar+b (12.48) 
Using the initial conditions, we have 
y(t) : e+r=r(l- = (12.49) 


Substitution of (12.49) into (12.47) and integration gives 
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1 41-3 
T)=-—-t +t +att+b 12.50 
Os aa : (12.50) 
Using the initial conditions given in (12.47), we get 
1 3 1 
t)=—t U--t 12.51 
y(7) 3 ( 4 ) (12.51) 


Finally, we obtain a two-term asymptotic solution as 


1 1 3 1 
~ — os 12.52 
y(t) ~ cr 5 T)+ : ev (1 ; T)+ (12.52) 


The first term in (12.52) is the approximation for the case of a uniform 
gravitational field. This solution is only valid for 
O<T<t, (12.53) 


where 7, is the time for the projectile to come back to the surface of the Earth. 


12.3.2 Projectile Problem with Air Resistance 


In this section, we extend the projectile problem to include the effect of air 
resistance approximately. The equation of motion of the projectile becomes: 


d°x _ gR* k dx 


(12.54) 
dt? = (R+x)_ (R+x) dt 
Using the same normalized time and distance defined in (12.39), we have 
2 
g y = : —— ay (12.55) 
dt (it+eyy (U+t+ey)dr 
where 
ga (12.56) 
gR 


The definition of ¢ given in (12.42) suggests that a is a function of ¢. However, to 
simplify our analysis, we assume that a is a constant. The following asymptotic 
series is assumed 

y~yo(t) +e y,(c) +... (12.57) 
Substitution of (12.57) into (12.55) gives 
1 


{l+elyq(t) +e%y, (2) +...]P? 


yo (t) +E? y" (1) +..= 


(12.58) 
a 


1+ elyo(z) +E%y, (7) +..] 
Using Taylor series expansion, we have 


Ly (t) + 6% y (2) +...) 


yo (rt) +e y(t) +... = (1+ ayy ) + (2%) + aVy My EEF ay, +... (12.59) 
The corresponding initial conditions become 
(0) = yo(0) +€y,(0) +... = 0 (12.60) 


y'(0) = yp (0) + Ey, (0) +... =1 (12.61) 
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Collecting the constant and linear order terms of ¢, we get 
O11): yo +ayq =-1, 
yo(0)=0, yo (0)=1 
Oe): yy tay, =y(2+ ay), 


y,(0)=0, »(0)=0 
For the first order solution, we assume an exponential solution form for the 
homogeneous form as 


(12.62) 


(12.63) 


Yon(t) =e" (12.64) 
The corresponding characteristics equation of the homogeneous form of (12.62) 
becomes 


r(r+a)=0 (12.65) 
The homogeneous solution becomes 
Von = At Be (12.66) 


Since the constant is a homogeneous solution (i.e., matching a single characteristic 
root), we assume the following particular solution form (see Section 3.3.3) 
Yop (t) =Cr (12.67) 


Substitution of (12.67) into the original ODE in (12.62) gives the particular 
solution and combining the homogeneous solution with the particular solution gives 


yy = A+ Be ae (12.68) 
a 

Substitution of (12.68) into the initial conditions given in (12.62) leads to 

A332 06. 5 (12.69) 
a a 
Finally, we have the first order of approximation as 
jee ‘yf ent ig (12.70) 

a a a 

Substitution of (12.70) into the right hand side of (12.63) gives 


ci 1. (ita) _ l+ay’ _ [4 _ 
y tay, =—(1+—)+4 ( 2), ar _f 2) geen ire) te (12.71) 
a a a a a a 


The homogeneous solution is again 
Vin =C+ De (12.72) 
Using the method of undetermined coefficient, we assume the particular solution as 
Vp =Ert+ Fre +Ger™ +H? + Ire (12.73) 
Differentiation of (12.73) gives 
Wp =E+ Fe“ +(21-Fa)re™ -2aGe™ +2Ht-alr’e™ (12.74) 
The second derivative of (12.73) gives 
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Vp =20-Faje +(a°F -4al)re™ +40°Ge™ +2H +07 Ire 


(12.75) 
Combining (12.74) and (12.75) gives 


Vp +p =Ql-Faje™ -2alte +2H +aE +2a°Ge°™ +2Har 


(12.76) 
Equating (12.71) and (12.76) gives 
1. 1 (l+a) 
2H+aE=—(1+—); 21-Fa=——,; 
=.= — (12.77) 
rT ead ogig=- OF) See 


a 
This provides a system of 5 equations for 5 unknowns, and the solution of the 
system gives 


ae ¢ (12.78) 
a 
gee 
re c (12.79) 
a 
2 
g--at® (12.80) 
2a‘ 
We “ss (12.81) 
a 
f= = (12.82) 


Finally, the general solution is 


2 2 
2+a l-a@ s l+a)y — 1 l+a = 
T4 Te QT ( ) e 2a oa re aT 


yy, =C+De™ 4 


a we 2a4 20° 2a 
(12.83) 
Substitution of these into the initial conditions leads to 
2 
C+D= a (12.84) 
2a 
E+F=aD+2aG (12.85) 
Solving for C and D, we have 
2 —* = 
Qe ia gil (12.86) 
2a 
. = (12.87) 
a 


Finally, we obtain the following second order solution 
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wo -T-4a 4430 4. Fa. le _. U4+ay 2y 
= yee ee ee 
2a a a a 2a (12.88) 
1 2 l+a 2 -ar 
~~ at 5) Te 
2a 2a 
Combining (12.70) and (12.88), we obtain the approximation 


Jy 


*-7-4da 44+3a or. 2+a 

7 7 e + —Jzt 
: ae a a (12.89) 
_ Ita) _ 1 l+a@ 4. _ 
+ 5 te aT ( z e 2at 5 r+ ; re Oe) 

a 2a 2a 2a 

This solution appears quite different from the case of no air resistance obtained in 
(12.52). It is not obvious that for « — 0 we can recover (12.52). To show this, for 
the case that the air resistance is small (i.e., a— 0), we can expand the exponential 
functions as a Taylor series expansion as: 


| ierrewene | ae a 
yn {—O+—)il-e™ |=—t} +t 
a @ a 


2 3 4 
Pegs OO (12.90) 
2 6 24 
2 3 4 
e 2% =] de — ee ats: (12.91) 


Substitution of (12.90) into (12.70), we get 
2 
Vo =1-5(1+a)+0(@"r") (12.92) 


On the other hand, substituting (12.90) and (12.91) into (12.88) and collecting 
terms of the same order, we find that coefficients for each order are: 


a =7 da 4430 (+a)? _ 


O(1): 0 (12.93) 
2a4 at 2a4 
4 2 l-a’ (+a) 
Oe. eg (12.94) 
a a a a 
44 l=o* “(ey  2..1 
i oes ee, (298 
2a a a 2a 2a 
4D 2 
(2°): 4+3a l-a Altay 1l+a_ 1 (+a) (12.96) 
6a 2a 3a 2a 6 
44 lan” Gsey .. 14 1 
DG). 2 SE OO ge oF 6 ay ae) (12.97) 
24 6 3 4 24 
Using these results, (12.88) finally becomes 
1 1 
y= gata — 5g 24 Tat 4a’ yet +O(aer”) (12.98) 


Finally, for ¢ > 0 and a > 0, we have the following asymptotic series solution: 


2 
y-t Ft a)+ + ar s+ Ta+4a?)c} +. (12.99) 
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In this asymptotic form, we can set a = 0 to recover the solution obtained in the last 
section for no air resistance. 

To illustrate the effect of a non-uniform gravitational field and the effect of 
air resistance, Figure 12.2 plots the projectile flying height versus normalized time 
for yo, yotéy1, and yo(a)t+Evi(a) (with ¢ = 0.1 and a =0.1). We can see that the 
nonlinear effect due to € increases the maximum reached height and flight time 
(dotted line), whereas the effect due to air resistance reduces the flying height as 
well as the flight time (dashed line). This example precisely illustrates that 
asymptotic solutions can provide qualitative behavior of certain parameters in the 
problem that numerical simulations cannot provide. 
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Figure 12.2 Effects of non-uniform gravitational field and air resistance on projectile maximum 
reached height and flight time for a = 0.1 and ¢ = 0.1 


12.4 METHOD OF MATCHED ASYMPTOTIC EXPANSIONS (MMAE) 


In the last section, we have introduced the idea for regular perturbation analysis. 
However, it was discovered that for some problems the regular perturbation method 
breaks down. For such problems, there is typically a rapid change of the solution at 
one of the boundaries. This development is strongly associated with fluid 
mechanics. Prandtl in 1905 developed the theory of boundary layer for fluid 
mechanics. It was discovered that no matter how small the viscosity, there exists a 
narrow layer of fluid near the surface of a solid. The fluid within the boundary 
layer is different from the flow around the body, such that there is a rapid change of 
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the fluid field from the outer field to the inner boundary layer. The idea of a 
boundary layer for fluid mechanics was so influential that Prandtl was nominated 
for a Nobel Prize in physics (see Appendix B). It was Friedrichs in 1941 who fully 
developed the boundary layer problem systematically. In this technique, an 
asymptotic expansion is developed for the inner layer whereas another asymptotic 
expansion is developed for the outer region. The two different expansions are 
matched at the boundary region where both asymptotic expansions are valid. If a 
single expansion is used, the solution is found to be singular at the boundary point. 
Therefore, different names have been used for this perturbation technique, 
including the singular perturbation method (a term coined by Friedrichs and 
Wasow in 1946), boundary layer analysis (apparently motivated by Prandtl’s 
work), or method of matched asymptotic expansions (MMAE) (a term coined by 
van Dyke in 1964). Note that van Dyke was a PhD student of Lagerstrom at 
Caltech, and Wasow was a PhD student of Friedrichs at New York University. This 
method is powerful but is not straightforward, and we will only illustrate the idea of 
MMAE in this section. The complication that the inner and outer expansions are 
expressed in terms of different variables indeed suggests the more sophisticated 
multiple scale procedure (which will be introduced briefly in the next section). Full 
discussions of MMAE are found in more technical books on the subject by Holmes 
(1995), Bender and Orszag (1978), Lin and Segal (1988), Kevorkian and Cole 
(1981), and O’Malley (2014). Application of MMAE to shear crack in elastic 
diffusive solids was done by Rudnicki (1991). 
Let us consider a simple problem of 


ee: u(0) =1 (12.100) 
dx 


where ¢—> 0. This is a linear first order ODE, and the solution can be found as 
u(x,é)=e"" (12.101) 
Figure 12.3 plots the solution given in (12.101) as a function of various values of s. 
Note that a small parameter ¢ multiplies the higher derivative term in the 
differential equation in (12.100). This is actually a common feature for all 
problems that contain a boundary layer. 
For a fixed x # 0, we have the following limit as > 0 
_ u(x,é)=0 (12.102) 
E> 


For a fixed ¢ # 0, we have the following limit as x > 0 
lim u(x, é)=1 (12.103) 
x-> 


As demonstrated in Figure 12.3, for small ¢ the solution of u ~ 0 suddenly changes 
tou = 1 asx — 0. Thus, we find that 
lim lim u(x, €) # lim lim u(x, €) (12.104) 
x>0é>0 é>0x>0 
Therefore, x = 0 is considered a singular point of u for small ¢. There is a very 
narrow boundary of solution near the singular point at x = 0. This suggests the use 
of the term “singular perturbation method,” but we will stick to the name of 
MMAE in this section. The existence of a narrow boundary suggests the use of 
“boundary layer analysis.” 
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Figure 12.3 Illustration of boundary layer type solution in (12.101) for ¢ = 0.05, 0.2, 0.6 and 1 
We will summarize the procedure of MMAE as follows: 
(i) Outer expansion 


We assume that an asymptotic expansion outside the boundary layer can be 
established as 


u(x.) = Yu, (0) fy (2.8) (12.105) 
n=0 
as ¢ > 0. This solution is not uniformly valid in x as ¢ — 0, and it is definitely not 
valid at x = 0. 


(ii) Stretching transformation 


We want to stretch out the neighborhood of the singular point. In particular, we 
assume a new variable for the boundary layer near the singular point: 


ao (12.106) 
HE) 
where the scaling parameter ¢ is subject to the following condition as s > 0: 
g(0) =0 (12.107) 


In the problem above, we have ¢ = € or €=x/e. We can see that the solution given 
in (12.101) is uniformly valid in € as ¢ > 0. There are two main properties for this 
stretching coordinate €: 


x#0, E>0, E50 (12.108) 
x=0, €=0, foralle (12.109) 
In other words, we want to stretch the boundary layer to infinity. 
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(iii) Inner expansion 


The inner expansion for the solution is expressed as 

u(x, é) =u((é),€) =U(S,€) (12.110) 
We assume the solution in the boundary layer can be expressed in another 
asymptotic sequence as: 


UE)= LU, O81 E62) (12.111) 
n=0 
This solution is valid within the boundary layer but is not uniformly valid outside 
the boundary layer. 


(iv) The overlap interval 


Next, we assume that there is an overlap interval of the inner and outer expansions. 
That is, the overlap interval is defined as 


Xo (€) <x <x;(E) (12.112) 
In particular, we have assumed that 
(i) Xo(é)<x/(€), € 0 (12.113) 
(ii) Xo(€),x;(€) 70, ¢€>0 (12.114) 
(iii) TO) Fe) es px g (12.115) 
We) PE) 


The existence of such an overlap interval cannot be proved analytically. Its 
existence depends on the problem itself. The presumed existence of the overlap 
interval is also known as Kaplun’s hypothesis. Kaplun was a polish-born PhD 
student of Lagerstrom at Caltech in 1950s. 


Yy Overlap interval 


Outer expansion 


Inner expansion 


Figure 12.4 Overlap interval for the inner and outer expansions 
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(v) The matching condition 


To match the two expansions, we expand the outer expansion in (12.105) in terms 
of the inner variable and the inner asymptotic sequences: 


u(é,2)= Du, (Ee) f, (EP(E),€) (12.116) 
n=0 
More precisely, we expand it as 
u(é,2)= )o4, (6g, (6.2) (12.117) 


n=0 
Within the overlap interval, we expect that the difference of the inner and outer 
expansions approaches zero as ¢ > 0 and & > o. This leads to the following 
matching conditions: 
doit (e)-U G8 n=0,1,2,... (12.118) 
Ss) 


This condition should not depend on the choice of the scaling parameter $(é). 
(vi) The composite expansion 


To find a composite expansion valid for both inner and outer layers, we can add 
(12.105) and (12.111) and subtract (12.117): 


U(Xx,é) = f U,(%) fy (08) +[U,(—) -#, (Dg, ( of 
> He) re Ke é) 

The result in (12.119) is the sum of the two expansions subtracting the overlapping 

term within the common interval. We will illustrate this procedure by the following 

example. 


(12.119) 


Example 12.1 Use MMAE to find an asymptotic composite expansion which is 
uniformly valid throughout the interval as e— 0: 

éy"+2y'+y=0, 0<x<l (12.120) 

y(0O)=0, yA=l, O<e&l (12.121) 


where ¢ > 0. Assume that a boundary layer of the solution locates at x = 0. 
Solution: Again, the highest derivative term is multiplied by a small parameter «. 


Therefore, we expect a boundary layer, and we assume the boundary layer is at x = 
0. To find the outer expansion, we assume that 


ey"=0 (12.122) 
Thus, the outer solution can be determined by solving the following ODE 
2¥o+¥o =9, yYoM)=1 (12.123) 


Since this is the outer solution, we should not expect the boundary condition to be 
satisfied at x = 0. Instead, we only need to enforce the boundary condition at x = 1. 
The solution for the first order ODE given in (12.123) can be found easily as: 
1 
>(I-x) 
Yo(x) =e? (12.124) 
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A stretching transformation is assumed as: 
Xx 


=—_ (12.125) 
WE) 
Substituting this change of variables into (12.120), we get 
2 
aay a (12.126) 


+ + 
g dé’ pds 
If the stretching is appropriate, we expect that the coefficient for all three terms are 
comparable. In particular, we want to balance any pair of the following 
coefficients: 
E 
e 
The first coefficient must be used in the pair balance since the highest derivative 
always controls the main behavior of the solution. Pairing the first and third terms 
of (12.127), we get = !”. As & — 0, the second coefficient goes to infinity. 
Therefore, this choice is not correct. Pairing the first and second terms of (12.127), 
we get d = , and the third coefficient is 1. Thus, we must have = s. Using this 
information, (12.126) becomes 


» fl (12.127) 


S/N 


2 
g v2 ey=0 (12.128) 
dg dg 
Thus, the inner expansion can be found by 
2 
d 
d 71 4221-0, y,(0)=0 (12.129) 
dé dé 
This ODE can be solved easily to get 
y,(€)=C(-e**) (12.130) 
To do the matching, we assume the intermediate region as 
xz OO(e)] (12.131) 
where © satisfies the following conditions: 
fine =e, lim @ =0 (12.132) 
é>0 @ 60 
We introduce a new variable 1 in the intermediate region such that: 
x 7® 
=—, €=— 12.133 
ia S F ( ) 
As ¢ — 0, the matching condition for a fixed 7 is 
Yo)|,-ne = Si (ey (12.134) 
Substitution of (12.124) and (12.130) into (12.134) gives 
1 _ 270 
>(1-7®) ¢ 
e =C(l-e y=C, 050,650 (12.135) 


Therefore, we get 
C=el? (12.136) 
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With the constant found in (12.136), we have the inner expansion in (12.130) 
becoming: 


yr (€) =e"? (1-e*) 


Finally, the composite expansion can be found as 


Yel) * Yo(%) +94 (G)~ lim yo 78) 


neice ae 
=e? +e! (1l-e # )-el”” 


After simplification, we get the final expansion as 


Ve (x) ~ el? ee = eu) 


Actually, the exact solution for this case can be obtained as 


where 


-ltJl-eé 
é 


Mo = 


(12.137) 


(12.138) 


(12.139) 


(12.140) 


(12.141) 


Figure 12.5 compares the results of MMAE with the exact solution for the case of ¢ 
= 0.05. The inner solution is accurate near the boundary at x = 0, whereas the outer 
solution is good outside the boundary layer. The composite expansion and the exact 
solution are basically indistinguishable from one and other. This example illustrates 
that when the coefficient of the highest derivative term is multiplied by a small 
parameter (in this case g), there exists a boundary layer and MMAE can be used to 
provide a composite expansion which is accurate for the whole domain (in the case 


O0<x<1). 
2 
Lf SE ee 
1.5 
1 , 
Exact solution 
sciczes Composite expansion 
0.5 —"7""  Quter expansion 
Inner expansion 
0 


0 0.1 0.2 0.3 0.4 0.5 


Figure 12.5 Comparison of MMAE solution with exact solution for ¢ = 0.05 
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12.5 MULTIPLE SCALE PERTURBATION 


In this section, we will consider the cases that a single small scale in asymptotic 
perturbation analysis is not enough to give an accurate approximation. In such 
cases, multiple scales are needed in the perturbation analysis. This situation arises 
in singular perturbation problems discussed in the last section. The multiple-scale 
perturbation analysis constructs uniformly valid approximations to the solutions of 
perturbation problems, both for small as well as large values of the independent 
variables. This is done by introducing fast-scale and slow-scale variables (can be 
multiple scales) and treating them as independent. In fact, both MMAE and multi- 
scale perturbation can be used to solve singular perturbation problems. 

The two-time scales analysis was originated from the Poincaré-Lindstedt 
method of solving the nonlinear Duffing equation related to three-body problem in 
astronomy. The idea of multi-scale analysis was originated by Lindstedt in 1883 
and studied in detail by Poincaré in 1889. In 1889, Poincaré won the King Oscar II 
Prize, which was set up to celebrate the king’s 60th birthday, for his work on the 
three-body problem. There was an error in his prize submission and the corrected 
version was published in 1890. The judging panels consisted of Mittag-Leffler, 
Hermite, and Weierstrass. According to O’ Malley (2014), this multi-scale method 
has been repeatedly rediscovered by many mathematicians and scientists in 
different disciplines (including physics, engineering and applied mathematics), 
using different names. Full details are found in O’Malley (2014) who compiled a 
lot of results in a reader-friendly manner, and is highly recommended. Half- 
jokingly, Nayfeh (1973) claimed in his celebrated book on perturbation analysis 
that “the method of multiple scales is so popular that it is being rediscovered just 
about every 6 months.” In view of the importance of such analysis, Mittag-Leffler 
tried to get a Nobel Prize for Poincaré from their initiation in 1901, but he was 
never successful. The father of Chinese rocket science, H.S. Tsien, called the multi- 
scale perturbation analysis the PLK-method (in honoring the contributions from 
Poincaré, Lighthill, and Kuo). Sir James Lighthill used the technique to study gas 
dynamics in 1949 while Kuo studied incompressible air flow over a thin plate in 
1953 by combining both boundary layer and multi-scale analysis. Tsien was a 
student of von Karman at Caltech, but he got interested in the multi-scale analysis 
through his interactions with Erdelyi at Caltech, H.Y. Kuo at Cornell, and C.C. Lin 
at MIT (both Kuo and Lin were students of von Karman). Tsien was under house 
arrest for 5 years in California in 1950 before he returned to China and became the 
father of rocket science in China. Nayfeh (1973) called it the derivative expansion 
method. Nayfeh was a PhD student of van Dyke at Stanford University. In their 
studies of nonlinear interactions of wave trains, D. J. Benney and his MIT students 
(Newel, Ablowitz, Haberman, Lange, and Luke) independently developed the 
multi-scale methods and it was referred to as the method of slow variations by 
Haberman. Many of his students became experts in soliton theory and professors in 
applied mathematics. Professor Benney was born in New Zealand (N.Z.) and got 
his PhD under C.C. Lin at MIT and remained there for the rest of his career. He 
was a hilarious and devoted teacher at classroom and an excellent mentor for his 
PhD student supervision. He was recognized as an extremely modest and humble 
person. There is a lovely story about his character. When Benney was a student, he 
used to work as a gardener at N.Z. Government House during summer holidays. 
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The Governor General, Sir Freyberg, sometimes would walk around in the garden 
and talked to Benney about tomatoes, etc. In the meantime, Benney was nominated 
for a prestigious scholarship and Freyberg was one of the judge panel members. On 
the day of the interview, Benney worked at the Government House garden in the 
morning as usual before he went for the interview in the afternoon. When the 
interview started, Freyberg asked: ““Haven’t I met you? You look familiar!” Benney 
however never revealed in the interview that he was his gardener! 

In this section, we will consider a case that the exact solution can be found 
and we will demonstrate that the regular perturbation of a single scale fails. Then, 
we will extend the perturbation analysis to multiple scale analysis. Let us consider 
multiple scale perturbation through the following example: 

u"+2eu'+u=0, u(0)=0, u'(0)=1 (12.142) 
Mathematically, this is the same as the dynamic equation of a lump mass oscillator 
subject to initial velocity, but the damping term is very small as s > 0. 
By regular perturbation, we assume the following asymptotic expansion for u: 


u(t,€) =Uug(t)+u, (e+ = Me +i. (12.143) 


To get the first order approximation, we can substitute (12.143) into (12.142) and 
set € = 0 to form the ODE for the constant order: 

ug +uy =0, uo(0)=0, up (0)=1 (12.144) 
Mathematically, this is the same as the dynamic equation of a lump mass oscillator 
subject to initial velocity, but the damping term is very small as e > 0. The exact 
solution can be found easily as: 

Up (t) =sint (12.145) 

For the first order of ¢, we substitute (12.143) into (12.142), and differentiate the 
result with respect to ¢ to get 


u"+2u'+2eu'’+u=0, u,(0)=0, u,(0)=0 (12.146) 
where 
u= oe (12.147) 
0€ 


More explicitly, we get 
[uy +use +...J+2[up +uje +...]+2e[uy tuset+...J+[u,tue+..J=0 (12.148) 
Setting ¢ = 0, we obtain 


Ul +, =—2uy, (12.149) 
The solution can be found as 
u(t) ¥ —tsint (12.150) 
Adding the first two terms, we finally get the asymptotic expansion as 
u(t) ¥ sint —(tsint)e (12.151) 


We see that for tf > 0, (12.151) gives u > o. This is not only unphysical but also 
incorrect. The exact solution of (12.142) can be found easily as: 
—ét a: i= 2 
u(t,e)=——=*— (12.152) 
l-e 
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This exact solution predicts u — 0 for ¢ > o. Therefore, the result of the regular 
perturbation method is not uniformly valid as ¢ + o«. The exponential function 
contains a time scale of 

T=Et (12.153) 
The time scale in the sine function is 


Vl—et xt (12.154) 


Strictly speaking, we can expand the sine function as 


sin J1—e2t = Vl—e714+ (1—27)"7# +... (12.155) 


More generally, we can have infinite time scales. For the time being, we are content 
with the two time scales given in (12.153) and (12.154). The slow time scale is 
given by t whereas the fast time scale is given by ¢. The asymptotic expansion 
becomes: 


u(t,e) =u°(t,r) +ul(t, rE + 5 te? +... (12.156) 


The asymptotic series now includes functions of both time scales ¢t and z Note that 
the superscript of wu’ (where i = 1,2,3,...) denotes the number in the asymptotic 
sequence. In contrast to a single time scale, the subscript is reserved for denoting 
differentiation. By the chain rule, we have 
du Ou OtOou Ou Ou 
=—+ =—+ 


é (12.157) 
dt ot ot Ot Ct OT 
Using this chain rule, we get 
' _ du _ 0 0 1 1 2 2 Ee 
u'(t,é)= a =u, +éu,+[u, +éu,Je+[u; reels (12.158) 


0 0 1 1 0 0 Dy a 2 1 1 4é 
u"(t,é) =u, +éu,, +[u, tu,,6 tu, tu, elet[u, tujet 2u,, +2u_é] me +... 


12.159 
Substituting (12.156) and (12.159) into (12.142) and setting ¢ = 0, we find 
uo+u°=0, u°(0,0)=0, u?(0,0)=1 (12.160) 

The solution for (12.160) is 
u° = A(r)cost+ B(r)sint (12.161) 


Note that A and B are not constants but functions of the slow time scale t. The 
boundary conditions in (12.160) become 

u°(0,0) = A(0) =0, u)’(0,0) = B(O) =1 (12.162) 
To find the functions A and B, we have to go to a higher order to find conditions 
that they satisfy. Substituting (12.156) and (12.159) into (12.142), differentiating 
the result with respect to ¢, and setting ¢ = 0, we get 


u, tui =—-2[u?. +ur] (12.163) 
u(0) =u'(0,0) =0, w'(0) =u} (0,0) +u2 (0,0) =0 (12.164) 

Substitution of (12.161) into (12.163) gives 
uw, tu’ =2[4, + A]sint —2[B, + B]cost (12.165) 
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The two bracket terms are known as “wanted” secular terms and we need to remove 
them. We learned from the undetermined coefficient method for solving 
nonhomogeneous ODE that the particular solution will contain the following terms: 


u' x tsint,tcost (12.166) 


No matter how small is ¢, the solution eventually becomes unbounded as t > o. 
Lin and Segal (1988) called them resonant terms. These secular terms appear in the 
analysis at each power of s. In fact, this boundedness requirement from the next 
order of problem forms the heart of multiple scale methods. 

Thus, we get two additional conditions for solving A and B: 


A,+A=0, B,+B=0 (12.167) 
The general solutions for (12.167) are 
A(t)=ae", B(t)=fye* (12.168) 
Imposing boundary conditions in (12.162), we obtain 
A(t) =0, B(r)=e% (12.169) 
Finally, we get the first order expansion as 
u-u°+O(e) =e sint (12.170) 


We now see that this asymptotic expansion indeed decays to zero as t > o. Figure 
12.6 plots the results of single and multiple scale perturbations given in (12.151) 
and (12.170) versus the analytical solution given in (12.152). We have used a 
relatively large value of ¢ = 0.2 in Figure 12.6 because for small values of ¢ the 
multiple-scale plot and exact solution essentially overlap. The single scale 
expansion oscillates with a much larger magnitude and extends beyond the range of 
our plotting area for large x. 


1 : 
Exact solution) 

0.8 ' i : ' 
Multiple scale expansion : i 

0.6 : : 


Single scale expansion 
1 1 


0.4 
0.2 
0 
-0.2 
-0.4 


-0.6 \: 


Figure 12.6 Single-scale and multiple-scale expansions with exact solution for ¢ = 0.2 
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12.6 LIOUVILLE-GREEN OR WKB EXPANSION 


In quantum mechanics, the WKB method was developed in 1926 and was named 
after Wentzel, Kramers, and Brillouin, who studied the solution of the Schrodinger 
equation. Some authors also referred to it as the WKBJ method (J stands for 
Jefferys). However, in mathematics the method and the idea can trace back to the 
celestial analysis by Carlini in 1817, and was also referred to as the method of 
Liouville and Green, both of whom published the method in 1837. Therefore, it is 
known as the L-G approximation. The WKB method can be applied to a singular 
perturbation problem (i.e., a small parameter multiplying the highest derivative 
term), but the problem must be linear. Review of the WKB method in solid 
mechanics is given by Steele (1976). In this sense, although WKB is easier than 
both MMAE and a multiple scales method, it is less powerful than either one of 
them. 
Let us consider the following linear ODE 
&°y"—q(x)y =0 (12.171) 

In order to get an approximation for the problem, we see that if g is a constant, we 
can solve the problem instantly as: 

y(x) = de®" 4 Bes! (12.172) 
This solution is the main idea behind the WKB method. It was assumed that an 
exponential function provides a good approximation to equation (12.171). To allow 
a more flexible form of exponential functions that can handle the non-constant 
function q(x) in (12.171), the following form is assumed in the WKB method: 


Vx) ~ eF8™ fy (x) +E (x) +...} (12.173) 
The exponential form in (12.173) constitutes the main assumption in the WKB 
method, and it focuses on the fast variation of the function (as ¢ — 0 the 
exponential function is the dominant term). In addition, the WKB method can be 
considered as a special case of multiple scale perturbation. 
Differentiation of (12.173) gives 


1 OF oye Ble% go ' 
yv=ze e: {yy té7y, +..$ +e 7 {yg +é7 yy, +. } 
‘ (12.174) 
_ Ole® ' ’ 
=e { I+ HON +) 
é 
" a Ole® J ' ' 
y"=—e é {Zot yi +0'n +} 
E é 
a 6g" 6’ 
+e { — Vp + Zr yb Osis oy tf (12.175) 
é é 


2 Ce ; 1 ” | ae LU 
= fl (2 Yo + a [2 Yo +20'¥6 +(0 Pn}e} 


Substitution of (12.174) and (12.175) into (12.171) gives 
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2. (6')” 1 " tact 12 

é Vo t+—] OB" yy +20’ +(O')° y, | +. 

| : ral : J (12.176) 
—(X){¥9 FEFY, +... =0 

In order to balance the order of the equations, we must have a = 1. Collecting the 

coefficients of different order and setting all of them to zero (in order to satisfy the 

ODE), we get the constant order term 

O(1): (0')” = q(x) (12.177) 

This is called the eikonal equation, and it is related to the studies of nonlinear 

waves and optics. The solution can be expressed as 


Ax) =+ } ” Ja(s)ds (12.178) 
The linear order term of € is 
O(e): "yy + 20'yp +(O') =D (12.179) 
This is called the transport equation. Using the result of (12.177), we have 
A" yy +20'yy =0 (12.180) 
This can be rearranged as 
pe 9 (12.181) 
Vo 20 20 
Integrating both sides we find the solution as 
C 


(12.182) 
where C is a constant. Substitution of (12.182) and (12.178) into (12.173) gives 


Te tent} (12.183) 
Since there are two values of 0 from (12.178), we obtain the following WKB 
approximation 


y(x) ~e 


lpr lp 
a (s)ds — (s)ds 
yx) ~ g(x) "* {Ae | toon aie (12.184) 


Example 12.2 Use the WKB method to find the approximation of the following 
ODE with non-constant coefficient 


Ey"te*y=0 (12.185) 
yO)=a, yA)=b (12.186) 

Solution: We see that 
q(x) =-e* (12.187) 


Thus, we find 


VqQ) =ie* (12.188) 


g(x) 4 = We Sle _ eo iFl4 Hl2 (12.189) 
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Substitution of (12.187) and (12.188) into (12.184) gives 

yee (6 he™ ane 3 (12.190) 
where A = 1/s. Note, however, that we start with an ODE with real variables and 
real parameters, thus our approximation must be real. Thus, A and B must be 


complex constant and conjugate to each other. Using this information, we can 
recast the approximation in (12.190) as 


y(x) ~ e*’? {Ccos(Ae*) + Dsin(Ae*)} (12.191) 


where C and D are now real constants. Substitution of (12.191) into the boundary 
conditions given in (12.186) gives 
y(0) =CcosA+Dsind=a (12.192) 


y(l) =e? {Ccos(Ae) + Dsin(Ae)} =b (12.193) 


This provides two equations for two unknowns. The solutions of the system can be 
found as 


_ asin(Ae) — be'? sind 


- (12.194) 

sin A(e-1) 

1/2 
A- A 

_ be os acos(Ae) (12.195) 

sin A(e-1) 

With these constants, the WKB approximation can be expressed as 
1/2 a | oe : x 

roe a ed sin A(e* —1)—asinA(e a (12.196) 


sin A(e-1) 


12.7 LAPLACE METHOD 


One of the main applications of asymptotic analysis is to estimate the value of an 
integral for certain particular limit of the parameters (i.e., one of the parameters 
approaches infinity). Since many problems of differential equations can be solved 
by integral transforms, as introduced in the last chapter, the final solutions of 
differential equations are often expressed in terms of some definite integral because 
an inverse transform (often expressed in the form of integration) cannot be obtained 
analytically in most cases. Very often numerical evaluations of these integrals are 
needed. Asymptotic analysis provides the dominant behavior when one of the 
parameters in the integrand is large. 


12.7.1 Erdelyi’s Derivation 


We first consider the following integral, known as the Laplace integral: 


f@)= ii g(tedt (12.197) 
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where x approaches infinity and /(¢) is real. Note for the special case that a = 0, B 
— o, and h(t) = —#, (12.197) is in fact the definition of the Laplace transform. If 
A(t) has a number of maxima, we can always break up the integral in a finite 
number of integrals in a way that h(t) reaches its maximum at one of the end-points 
and at no other points. For the case that the maximum is at f = a and x > , it can 
be shown by the Laplace method that the function on the left of (12.197) can be 
expressed as: 


1/2 
xh(@) 
aje 12.198 
f(x) > les g(a) ( ) 
where a is the maximum point of the function A(t) such that 
h'(a)=0, h'(a)<0 (12.199) 


Equation (12.199) illustrates that the main contribution from the integrand to the 
integration is from the maximum point of function h(?). 
To prove (12.198), Laplace introduced the following change of variables 


h(a) -— h(t) =u? (12.200) 
Thus, we have from (12.200) that 


fi (12.201) 
h'(t) 
The lower limit for u becomes 
t=a, >u=0 (12.202) 


Note that the main contribution of the integrand is from the neighborhood of the 
maximum point ¢ = a because we have h‘(t) < 0 for the range 


a<t<at+yn, 7-0 (12.203) 
Therefore, the integral can be approximated by replacing the upper limit B by a+n 
t=atn, =>u=[h(a)-h(a+n)}\? =U (12.204) 
That is, we have 
g(t) 


f(x) ~ [r ' (tye dt = -[" 20g Ph) u2}idu (12.205) 


For x +00, the following term changes very . ~ us 


erth(a)-w] (12.206) 
whereas, as n—> 0, g() does not change rapidly. Thus, we can set 
e(t) © g(a) (12.207) 
In addition, the following limit needs to be considered 
h\(t) 


i a ee (12.208) 
toah\t) tra h(t) t>a Quh"(t) 
At the limit t > a, we have the form 0/0. Applying L’H6pital’s rule, we obtain the 
second part of (12.208). Amazingly, note that the left hand side appears on the 
right of (12.208). The whole success of this method comes from this beautiful 
result. We can rearrange (12.208) as 
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: a) : 1 
lim[——]° = lim[-——— 12.209 
lim Gl = links) (12.209) 
Thus, we find the following limit 
uy _;-_1_ (12.210) 


lim[ 
toa h(t) 2h"(a) 
Substitution of (12.207) and (12.210) into (12.205) gives 


f(x) F[- 2 8 s(a)| exp[—xw? + xh(a)|du 


h" 
r (12.211) 
=[-— "7 g(ae™ | exp(—xu” du 
h"(@) 0 
If the only maximum is at a, we can well extend U to infinity and note the 
following Laplace integral: 
[exp )du =i (12.212) 
0 2\x 
With this result, (12.211) becomes 
2 
—1 
fx) ~|—2~]|  g(a)exp{xh(a)} (12.213) 
2xh"(a) 


This is the required result. 


12.7.2 Bleistein-Handelsman Derivation 


A slightly different proof of the Laplace method was given by Bleistein and 
Handelsman (1986). In particular, the following form of integral is considered 
(Bleistein and Handelsman, 1986) 


iaj\= (0 f(de*at (12.214) 


where A is real and approaches infinity and again both (7) and f(f) are real. Note 
that (12.214) is essentially the same as (12.197) except that a minus sign has been 
included explicitly in defining the index of the exponential function. Thus, we 
search for the minimum ¢(f), instead of the maximum as we did in the last section. 
Suppose that a minimum can be found such that 

P(t) =0, 0"(t)) >0 (12.215) 
In addition, we assume that the function (4) exists at the minimum point ¢ = fo. 
Similar to the argument used in the last section, we expect that the main 
contribution from the integrand comes from the minimum point fo. Thus, we define 
the following integral around this point as 


[yt+é 
e4H'0) 1, (A) = I f (De MOO-HoN a (12.216) 
Qo 7é 


where € is a small number. We expect the following limit holds 
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lim 1o(A) _y (12.217) 
Aso I(A) 
That is, when 2 — o, the integral defined in (12.216) approaches the original 
integral (12.214). To justify this assertion, Figure 12.4 shows that the area under 
the exponential function in (12.216) mainly comes from the neighborhood of f for 
large 2. This justifies the limit given in (12.217). 


expt-AlA(t) — A(t I 


small 2 


Figure 12.7 The behavior of the exponential function in (12.11) around the critical point 


Using Taylor series expansion and assuming that f(t) is not a highly oscillating 
function near fo, we have 


tyté 
eM) 7, (A) = Flt) e 
(ee 
Now, we can apply a change of variables 


r= fF ee). dr= fo otat (12.219) 


Applying (12.219) and (12.217), we find the following approximation: 


[A 

SP (ie 

T(A) = & 0) = Fs) i *” exp{-r2}dr +... (12.220) 
Ao (%) ~ 59" lode 

As seen in Figure 12.7, we can extend this integration from —oo to +oo without 

changing the integration (as long as A > 0): 


a 7 
SP" (to tty 
ane dt + (12.218) 
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re ae. x 
1(A) xe 7H) a f(t) [expt }ar+... (12.221) 


The integration in (12.221) can be carried out exactly by (see Chapter 1): 
| exp{-r}dr = Vr (12.222) 


1(A) = exp{-A@ll) (tp) Laas (12.223) 
0 


This result is essentially the same as (12.213), which is for the case where the 
critical point is on the limit of integration, whereas (12.223) is for the critical point 
within the upper and lower limits. Therefore, they differ by a factor of 2. Bleistein 
and Handelsman (1986) also derived the accuracy, or more precisely found the 
order of error of (12.223): 


1(A) = expt-Ablo)} Fo) 


Finally, we get 


2, o(exP-AGto)! 
Ad"(to) FED 

We refer the reader to the details of the analysis leading to this result in Chapter 5 
of Bleistein and Handelsman (1986). 


) (12.224) 


12.7.3 Generalized Formula 


The previous result presented in (12.213) and (12.223) makes two major 
assumptions that pose limitations on the result in (12.213) or (12.223). Let us recap 
here. Consider a Laplace integral of the following form: 


b 
i= | (teat (12.225) 
for A + oo. In the last section, the two major assumptions are made: 


g(a)#0, h(a)=0 (12.226) 


where a is the maximum boundary point. We will generalize this result to the cases 
that 


g(a)=g'(a)=...= 2g" (a) =0, g(a) 40 (12.227) 
h(a) =h'(a) =...=h" (a) =0, A ?(a) #0 (12.228) 
where m and n are integers. To do this, let us assume a much more general form of 


functions g and hf, and then specialize them to (12.227) and (12.228). In particular, 
we assume for A — 00, the asymptotic forms of g(t) and A(t) as 


g(t)~ A(t—a)*, a>-l (12.229) 
h(t) ~ h(a) - B(t-a)? (12.230) 
Substitution of (12.229) and (12.230) into (12.225) gives 


I(A) Ae | “(t-a)te 7B ay (12.231) 
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Using previous arguments, we can extend the upper limit to infinity. Applying a 
change of variables of s = t—a, we obtain 


1(A) = 4e% } * ste 78 as (12.232) 
0 
Let a new variable u such that 
u = ABs? (12.233) 
Then, we have 
du = BABs®"ds, s% = Cre (12.234) 
Using (12.234), we can rewrite the integral in a 232) 
I(A) = Ae ("C ( “be —u Pies (12.235) 
s 
Using (12.234) and rearranging (12. . we have 
Ae Ah(a) 
i= [, yatd/BA o-H gy 
BABY YF Jo 
se*) ee (12.236) 


) 


= Tr 
BABY eB ( B 


where /"is Euler’s gamma function. If @ and B are integers and the coefficients A 
and B are those from Taylor series expansions, we have 


a=m, B=n (12.237) 
(m) (n) 
{2 pF) (12.238) 
m! n! 


This special case agrees with the conditions imposed in (12.227) and (12.228). 
Thus, we have the asymptotic value for the integral (12.225) subject to the 
condition of (12.227) and (12.228): 


g(a) eh) m+l 


1(a) [- ymin & ——) (12.239) 


A (a ) ml pglarrD/n 
If m = 0 (i.e., function g(t) is regular at t = a), we have the special case: 
nam sae" _ 1 


1(A) =[ ] T(-) (12.240) 
A (a) nan n 
If m = 0 and n = 1, we have the special case: 
of Ah(a) 
1(a)=- g(ae aa (12.241) 
Ah'(a) 


If m = 0 and n = 2, we have the a case: 


(A) | Sane Dan (ay oD xP aM) (12.242) 


Of course, (12.242) is the same as (12.198) that we obtained in the last section. 
Therefore, (12.239) is a very general result. 
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Example 12.3 Find the asymptotic form of Legendre polynomials for large order, 
which is defined in integral form as follow: 


P(z)= = [eave —1cos0)"d0 (12.243) 


forz>1landn—o., 


Solution: We look for the asymptotic behavior of the Legendre function for a fixed 
z(> 1) and for n > o. First, we note the following formula: 


7 =e%™! (12.244) 
Using this idea, we can recast (12.243) in the required form: 
P.(z) = = [shen eaag (12.245) 
a Jo 


Comparing (12.245) and (12.225), we have 


g(O)=1, h(@)=Ing(6), g(@)=z+Vz —lcosé (12.246) 
Clearly, the maximum occurs at 0 = 0 because of the cosine function. The 
differentiation of g gives 
q'(0) =-Nz -1sin (12.247) 
For 0 < @ < a, we have g(@ < 0 and, thus, we do have a maximum and the 
maximum is at the lower limit of integration. We now expand the cosine function 
about 0=0: 


oe 
cose (12.248) 


Substitution of (12.248) into (12.245) leads to 
P@=—|"e ellnlz zal? (1 PIM gy (12.249) 


To proceed further, we employ the following mathematical trick 


P,(2)=— J" expfnin{(z + V2" 1(1 a pe (12.250) 
Z+VNZ — 


Then, we have 


1 Vz" -167 
P(z)=—| exp4n{In(z+ vz? -1)+ nd -_—-—~_——)] do 
as f “~ men lF=1) e 


(12.251) 
exp{nIn(z+ Vz? -1)} (7 v2 -10" | -10° 
= I exp, In(1— 
it Azt yz? - Ay 
Note the following series expansion 
In(l-x) ~—x+... (12.252) 


Then, (12.251) becomes 
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2 _— a 
opine Nz 1)} fi ent 


n —10° 
\d@ (12.253) 
Aztvz? -1) 


Then, we can apply the following change of variables 


2 nvz? —1 


yw? =——~"___¢” (12.254) 


Az+V z= 1) 
After change of variables, we get 


1/2 
Ere —1)" aoe -1) an exp{-u7}du (12.255) 


F,(2) = yi4 


F,(@) = 


Note that we have shifted the upper ae to —_ as the main contribution is from 
the lower limit. From (1.102) of Chapter 1, we get 


1 (z+ |? =i" 


P(z)® re (onl (12.256) 
Example 12.4 Find the asymptotic form of the following integral 
(x)= [oe Pa (12.257) 
for x > 0, 
Solution: For this case, we let 
A(t) =(¢ +3) (12.258) 


We look for the minimum of /(f) which will have maximum contribution to the 
integral. Then, we can set 


h ‘@=1-==0 (12.259) 
r 
Thus, we have the minimum point at 
t =(2x)!3 (12.260) 
Guided by this observation, we let 
t 
= (12.261) 
(2x) 
Substitution of (12.261) into (12.257) leads to 
T(x) = (2x)3 ("eCPM gg (12.262) 
0 


This change of variables allows us to convert the integral in (12.257) to the 
standard Laplace type integral as x — oo. Comparing (12.262) with (12.214), we 
identify that 


1 
=s+— 12.263 
#(s) $455 ( ) 
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We can easily find that 
1 3 3 
$(s)=1- a ¢(1)=0, hd)= a o"(s) = oe PI) =3 (12.264) 


The minimum is at s = 1. Expanding ¢ in Taylor series arson about s = 1, we 
have 

3 4, = Ds. 
p(s) = = ; 


Using the result in (12.224), we oe 


I(x) ~ 2(2x)'? exp {— >(2s x)'3} te u (12.266) 


Note that we have added an extra 2 because the minimum point (s = 1) is not at the 
lower limit of integration of s = 0. Simplifying (12.266), we get 


ia Een" exp{-52x)!9} (12.267) 


(12.265) 


12.7.4 Laplace Type Integrals in Higher Dimensions 


So far, we have restricted our discussion to integrals with one variable. In this 
section, we will consider a multiple integral of functions depending on multi- 
variables. In general, consider the case of n variables: 


1B) =| ..[ OG xp. PL Or desde, (12.268) 
To find the critical point or minimum point for the above integral, we set 
V- f(a) =0 (12.269) 
More explicitly, we have 
CU eee ee (12.270) 
Ox Xx, 
where 
a=(q,...,4, (12.271) 


The critical point or stationary point a is said to be non-degenerate if the Hessian 
matrix is nonzero and, in particular, larger than zero for the present case of the 
minimum point: 

oy 
Ox;OX;; 
Assuming that @ is regular at point a and applying Taylor series expansion for the 
vector form, we get 


det |H(a)| = det 


>0 (12.272) 


BUF (Qos a.) (e=a)" Ha) 


I(B) © P(a,,4),... sda) fe A dx (12.273) 


Introducing a new vector y = x—a, we obtain 
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By 
1(B) = p(aye 9 ( 2 } e 2 ay (12.274) 


According to a well-known theorem in matrix algebra, since the Hessian is real and 
symmetric (see definition in (12.272)) we can also decompose it as (see p. 288 of 
Lipschutz, 1987) 

P’DP=H (12.275) 
where P is the orthogonal matrix and its column is composed of eigenvectors of the 
Hessian matrix. Equation (12.275) is known as the spectral decomposition of 
Hessian matrix H. Because of the orthogonal properties of eigenvectors, D is a 
diagonal matrix with eigenvalues of H as the diagonal terms: 


4 0 0 
D=|\3 “2 3 (12.276) 
0 0 A, 
Thus, (12.274) can be expressed as 
tp, 
I(B) = plaje PE | a e2 dz (12.277) 
where 
z=Py (12.278) 


Equation (12.278) is also known as the shearing transformation. In obtaining 
(12.180), we note that since the matrix P is composed of unit eigenvectors, we must 
have the determinant of P be unity or: 


det |P| = det IE (12.279) 
Oz ; 
Note also that 
a’ Dz =Ayz; +...+A,22 (12.280) 
Thus, we have 
Pye af 5 2 
I(B) = eae" le 2 dz...) 20" a (12.281) 


For each integral in (12.281), we can apply the result from the last section and 
obtain 


1(B) = ~Bi(a)(2% nia A 12.282 
(f) = Y(a)e , a ( ) 


Note that the determinant of the Hessian equals the product of the eigenvalues or 
(see definition in (12.276)): 


nN 


TT: = det D=det H (12.283) 
i=l 
Substitution of (12.283) into (12.282) results in 


I(B) x “Bf(a)(2%yn2__ 12.284 
(B) = pa)je ( p> ray, ( ) 
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This result agrees with Equation (5.15) on p. 498 of Wong (2001). However, the 
present proof is somewhat simpler and does not need to introduce the Morse lemma 
in bifurcation theory. 


12.8 STEEPEST DESCENT OF RIEMANN 


The Laplace method that we discussed in the last section is for real functions / and 
g. When both of these functions are complex and analytic, the asymptotic analysis 
is called the method of steepest descent and was originated by Riemann and further 
developed by Debye. This original work by Riemann was never formally 
published. It was only published posthumously in 1876, and it is among the 
unpublished notes found in his house (or nachlass) that was subsequently included 
in his collected works. The method was first published by Debye in 1902 and he 
applied it to find an integral representation of Bessel functions for a large argument 
or order. In Debye’s (1909) paper, he did mention its appearance in Riemann’s 
Collected Works. Riemann is one of the most influential German mathematicians 
and he studied number theory, differential geometry, and complex analysis. His 
work on the complex zeta function related to prime numbers led to the unsolved 
Riemann hypothesis (see Problem 4.56 of Chapter 4 and Sabbagh, 2003), and his 
Riemann geometry is the basis for Einstein’s theory of relativity. The Riemann 
hypothesis is probably the most well-known unsolved mathematics problem after 
the celebrated proof of Fermat’s Last Theorem by Andrew Wiles (Singh, 1997). 
Debye is a Dutch physicist and was a student of Sommerfeld, and he studied the 
specific heat of solids (Debye model), ionic solutions (Debye potential and Debye- 
Hiickel method), and he received the Nobel Prize in chemistry in 1936. In fact, 
Debye’s undergraduate degree was in electrical engineering and his PhD was in 
physics under the supervision of Sommerfeld. Debye had been nominated for a 
Nobel Prize in physics as well. Friedrich Hund, his colleague at the University of 
Leipzig, described him as “clever but lazy” as he could often spot Debye watering 
roses in the institute garden when he should have been working. Debye responded 
that he had a certain tendency to take things easy. It was speculated that many great 
ideas behind his over 200 publications (one led to the Nobel Prize) were 
germinated in this outdoor garden. The steepest descent method is also known as 
the saddle point method. The method of steepest descent belongs to a special 
theory of contour integration. This idea of steepest descent has also been used in 
optimization problems with success, but such consideration is out of the scope of 
the present book. 


12.8.1 Direction of Steepest Descent 


When A(z) and g(z) are complex functions of a complex variable z, the integral in 
(12.225) is of Fourier type (compare the integral in the Fourier transform given in 
Section 11.2.2): 


iG\= I. e(z)e" dz (12.285) 
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where z = x+iy is a complex variable and integration is carried out along a certain 
contour C. The saddle point of the exponential function is found by setting 


w'(2)|,_., =0 (12.286) 


If the first (1—1)-th derivatives of w are also zeros, we call the saddle point is of 
order n. Mathematically, it is defined by: 


we. =w'2)|,_,, == WD (Z)| =0, w”(z)} #0 (12.287) 
2=2Z9 0 


If n = 2, the saddle is called a simple saddle point. If we plot the exponential 
function in (12.285) on the complex plane (with axes being x = Re[z] and y = 
Im[z]), the function will appear as mountains with valleys, cols, and saddles 
between them. The exponential function in (12.285) can be written as: 

ef") = e Re[w(z)] ,i4 In[w(z)] = Chl Y) piAvny) (12.288) 


The contour lines and constant phase lines of such a plot are represented 
respectively by: 

u(x,y) =Re[w(z)]=C,, v(x, y) = Im[w(z)] = C, (12.289) 
where C; and C2 are constants (see Fig 12.8). Note that the imaginary part in 
(12.288) will lead to oscillation in the functions but does not contribute to the 
integral, while the real part contributes mainly to the integration. The constant 
phase lines also represent the steepest descent or ascend paths. Physically, it is 
because the changes in the real part of the function is fastest (or the steepest 
descent or ascend along the contour) if the imaginary part is a constant. 
Mathematically, the change in w can be expressed as 


e Saddle point 


Figure 12.8 Contour lines and constant phase lines 
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low) =|du\” + ov)” (12.290) 
Along the lines of constant phase, we have dv = 0 or 
Im[w(z)] = Im[w(zy)] (12.291) 


Then, the change of du is maximal when (12.291) is satisfied. The method of 
steepest descent aims to deform the contour path of integration passing through the 
saddle point zo such that it coincides with the steepest path as far as possible (i.e., a 
path that (12.291) is satisfied). If (12.291) is satisfied, the integrand would not 
oscillate rapidly on the steepest descent path. Then, an approximate value of the 
integral will be determined from the integrand in the neighborhood of the saddle 
point. 

Alternatively, the equivalence between the constant phase lines and steepest 
descent path can be proved by using directional or vector differentiation. The 
gradients of the functions u and v are 


Vi ee, Wee, (12.292) 
ox * O ox ~ Oy 


The gradient Vu is perpendicular to the contour lines of wu = const. (dashed lines in 
Figure 12.8) and indicates the direction of maximum changes of u whereas Vv is 
perpendicular to the constant phase lines of v = const. (solid arrow lines). 
Emerging from the saddle point, the deepest descent line is denoted by D and the 
steepest ascent line is denoted by A. Since w is an analytic function, uw and v must 
satisfy the Cauchy-Riemann equations as (recall from Section 1.7.1): 


Ou Ov Ou_ Ov 


> (12.293) 
ox Oy oy Ox 
These two equations can be combined to give 
Ou ov, Ou _4 (12.294) 
Ox Ox Oy Oy 
In tensor notation, (12.294) is 
Vu-Vv=0 (12.295) 


This shows that the contour lines and constant phase lines are perpendicular. This 
also gives another proof that the constant phase lines are the steepest descent path 
(corresponding to the maximum change of w). 

Differentiating the first equation of (12.293) with respect to x and the second 
equation of (12.293) with respect to y and adding these two, we get 


Ou Ou _ 

ax? dy" 
Similarly, we can repeat this procedure with a reverse of the order of differentiation 
with respect to y and x for the first and second equations of (12.293) to yield 

2 2 

eg ee ~=V'v=0 (12.297) 

Ox” oy 
Thus, as expected, both uw and v are harmonic functions. From the maximum 
principle of potential theory from Chapter 9 (e.g., Sections 9.7.7 and 9.8), there is 
no maximum or minimum of wu within the domain, except on the boundary. The 


V7u=0 (12.296) 
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same conclusion can also be made in view of the maximum modulus principle of 
complex variable theory (e.g., p.146 of Silverman, 1974). 

We now consider the direction of the steepest descent path at the saddle 
point. Using the information on the derivative of w, we can express the change of 
function w at the saddle point as: 


1 d"w n 
Ow = W(z)— W(Zp) = as |z Zo| {l4 O(|z Z|) (12.298) 
! dz 
Using Euler’s formula to write the terms in (12.298) in polar form, we get 
oe ae, z—z, = pe” (12.299) 
dz" 


Clearly, we are looking for @ along the steepest descent path. Substitution of 
(12.299) into (12.298) yields 
ae“ n_inO 
aWw=———p e {1+ O(p)} (12.300) 
n! 
This can be rearranged as 


i(a@+n@) 
ow ~—— {1+0(p)} we “[cos(a +n0)+isin(a+n0)] (12.301) 
nt nN: 


n 


fe) 
Along the steepest descent path, we must have (12.291) satisfied or (12.301) must 
be real and, in addition, dw < 0. Thus, we have to set 


atn0d=(2p+)a, p=0,1,....n-1 (12.302) 
Therefore, the directions for the steepest descent path are 
b=- 24 0p, petil.cn=1 (12.303) 
n n 
Along the steepest ascent path, we must have dw > 0 instead, and this leads to 
at+nO=2pr, p=0,1,...,.n—-1 (12.304) 
The corresponding @ are 
ase”. p00... 8-1 (12.305) 


The direction for constant u (the divide between hills and valleys) is given by 
and = (p+), p=0,1,...,2n-1 (12.306) 


The directions 0 for the separation lines between hills and valleys are 


v= Pad, p=0,l,...,2n-1 (12.307) 
n 2 n 


The most important type of saddle point is for n = 2. The values of 0 for special 
cases are: 


Descent? Ga= 44, Fe (12.308) 
22> 2 2 


Ascent: a=", =See (12.309) 
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Constant u: 0 ie Lae 207. paveed (12.310) 
2 4 2 4 2 4 2 4 


The simple saddle for n = 2 is illustrated in Figure 12.9. 

A more detailed view of the directions of steepest descent and ascent are 
depicted in Figure 12.10. The steepest descent path is labeled as D and those for 
steepest ascent are labeled as A. The projections of hills, valleys and the horizontal 
plane cutting the saddle point are defined as: 


hills: u(x, vy) > u(xo, Vo) (12.311) 
valleys: u(x, y) <u(Xo, Yo) (12.312) 
plane: u(x, vy) =u(Xp, Vo) (12.313) 


Physically, along the steepest ascent (A in Figure 12.10) the real part of w(z) will 
diverge to infinity as the contour passes the saddle point (except for very special 
case). It means that the integral will diverge. Therefore, we must consider the 
steepest descent path indicated by D in Figure 12.10. 
12.8.2 Steepest Descent of Regular Saddle Point 


We now return to the asymptotic expansion of (12.285). We first rewrite it as 


I(A) = exp{Aw(zy)} I 8 exp Antz) — Am zp)}dz (12.314) 


u=u(X, y) 


u= U(X, Yo) 


Figure 12.9 A simple saddle with n = 2 
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The exponential function in the integrand is exactly the change of the function w(z) 
and thus is related to the steepest descent. Since the contour C is supposed to pass 
through the saddle point such that we have 


w(Z))=0, (12.315) 
hence, the Taylor series expansion for w(z) about the saddle point is 
1d? 
wWz)-wWzy))=——~| (zz) +. (12.316) 
2! dz? 
Z=Zy 
Substitution of (12.316) into (12.314) gives 
An 
I(A) = exp Aw(eo)3)_ (2) exp i w" (Zp (Zz - Zo) }az (12.317) 


We now follow the Euler form given in (12.299), then (12.317) can be expressed as 
I(A) = exp Aw(2y)3{_ 82) exp me p’ [cos(a +20) +isin(a +20)}dz (12.318) 


Figure 12.10 Orientations of steepest descent and ascent for saddle point with n = 2 


Applying the idea of steepest descent for contour C, we deform it to from —00 to 00 
or from 0 to «. The choice on the limits of integration depends on the nature of 
w(z). More discussion on this choice of contour to real integration will be given 
later in this section. As discussed earlier in the section, we can set 

at+nO=(2p+l)z, p=0,1L...,.n-1 (12.319) 
such that the imaginary term in (12.318) will be zero (i.e., constant phase lines). 
Thus, we have for the case of n = 2 
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I(A)= exp{Aw(zy)}f g(z)exp Spaz (12.320) 
From the second equation of (12.299), we have 
dz=e'’dp (12.321) 
Substitution of (12.321) into (12.320) gives 
i Aa 
1(A) = exp{dw(zo)}ezo)e” | expt-S p*}dp (12.322) 


In obtaining (12.322), we have assumed g(z) is a well-behaved function near the 
saddle point. Note that integration of the complex function is now converted to 
integration of the real variable. The integral is known as the Gauss or Laplace 
integral and can be evaluated as (see (1.102) of Chapter 1): 


o Aa 20 
[ext ~ p }dp= i (12.323) 


Substitution of (12.323) into (12.322) gives 


I(A)= = exp {Aw(zy)}2(zy er” (12.324) 
a 
From the first of (12.299), we find 
a= |w"(Zp)| (12.325) 


Substitution of (12.325) and (12.319) into (12.324) leads to the final result: 


(A) = ale exptamtay) +p +) 2-23} (12.326) 
w"(Zp)| non 


This result differs from equation (7.2.10) of Bleistein and Handelsman (1986) by a 
factor of two. Their result is based on the result in Chapter 4 of their book. 
However, if we look at equation (4.4.24) of Bleistein and Handelsman (1986), the 
formulation is for integration limits from 0 to 0. Therefore, if we divide our result 
by 2, we recover the result of Bleistein and Handelsman (1986). The derivation of 
the results of Bleistein and Handelsman (1986) is done through the consideration of 
the Mellin transform and involves a triple summation. The current approach is 
much simpler. As a final remark, we should emphasize that if the contour can be 
deformed to the following real limits 


[> [, -40. (12.327) 


we should use half of (12.326). If the contour can be deformed to the following real 
limits 


f de> [ ope, (12.328) 
Cc —00 


we should use (12.326). 
In summary, the method of steepest descent can be divided into 5 steps: 
(i) Identify all critical points, including saddle points. 
(i1) Determine the path of the steepest descent from these critical points. 
(iii) Justify the deformation of the original contour C onto one or more of the paths 
of steepest descent found in (ii). 
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(iv) Determine the asymptotic expansion of the integrals on the deformed contour 

such that the complex integral becomes a real Laplace type integral. 

(v) Sum the asymptotic expansions to give an approximation for the integral. 
Among these, Step (iii) is the most difficult, and the process of deforming the 

contour is a consequence of the Cauchy integral theorem. 


12.8.3 Steepest Descent of Higher Order Saddle Point 


The previous result presented in (12.326) makes two major assumptions that pose a 
limitation on the result in (12.326). Let us recap here. Recasting the Laplace 
integral slightly, we have: 


b 
1A) = g@explaw(jat (12.329) 
In the last section, the two major assumptions are: 
g(to)=regular, w'(z)) =0 (12.330) 


First we generalize the second condition in (12.330), and suppose now that the 
saddle point is not a simple saddle and the function w satisfies: 


w"'(Zy) = W"(Zp) =. = WwW" (29) =0, w(z9) #0 (12.331) 
where n is an integer (can be both even and odd). The analysis given in Section 
12.8.2 can easily be modified. In particular, we recall from (12.299) that: 


w")(z))= ae, z—z, = pe” (12.332) 
In addition, the integral (12.329) can be rewritten as 
b 
1(A) = exp{Aw(29)}[g(z)exp{ALw(2) — w(zp)]}dz (12.333) 
Taylor series expansion gives 
(1) no 
w(z)- w(zZ9) = us au (z-Z)" +..= —. gor) 
ms se (12.334) 


= <P feos(nd +a)+isin(nO+a@)] 
nt! 


By requiring that the imaginary part of (12.334) is a constant, we have 
aT a 


6 =(2k +1) , k=0,1....n-1 (12.335) 
In view of (12.335), we have — 
w(z)—w(Zp) =— ss (12.336) 
Substitution of (12.336) into (12.333) gives - 
(A) = exp {Aw(z9)} expliO) (9) ° exp MP dp (12.337) 


where g(z) is an analytic function at z = zo. If n is not an integer, the integral is not 
an even function. Thus, we cannot in general extend the limits of the integral to —co 
to oo. Now consider a change of variables: 
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Aa. 1/n 
(eas (12.338) 
With this change of variables, (12.337) can be written as 
I(A) = exp {Aw(zy) +10} 2(20 > =" " exp-S"}dp (12.339) 


Finally, the integral in (12.339) can be nae in terms of the gamma function 
through the following change of variables: 


s=€", ds=n0"'de (12.340) 
Using this change of variables, the integral in (12.339) becomes 
I 
ioe) 1 re —-l 1 1 
ie =[ exp{ odp=—[ fp? dp--TO) (12.341) 
0 no nN nN 


The last result is obtained by noting the definition of the gamma function given in 
Chapter 4. Back substitution of (12. ‘tau into (12.339) gives 


aca n} 
7) 


This result agrees with the result given by Bleistein and Handelsman (1986). 
Similarly, we can also generalize the first condition in (12.330) that g is not a 
regular function at z = zo: 


8 (Zo) = 8'(Zo) == 8" (Zo) = 0, (Zo) #0, (12.343) 
For this case, we expand g(z) in a Taylor series expansion as (since all lower 


derivatives vanish at z = zo) 
aed 6 ) m eM ) mim 
g(z)—g(Z) = — (2— 29)" +..=[-—— opr”, (12.344) 


Substituting (12.344) into (12.333) and in view of (12.343) and (12.344), we can 
express (12. on as 


1(A) ={2&) Jem” exp (Awtz9)}{ 0" expt 22 hap (12.345) 


To determine the i we can apply the change a. wanableg proposed in 
(12.338) to simplify the integral in (12.345) to 


i m Aa 7 n!} m n ~ m n 
J, eo exp ap = (Sy |g exp 63d (12.346) 
0 n! Aa 0 


1/n 


I(A) = ye ~)explan(zy) + 2k+)=— 2) (12.342) 
nN 


>) 


Introducing another round of change of variables ¢ = 5", we get 


ee . ! is = 
I a exp { Aap \dp _ ( n ene I glint Din 1 exp {-s}ds (12.347) 
0 n! Aa n#0 


The last integral in (12.347) is again recognized as the gamma function. Thus, we 
finally get 


m+1 


(m) 
go) "TD exp{aw(en)+ilm-+ D1 2k+) 2-5} 


m! Alw rom ) 


I(A)s “| 


(12.348) 
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This result again agrees with that given by Bleistein and Handelsman (1986), but 
the present proof is much simpler that theirs. 


12.9 STATIONARY PHASE OF STOKES-KELVIN 


According to the discussion by Watson (1944), the method of stationary phase was 
first published explicitly by Kelvin in 1887 when he considered the asymptotic 
value of the following integral with a large argument, which was related to two- 
dimensional waves in water (Kelvin, 1887): 


v= = [“ costmtax—yf(m)} dm (12.349) 
27 40 


However, Watson (1944) also discovered that the same idea had been used by 
Cauchy in 1815, Stokes in 1856, and Riemann in 1876. Nevertheless, in most 
literatures this method was attributed to either Stokes or Kelvin. 

In particular, the following integral was considered: 


I(x) = i g(t) exp fixh(t)}dt (12.350) 


where x — o. Note that the Laplace method deals with a real function in the 
exponential function, the method of steepest descent considers a complex function 
in the exponential function, whereas in the present case, the method of stationary 
phase treats a purely imaginary function in the exponential function. Actually, the 
method of analysis closely resembles that used in the method of steepest descent. It 
is well known from Euler’s formula that 

exp {ixh(t)} = cos[xh(t)]+isin[xh(t)] (12.351) 
When x — , this is a highly oscillating function. In mechanics and physics, such 
functions are mostly related to wave phenomena, and xh(f) is actually the phase of the 
wave motion. The integral is essentially zero as the integrand is rapidly oscillatory 
such that the contributions from adjacent sub-intervals nearly cancel one another, 
except at the point where the function is stationary (i.e., A (¢) = 0). Kelvin asserted 
that the main contribution is from the end points and stationary points, but the 
contribution from the stationary point is more important than those from the end 
points. In other words, we are looking at the point where the phase of the wave 
motion is stationary. This suggests the name “method of stationary phase.” 


12.9.1 Erdelyi’s Derivation 


Assume that the stationary point is at ¢ = tsuch that 


h'(r)=0, h(t) #0, ax<r<B (12.352) 
Since the contribution is from the neighborhood of the stationary point, we have 
T+E 
I(x) ® } g(t)expfixh(t)}dt (12.353) 


where ¢ is a small positive number. Following the procedure discussed in the Laplace 
method, we can apply the following change of variables: 
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2udu 
h(t) 


h(t)—A(t) =u", dt= (12.354) 
Thus, the integral in (12.353) becomes 


[A(c+e)—h(r)]'? 


I(x) ® [ ie Oro a ip oxP tale) ta? Tidu (12.355) 


We now consider the following limit as t > 7 


tig (12.356) 
wor h(t) 0 
Since the limit is of the form of 0/0, we can apply L’H6pital’s rule to get 
gy i (12.357) 


tot h(t) tar h(t) terh"(t) 2u 
We note that the limit that we want to find on the left hand side appears as an inverse 
on the right hand side. Thus, we can rearrange the result as 


Bul) yi2 2 


lim[ (12.358) 
or h(t) A"(c) 
Using this result, we can rewrite (12.355) as 
1/2 
I(x) ® a5 a ; g(t) exptivh(r)}| exp fixu? }du (12.359) 
To evaluate the integral in (12.359), we make the following change of variables 
1/2 
= etizl4 =] (12.360) 
x 


where the positive sign is for h’(7) > 0 and the negative sign is for h’(2) < 0. 
Effectively, we rotate the angle or we convert the oscillating exponential function to 
become a decaying function as 


un tit/4 no 
I(x) * oe ; tryexptin(e)} Te-[, o" expt-S¥de (12361) 


The last integral becomes a gamma function and thus we have 


1/2 . 
ie | g(r) exp {ixh(r) 2r) (12.362) 


Finally, we can use the well-known result for the gamma function that (see Example 
4.2 in Chapter 4) 


rm) = Jn (12.363) 
Consequently, we have the asymptotic result as 
I(x) ® au (rT) ex Gens (12.364) 
tah"(ey | SPR 
This result of course agrees with that of Erdelyi (1956), who only considered the 


positive sign. Note however that there is a typo in equation (6.1.5) on p. 220 of 
Bleistein and Handelsman (1986). 
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Example 12.5 Consider the leading order term of the following integral form of the 
Bessel function defined by: 


7t= = | cos(nt —Asint)dt (12.365) 
a 


where A> 0. 


Solution: We first note from Euler’s formula that 
cos 0 = ste” ee") (12.366) 
Using (12.366), we have 


J, (A) = a fF exp(int) exp(—iA sin t)dt + [ exp(—int) exp(i/A sin nat 
- : : (12.367) 
= a, +Jna(A)s 
1 


We recognize that both integrals are of the type of (12.350). For the first integral, 
we find 


A(t)=—sint, h'(t)=—cost, hA"(t)=sint (12.368) 
The stationary point is at tf = 7/2, and thus 
h(/2)=-1, h'(z/2)=0, A"(z/2)=1 (12.369) 
Employing the formula derived in (12.364), we get 
J (A) © exp {iA} exp(i oe Pe exp) (12.370) 
Similarly, for the second integral we can follow the same procedure in deriving 
J,9(A) © exp {id} exp(-i re [2 exp( = (12.371) 
Substitution of (12.370) and (12.371) into (12.367), we find 
2n nt 
J (A) ® cos(A 12.372 
n (A) a 7 ( 5 re ( ) 


This can be found in 9.2.1 of Abramowitz and Stegun (1964). 


12.9.2 Stationary Phase of Higher Order 


This section follows closely the presentation by Bender and Orszag (1978). Let us 
recast the problem as 


I(x) = (f f(Dexpfixy(O}at (12.373) 


as x — oo, Without loss of generality, we can assume the higher order saddle point is 
at the boundary point ¢ = a defined by: 


y'(a)=...=v? Y(a)=0 yw” (a) 40 (12.374) 
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First, we can decompose (12.373) into two integrals: 


I(x) = i f(Dexpfixy (dt + [. f Oexpfaxy(W}dt =1,(x)+1,(x) (12.373) 


We are going to show that the order of magnitude of these two integrals is not the 
same as x — 0. The leading order term is from the first integral as the second one 
decays much faster than the first for large x. Let us apply integration by parts to the 
second part of (12.375) as: 


b 
he)=])_ SOexptiw(O}at = —_[" fdtexptiny(o) 


: 1 ¢? explixy(t)] df(t 

-_1_ Rocsetol. =f ON at 
ixy'(t) ixdate y(t) dt 

If there is no stationary point within the limit of integrations, the integrals in (12.376) 


decays as 1/x, and so does the second integral in (12.375). 
To evaluate the first integral in (12.376), we use Taylor series expansion for 


Yt) as 


(12.376) 


y(t) =y(a)ty (a) —— 


eis (12.377) 
p! 


Thus, if f(d) is a regular function, we get 
; até 1x 
I(x)~ f(@exptixy(a)}[ exp Cy ane-ay? yar (12.378) 


The following change of variables is adopted 


S=t-a (12.379) 
Then, with ¢ > o (12.379) is simplified to 
F % ix 
I1(x)~ flaexptixy(a)}{ expt yas? hs (12.380) 
To evaluate the integral, we apply the following change of variables 
i l/p 
+ ! 
s=e P|_P™ (12.381) 


xy (a) 
where the plus sign is used if y/”(a) > 0 and the minus sign is used if y/”(a) < 0. 
Employing this change of variables, (12.380) becomes 
l/p 
p! 
xy (a) 


In obtaining this, we set upper limit ¢ — 00. The integral is again the gamma function, 
and thus we finally arrive at the following result: 
l/p 


1) -|—2— | s@exptiixyia+yre2) 42.383) 
xy (a 2p P 


L()- f(ayexp fifxy(a)+—] ee } “yest exp{—u}du (12.382) 
2p po 
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Since p is an integer larger than 1, this integral decays as 1/x’”, and it is much slower 


than 1/x compared to the second integral in (12.375). Therefore, the result is 


Ix) ~ fl@exptiiya)+ Zp t= 2 rr) (12-384) 
P x|y »(a)| P 


This result agrees with Equation (6.5.12) on p.279 of Bender and Orszag (1978). 


Example 12.6 Consider the leading order term of the following Bessel function 
with both the argument and order approaching infinity: 


J,(n) = ~ ["cos(at —nsin#)dt (12.385) 


where n 00. 


Solution: For this case, the phase function is identified as 
y(t) =sint-t (12.386) 

Therefore, we have 

y'(t)=cost-l, w"(t)=-sint, w"(t)=—-cost (12.387) 
The stationary point is at ¢ = 0. Substituting the stationary point into (12.387) we 

et 

: y'(0)=0, w"(tH=0, w(t) =-l (12.388) 
Therefore, we have p = 3 for this case. Employing the formula derived in (12.384), 
we obtain 


1 1 1 
J,(n)~ tReee™(2y°ry (12.389) 
a 3 n | 
Finally, taking the real part we get 
J,,(n) ~ A2283N HT (12.390) 
/s 


This result was first obtained by Cauchy in 1854, and this was also obtained by 
Nicholson in 1909, Rayleigh in 1910, and Watson in 1918 (Watson, 1918). 


12.9.3 Stationary Phase of Higher Dimensions 


So far, our results are for the one-dimensional case. We can extend our analysis to 
multi-dimensional integral as: 


1(A) = [--[oG.-x,dexpiA f (XqpueX, idx, - de, (12.391) 


The derivation of the result is similar to that employed in Section 12.7.4. In 
particular, the stationary point can be determined as 


Vf(a) =0 (12.392) 
This is equivalent to 
Le een er (12.393) 
Ox xX, 


where 
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a=(q,...,4,) (12.394) 
As discussed before, the determinant of the Hessian matrix is larger than zero: 
af 


det|H (a)| = det >0 (12.395) 


Ox;OX;; 
Expanding f(x) above a, we get 


sees a, + S(x-ay" H(x-a)] 


iAL f (q 
1(A) ¥ P(4,,43,..54,) [fe dx (12.396) 


Denoting y = x—a, we obtain 


; aes, 
I(A) x playe*@ } " | e2 dy (12.397) 
We can decompose the Hessian matrix as (see p. 288 of Lipschutz, 1987) 
P’ DP =H (12.398) 


where P is the orthogonal matrix and its column is composed of eigenvectors of the 
Hessian matrix. Consequently, D is a diagonal matrix with eigenvalues of H as the 
diagonal terms: 


a4 0 0 
D=|: “. (12.399) 
0 0 A, 
Thus, (12.177) can be expressed as 
; rp 
1(A) x gaye"! | - ( er ‘de (12.400) 
where 
z= Py (12.401) 
The matrix P is composed of unit eigenvectors as: 
det| P| = det a (12.402) 
Oz ; 


It is straightforward to see that 
zg’ Dz =Az, +..+A,Z2 (12.403) 
Substitution of (12.403) into (12.400) ae 


Az, 
I(A) = gaye” fer" dz... } eo. dz (12.404) 


We now recall the previous result ay in Section 12.9.1 ae 


[ exp (ti Au? \du =(— 1 yea (12.405) 


1 
re nis (Bye 
ap ENG i 
The plus or minus sign depends on whether the eigenvalue 4; defined as the 
diagonal terms in (12.399) is positive or negative. In view of the symmetric 
property of the integral, (12.405) can be rewritten as 
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a 


‘- exp (HS Au \du = (Se Tr (12.406) 
where i = 1,2,...,n. Substitution of this result into (12.404) gives 
; ZS son A, 2 1 
TA) x laye*fMe* (Ayr? (12.407) 


a Sand, 


Recall that the determinant of the Hessian equals the product of the eigenvalues: 


TV = det D = det H (12.408) 
i=l 
Substitution of (12.408) into (12.407) gives 
20 n/2 1 ‘ 17 a 
I(A) © p(a)(—)"* ——— exp {iA f(a) +— ) sgnd,} (12.409 
A vdet H 4 2 ¢ 
If the stationary point is at the boundary, the value of the integral will be half 


I(A) x =a) nia 


epiayia) + sen ij} (12.410) 


1 
Vdet H a=) 
This result agrees with Equation (2.32) on p. 487 of Wong (2001), but the present 
derivation is much simpler than that of Wong (2001). 


12.10 NAVIER-STOKES EQUATIONS FOR SURFACE WAVES 


As derived in Chapter 2, the most general form of incompressible fluid flow can be 
modelled by the following Navier-Stokes equation 


Oe a, Yess pha (12.411) 
Ot fe) 
Continuity of the fluid for a 2-D, incompressible, inviscid flow with no capillary is 
oe + On = (12.412) 
Ox OZ 


The equation of motion along the x-axis is 


ead cL Se (12.413) 
Ot Ox Oz p Ox 
The equation of motion along the z-axis is 


gg ge at (12.414) 
Ot ox Oz po 


As shown in Figure 12.11, there are two sets of boundary conditions, one at the free 
surface and one at the bottom. The dynamic condition on the free water surface is 
p=0, z= (12.415) 


Another surface condition is the kinematic condition on the water surface 
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on On 
+u =w, z= 12.416 
at ox J con) 
whilst the kinematic condition at the sea bottom is 
w=0, z=-h (12.417) 


z=n(x,t) 


w(—h) =0 


Figure 12.11 Boundary condition for shallow water waves 


Because of the incompressibility given in (12.412), we can introduce a velocity 
potential ¢ 

i eet (12.418) 
Oz 
In terms of the velocity potential, the continuity condition in (12.412) can be 
expressed as a Laplace equation 


2 2 
oe Et 2, 97620 (12.419) 
Ox” Oz 
Equations of motion along the x- and z- axes given in (12.413) and (12.414) can be 
rewritten in terms of velocity potential as: 


00 10.0 fe) 10 
Cogs pa ieee (12.420) 
Ox Ot 20x Ox OZ p ox 
00 10.0 fe) 10 
Cn Oe ise (12.421) 
Oz Ot 20z Ox Oz p oz 
The boundary conditions become 
p=0, z= (12.422) 
6n Op0n __ Od 
=P zan(xt 12.423 
Ot Ox Oo oz eee ( ) 
And the kinematic condition at the sea bottom is 
OF 26 pj (12.424) 
oz 
Integrating (12.420) once, we obtain 
0g, 1,062 06.2, P 
+ + +—+g7=0 12.425 
GP Bae G - 4 ( ) 


We now specify (12.425) on the free surface (z = 77) to get 
6 1.0 0 
(FA (By + Pry+en=0, z=) (12.426) 
0 2° Ox Oz 
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Finally, the equations to be solved are (12.419), (12.423), (12.424) and (12.426). 


12.11 PERTURBATION FOR NAVIER-STOKES EQUATION 


Note that the governing equation for the velocity potential is nonlinear and, in 
addition, the shape of the free surface, which is one of the boundaries, is unknown. 
This nonlinear problem can be handled by the perturbation method discussed in 
this chapter. In particular, we assume that motion in the water is small. Without 
wave motion, the surface of the water is flat, and with wave motions, the water 
surface becomes uneven. Thus, the small motion also means that the vertical 
displacement of the free surface is small. This assumption normally breaks down 
when a wave motion is plunging onto a sloping bottom along the coastline. It can 
be shown that momentum change due to surface undulation with wavelength L is 
proportional to the wave steepness defined by ¢ = H/L where H is twice the 
amplitude of the wave function on the free surface, as illustrated in Figure 12.12. 
For small amplitude wave motions, we have ¢ being small. 


12.11.1 Shallow Water Waves 


For small amplitude waves with small ¢ (say <1/7), we assume that the wave 
motion is small and thus we have 


b= &b, + °b + Edy +... (12.427) 
= EN +E, + ON +... (12.428) 
P=DotED +E Pr te pst... (12.429) 


where ¢= H/L is the wave steepness (the nonlinear parameter of the problem). Note 
that we have finite water pressure p even for the case of no wave motions. 
Therefore, only the first order terms of the surface undulation 77 and wave function 
¢ is proportional to ¢, not p. 


(i) Equation of continuity 


Substitution of (12.427) into the equation of continuity (12.419) gives 


a a a a 

é A bee t rs a bee & a! (12.430) 
Ox ox Oz Oz 

Thus, we have the following governing equations for the first few approximations 

as 


ag aa 
2 2 
O(e’): oe 2 (12.432) 


Or? Gz? 
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L/2 


z=n(x,t) 


H=2a H 
on 


w(-h) = 0 


2 2 
Ole): oy! 85 (12.433) 
Ox 


(ii) Dynamic condition on free surface 


The free surface condition given in (12.426) is expanded around z = 0 with respect 
to 7 


ag ang ag b> 002 
G )-=0 Lor )-=0 2! Tot sa tee i es = ars aE 1--0 
8 (2b , aby a. bby er, 
{2 oe eneeair +( a (12.434) 
+...497=0 
All these terms can be expanded in a series of € as: 
ia 2 ge OO oO 5. (12.435) 
Ot Ot ot 
ag Oh 20h 
ar (En, +é Vi Pa: )[é —— + 6° —<+...] 
oe, a oe (12.436) 
2 
en Alar em —— aoz en alae +O(e* ) 
2 3 3 3 
1] o¢g er Og, 4 12.4 
Se aes (tole ) (12.437) 
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sy +h |= sl oo +e" a ro eee’ es] 
= EA A 14 Cy 4 By By ofc") 


(12.438) 


1] 0, 0¢.. (0¢.> 0g Ob O06 0°¢ 
FRAG "A ei} - fe Ze. see 


Og oa 


= (em +€°M +.. ete a 
0b, OH Oh — 
Ox aa Oz Oe? 


Substituting of these results into (12.434) and collecting the first and second order 
terms, we have 


a J+(é O. yes, .)} (12.439) 


=n 


Ole): ~<A en =0 (12.440) 
2. bh, _ aa OA» , Aye 
Ole"): Hy en hae ow ae | ] (12.441) 


Note from (12.441) that the second approximation depends on the solution of the 
first approximation. 


(iii) Kinetic condition on water surface 


We expand the kinetic condition given (12.423) around z = 0 to get 


én _ 26 , ag 7 OO. 121 __ 26 oe 7 Oo. 
te Ae Sao "a & at ae 


(12.442) 
gl, 2 Ons +... ig ns CPs pl got, eee. 
Ot Ot Ox Ox a Oz Ox Ox 
Oh | 2 Oh 2, Oh 
é +é ori eer t AB ce 

[ ae a ] ui) a2 

(12.443) 
The first and second order approximations are: 
Ole): Oh OM 6 = (12.444) 
Ot OZ 
2 

Oe, Sieg OM OK Oh, ON (12.445) 


a OU CO 
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(iv) Kinetic condition on sea bottom 


The boundary conditions given in (12.424) for the first two approximations are 


Ole): us =0 (12.446) 
Oz 
Ole"): 6 4 (12.447) 
Oz 
We now summarize the formulation for the first order approximation 
2 2 
eo A =0 (12.448) 
Ox Oz 
Og, Om , OW 
+ =0, + =0, z=0 12.449 
ae aa eres 
a =0, z=-h (12.450) 
Oz 


The governing equation for the first approximation is a Laplace equation. The two 
conditions given in (12.449) can be combined by elimination. Thus, the surface 
boundary condition becomes 


2 
ao. a gel. ga (12.451) 
Ot Oz 
The problem of the second order approximation is summarized as 
2 2 
al ese (12.452) 
ox Oz 
Ag, OH 1, b> , Phy? 
+ = , z=0 12.453 
a ee ae AIG) ( a Z ( ) 
2 
MG OO ay (12.454) 
Ot Oz Ox ox 2” 
EO Ny Ga (12.455) 
Oz 


Again, nonhomogeneous terms on the right of (12.453) and (12.454) depend on the 
solution of the first approximation. 


12.11.2 Airy Surface Waves 


The first order of approximation for a surface water wave is called an Airy surface 
wave. We adopt a separation of variables as 


o(x, Z,t) = X(x)Z(z)T(t) (12.456) 
Substitution of (12.456) into (12.448) gives 
2 2 
d = ZT =—-XT d f (12.457) 
dx dz 


This can be rearranged as 
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1a?X  1d?Z_ 2 
X dx? Z dz’ 
where & is the constant of separation of variables. This leads to two ODEs, one for 
X and one for Z: 


(12.458) 


2 2 
é +4 XX, é = k-Z=0 (12.459) 
dx dz 
The solutions for the first and second equations of (12.459) are respectively 
X = A sinkx + A, coskx (12.460) 
Z = B, sinh kz + B, coshkz (12.461) 


There is no governing equation for T(t), but we look for a periodic solution with 
respect to time 
T =C\sinot+C, cos at (12.462) 


Combining the time and space variables, we obtain 

@(x, z,t) =[B, sinh kz + B, cosh kz]{C, sin(kx — o@t)+C, cos(kx—at)} (12.463) 
which is a forward moving wave. Note for simplicity that we have dropped the 
subscript “1” in the wave potential. Similarly, the backward moving wave is 

@(x, z,t) =[B, sinh kz + B, cosh kz] {C, sin(kx + @t)+C, cos(kx+at)} (12.464) 


Without loss of generality, we look for a particular solution form 
@(x, z,t) =D, cosh kz + D, sinh kz]cos(kx — at) (12.465) 


The bottom boundary condition requires: 


CS. te nh oie 00486) 


& 
This leads to 
D, = D, tanhkh (12.467) 
Using this result in (12.465) and differentiating with respect to time and z, we get 
2 
= = ~o’ D, [cosh Az + tanh khsinh kz|cos(kx — at) (12.468) 
t 
0g : 
-g ee = —gkD, [sinh kz + tanh kh cosh kz] cos(kx — at) (12.469) 
Iz 
Substitution of (12.468) and (12.469) into the following surface condition 
2 
a Soap 0% (12.470) 
ot Oz 
gives the dispersion equation for frequency 
@” = gk tanhkh (12.471) 


This dispersive relation relates the wave frequency @ with the wave number k. 
Whenever @ is a function of k, the wave is called a dispersive wave. The main 
characteristic of a dispersive wave is that the wave amplitude decreases with 
propagation. Now, we can let 


Yee 


= (12.472) 
cosh kh 
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In addition, we note the following sum formula for the hyperbolic cosine 


cosh(x + vy) = cosh xcosh y +sinh xsinh y (12.473) 
Then, (12.465) can be rewritten in a compact form as: 
¢ = Acosh[k(z + /)]cos(Ax — at) (12.474) 
Differentiation with respect to time gives 
= = wAcosh[k(z + h)]sin(kx — ot) (12.475) 


The unknown constant A must depend on the wave amplitude a. Let us assume the 
surface undulation as 

n = asin(kx — at) (12.476) 
To find 4 in terms of a, we can substitute (12.476) and (12.474) into the (12.440) 
or 


og 

A +gn=0 (12.477) 
Thus, we have 

@A cosh khsin(kx — ot) = gasin(kx — ot) (12.478) 

Solving for A, we get 
ga 1 _awo 1 
@ coshkh  k _ sinhkh 
The second part of (12.479) results from the dispersive relation derived in 
(12.471). Thus, we obtain the flow potential as 


(12.479) 


h 
je ee y] a ae, 
. h ; (12.480) 
_ ga cosh[k(z+h)] ee, 
a) cosh kh 
The time dependent function can be expressed as 
cos(kx — at) = cos[k(x — : t)] = cos[k(x —ct)] (12.481) 
where the phase speed c is defined as 
a) 
os 12.482 
Fs ( ) 
Finally, the velocity components can be obtained as 
1p =F = ay SORE ON in ke— aot) (12.483) 
Ox sinh kh 
w= OF ao smn +hM cos(kx — at) (12.484) 
OZ sinh kh 


The trajectories of water particles can be evaluated by using a Lagrangian 
formulation. Assume that the initial position of the water particle is at (xo, zo) and 
the position of the same point at time ¢ is (x = xo+6x, z = Zo+dz). The change of the 
position can be approximated as 
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d(x-X 9) 


a =Uz (Xo, Z9.f) =Ug (Xp + OX, Zy + 02,1) 
ee era ote es (12.485) 
x 


Oz X0.20 ot 


+( f, wrdt) 


XQ) >20 >t 


X0.20 +t 
t Ou 
=Up(Xp,Zo.t) +(( u,dt)— 


Ox 


vial 


o4 


vie 
X0)>20 >t 


d(z—Zo) 


. = W,(X95Z95t) = We (Xp + OX, Zp + 62,1) 
t 


awe (12.486) 


Oz 


Ow 
'E 
= We (Xp.Z0t) + Ox — 
x X0).20.t 


XQ .Z0 +t 

where the subscripts “EZ” and “L” denote Eulerian and Lagrangian velocities. Using 

the (12.485) and (12.486) as the first terms in (12.483) and (12.484), we get 
d(x—Xo) _ a cosh[A(Z,) +/)] 


sin(kx, — ot 12.487 
dt sinh kh eo" ®) 
d(z- inh[k h 
(270) _ ao zi [ Go+) cos(Ax, — at) (12.488) 
dt sinh kh 
Integrating with respect to time, we have 
cosh[k(zp) +/)] 
7 cos(kx,) — at 12.489 
hs a rrr) (kx — ot) ( ) 
sinh[k(z) +/)] . 
=a sin(kx, — ot 12.490 
24 sinh kh iy oe) oil 


These two components can be combined to get the following elliptic trajectory of 
particle motions: 


Figure 12.13 Elliptic trajectories motion for the first approximation 
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X— Xo 2 Z —Zo 2 
7 12.491 
. cosh[k(z, +) ae iGo coe) 
sinh kh sinh kh 


This elliptic trajectory is illustrated in Figure 12.13. This is the Airy wave. Note 
however that if the nonlinear effect is included in the wave analysis, the motion of 
the water particle is no longer a closed circuit. That is, the water particle is being 
drifted forward in each cycle. 


12.11.3 Shallow and Deep Water Limits 


We will now consider the limiting cases of Airy’s wave that were first considered 
by George Green (recall Green’s theorem in Chapter 8). We first recall some 
relations for wave characteristics. The wave speed is defined as 
o L 
c=—-—=— 
k T 
where T is the period of the surface wave. The wave frequency relates to the wave 
period as 


(12.492) 


2 
pee 12.493 
T ( ) 
The wave number is defined by 
pe” (12.494) 
L 


Using these and (12.471), we write the phase speed in terms of the wave number, 
gravitational constant g, and depth of water h as 


2 
= =F tanh kh 
kek (12.495) 
el 


2h 
= fain me tanh( = ) 
20 L 20 L 

The phase speed given in (12.495) can be put in a normalized form as: 


eng Reece (12.496) 

gh kh 

In other words, if the wavelength and water depth are known, the wave speed can 
be estimated. The wavelength can be expressed as 


2 
2ah 
jeate* pane) (12.497) 
20 L 
We now consider two particular limits. 
(i) Deep water limit 


When the water depth is much deeper than the wave length, we have 
hiIL>o (12.498) 
For this deep water limit, we have 
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tanh kh > 1 (12.499) 
By virtue of (12.492) and (12.496), we have the following relation for a deep water 
wave: 


2 
poe 21 569? (12.500) 
20 


where the unit of wavelength is in meters while the period is in seconds. The wave 
relates to the period as 
go2 igor (12.501) 
20 
where the wave speed is in m/s and the period is in seconds. The normalized speed 
can be specified from (12.496) as 


c 1 
=> = (12.502) 
Vgh kh 
(ii) Shallow water limit: 
When the wavelength is much larger than the depth, we have 
h/iL—>0 (12.503) 
Using the first term of Taylor series expansion for hyperbolic tangent, we find 
ai es (12.504) 
L L cT 
Substitution of (12.504) into (12.495) gives 
62 2n ee (12.505) 


C= 
2a cT c 
Rearranging (12.505) we find the classical result of water speed for a shallow 
water wave: 


e=joh (12.506) 
The water length Z depends on the period 7’ as 
LS ifehr (12.507) 


We now consider a scenario for a large submarine earthquake and apply the 
shallow wave approximation to calculate the wave speed of a tsunami wave. When 
a large shallow submarine earthquake of magnitude 7.5 or higher occurs in deep 
sea (on the order of 8000 m), large sea bottom movements occurs. Water will be 
displaced upward or downward, and a so-called tsunami (literally means harbor 
wave in Japanese) will be generated. Because the size of the rupture surface for a 
large destructive earthquake is on the order of 200 km or more, the wavelength of 
such an initial disturbance is also on the order of 200 km. Using these data, 
(12.507) shows that the period of such an initial wave form is in the order of 12 
minutes. In addition, we have A/L ~ 0.04 such that the shallow water wave 
assumption is valid. 

The wave speed of such a tsunami wave can be estimated by (12.506) as 
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c= V9.81x8000 = 280m/s 


12.508 
9 OA am | hr = 1008kin | hr ——e 


280 


c= Vgh Shallow water limit 


Figure 12.14 Phase speed for small amplitude wave, shallow water wave, and deep water wave 


The tsunami wave travels in deep sea as fast as a jet airplane. However, the wave 
speed decreases rapidly when a tsunami wave comes close to the continental shelf, 
and at the same time the amplitude also increases. When such a tsunami wave 
comes ashore, the amplitude of the wave will increase rapidly and our small 
amplitude wave assumption will break down. Therefore, many scientists and 
geophysicists claimed after the 2004 South Asian tsunami that a tsunami 
propagating to shore at the speed of jet airplane is totally incorrect and is 
misleading. 

For shallow water waves, h decreases and the wave speed decreases as V(gh). 
As illustrated in Figure 12.15, the wave crest line always turns roughly parallel to 
the coastline. This is the reason why we seem to see waves are coming toward the 
coastline no matter where you go along a beach. 


12.12 SUMMARY AND FURTHER READING 


Using asymptotic and perturbation methods, especially singular perturbation, in 
solving ODEs or PDEs is not a straightforward business. This method is normally 
covered in graduate courses in the area of applied mathematics. It is a topic still 
undergoing fundamental developments. The perturbation method was developed to 
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solve nonlinear ODEs containing a physically small parameter, and the technique 
turns one nonlinear ODE into an infinite system of linear ODEs by expanding the 
unknown function in asymptotic series in the small parameter of the problem. 
Normally one or two term expansions will give very accurate and meaningful 
results. In view of the asymptotic nature of the technique, more terms do not 
necessarily give a more accurate result. Very often, even powerful numerical 
methods will also break down when it is applied to solve highly nonlinear 
problems, unless you have an idea of how the solution is going to behave and under 
what circumstances the solution may change rapidly. For example, solutions of 
nonlinear systems are often unpredictable near singular points or bifurcation points. 


Figure 12.15 Turning of wave toward the coastline as predicted by (12.506) 


In a chapter of this size, we can only introduce the basic ideas of this 
advanced technique that can provide approximate but insightful results for highly 
nonlinear problems, which are otherwise unsolvable. Our discussion only targets 
elementary and introductory levels. In particular, asymptotic expansions and their 
application in the regular perturbation method in solving ODEs are discussed. 
Singular perturbation methods are discussed briefly, and they include the method of 
matched asymptotic expansion (or boundary layer analysis), multiple scale analysis, 
and WKB approximation. The application of asymptotic expansion in evaluating 
integrals with a large parameter is discussed in light of the Laplace method, 
Riemann (or Debye) method of steepest descent (or saddle point method), and the 
method of stationary phase of Kelvin and Stokes. We conclude the chapter by 
looking at how to apply the perturbation method to convert the Navier-Stokes 
equation of fluid flows into a linear solvable problem. The Airy water wave and 
shallow water wave are considered as an example. 

There are a lot of good and advanced books in asymptotic and perturbation 
methods. They include Kevorkian and Cole (1981), Nayfeh (1973), van Dyke 
(1975), Bender and Orszag (1978), Lagerstrom (1988), Lin and Segel (1988), 
Holmes (1995), and O’Malley (2014). Holmes (1995) and O’Malley (2014) are 
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particularly useful references in terms of the historical development of the method, 
and both of them contain a lot of examples. The scope and coverage of Bender and 
Orszag (1978) are very comprehensive and provide detailed discussion on many 
issues that you may likely encounter in applying the perturbation method. The book 
by van Dyke (1975) aims at solving fluid mechanics problems. Nayfeh (1973) 
probably provides the most comprehensive references of the applications of 
perturbation methods in engineering, science, and applied mathematics. Most of 
these books are not targeted at the undergraduate level and readers may find it 
difficult to read. At the introductory level, Lin and Segel (1988) provided the best 
insights and is very easy to read for beginners. 

For asymptotic analysis of integrals, we refer to Erdelyi (1956), Olver (1997), 
Bleistein and Handelsman (1986), and Wong (2001). These books target graduate 
students and researchers and thus are not easy to read. We also highly recommend 
the book by Bender and Orszag (1978) on this topic as well. Their presentation is 
probably more appealing to undergraduates and engineers, but at the same time are 
very comprehensive. 

In the area of structural instability, the perturbation method has also been 
applied to the buckling of beams, plates, and shells (e.g., Reiss, 1969, 1977, 1980a- 
b, 1982, 1984, Reiss and Matkowsky, 1971). 


12.13 PROBLEMS 


Problem 12.1 Repeat the analysis in Example 12.1 but with boundary layer at x = 
1. Redefine the variable in the boundary layer as: 


E — fa (12.509) 
Find the inner expansion, outer expansion, and the composite expansion. 
Ans: 
yo(x)=0, y(E)=Cle* -I 41, yo ~ el" (12.510) 


Problem 12.2 Prove the following Stirling asymptotic formula for Euler’s gamma 
function: 


T(k+1) x V2rkk*e* (12.511) 


for k — oo, where the gamma function is defined by 
T(k +1) =[vetthat (12.512) 
0 


Answer the following questions one by one: 
(1) Show that 


[k+l = [,eetinat (12.513) 


(i1) Use a change of variable t = ks to show that 
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T(k +1) = kk* } " ell-stins] gy (12.514) 
0 


(iii) Use the Laplace method to prove that for k — co 
T(k+1) = V2akk*e* (12.515) 


Problem 12.3 Consider the asymptotic form of the following double integral 
2 2 
I(k) = | | ek oy, y)dxdy (12.516) 


for k — cand gis a regular function for all values of x and y. 


(i) Find the stationary point of the function 
f(xy)=x? -xyt+y (12.517) 
(1i) Apply the following change of variables to the integral 
uax-s, v=y (12.518) 
(111) Show that the Jacobian of the change of variables in (ii) is 1. 
(iv) Show that the asymptotic form of the integral given in (12.516) is 


4 
I(k) = 5 0020) (12.519) 
Hint: Use the definition of the gamma function. 


Problem 12.4 Reconsider the asymptotic form of Problem 12.2 using the formula 
given in (12.284). Show that the answer is the same as that given in (12.519). 


Problem 12.5 Find the asymptotic form of the following integral by the Laplace 
method: 


_ “ —xsinh?¢ 
I(x)=| e dt (12.520) 
0 


I(x) - LE (12.521) 


Problem 12.6 Find the asymptotic form of the following integral representation of 
modified Bessel function: 


Ans: 


K,(x)= | © @-8005h! eggh(vt)dt (12.522) 
0 
Ans: 
a x 
K,(x)=,{/2e> (12.523) 
2% 
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Problem 12.7 Extend the result for the Laplace method to include the fourth order 
derivative term of ¢ and the second order derivative term of f for the following 
integral 


I(x) = [. f (pee dt (12.524) 


where 
d(c)=0, o"(c)<0, f(c) #0 (12.525) 


Hint: See Section 6.4 of Bender and Orszag (1978) for the details of derivation of the 
result. 


Ans: 


I(x) ~ 


2a | £0), SOHO 
=x$"(0) xL 26") 1p") 
LOGO _SfoOle"OF I 

AsO 24d") 


ge yo + 
(12.526) 


Problem 12.8 Repeat the analysis in Problem 12.5 with the result derived in 
Problem 12.7. Find a two-term asymptotic expansion in the series. 


Ans: 


1 fz. 1 
e poids 12.52 
I(x) 5 lowe, (12.527) 


Problem 12.9 Assume the following two-scale perturbation expansions for a 
function w: 


u(t,é) =u°(t,r) +ul(t, rE + ac t)é* +... (12.528) 
where 

t=et (12.529) 

Show the validity of the following differentiation formulas: 

1 

u'(t,€) = “ =uy +[u? ture +[u, + ou le ae (12.530) 
u"(t,6)=ue +[uj, + ue, Je+[2u). +uo + suale® #5 (12.531) 
u"(t,€) =uy, + [Buy , + Uy JE + Buy, + 3p, + sti Je? +... (12.532) 


0 
tte 


0 


1 
u(t, é)= ron +[4u/ + ie Je+[6u,. + Aub + 5M Je” +... (12.533) 


CHAPTER THIRTEEN 


Calculus of Variations 


13.1 INTRODUCTION 


One of the main applications of the calculus of variations is related to the 
variational formulation in mechanics problems, which is the topic to be covered in 
the next chapter (i.e., Chapter 14). In traditional mechanics, the governing equation 
of a mechanics problem can be formulated via two independent paths: (i) 
Newtonian mechanics formulation by considering the equation of motion or force 
equilibrium for an small free body cut out from the original body; and (ii) 
variational formulation that requires the minimization of energy or some functional 
(in the form of integral). The calculus of variations provides the backbone for the 
second approach. The origin of the calculus of variations can be traced back to the 
time of Bernoulli and Euler. Its formal development was, however, mainly done by 
Euler and Lagrange. Seeking a functional (a function of admissible functions in 
integral form) that is stationary, Euler in 1736 and Lagrange in 1755 derived 
independently that a second order PDE called the Euler-Lagrange equation of the 
functional. The resulting governing equation of the problem is known as the Euler- 
Lagrange equation. A special case of it is known as the Beltrami identity if the 
integrand function inside the integral of the functional is independent of the 
independent variable of the problem. The term calculus of variations was 
introduced by Euler. 

The application of the calculus of variations is mainly used in searching an 
optimum solution of problems. For example, these problems include what is the 
shortest distance between two points in space, what is the shape of the strongest 
column (proposed by Lagrange in 1773), what is the shape of the column strongest 
against torsion (St. Venant problem solved by George Polya in 1948), what is the 
shape of a drum of minimized tone for a given area (Rayleigh conjecture solved by 
Courant, Faber, and Krahn in 1920s), what is the shortest curve between two points 
on a curved surface (geodesics problem first considered Euler in 1755), what is the 
shape of a simply connected electric capacitor that maximizes capacity (solved by 
Poincare and Szego), what is the shape of a soap film form between two metal 
circular rings (catenoid problem), what is the profile of a wire for a frictionless 
sliding bead giving the shortest travel time (brachistochrone problem posed by 
Johann Bernoulli), what is the least-perimeter of a soap bubble enclosing a given 
volume of air (solved by Schwarz in 1884 as sphere bubble), what solid with 3-D 
shape minimizes heat loss (Polya’s cat curling problem), what is the shape of a 
rocket nose that minimizes the air resistance in supersonic flight (Newton's minimal 
resistance problem), and what is the shape of a closed curve of fixed length giving 
the greatest enclosed area on a surface (isoperimetric problem or Dido problem). 
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The geodesic problem is of great importance in surveying, navigation on the 
surface of the earth, signals traveling on earth’s surface, and defining maritime 
boundaries. The term “geodesic line” was coined by Laplace in 1799. If the earth’s 
surface is approximated by a sphere, the shortest distance between two points is an 
arc of a great circle (Euler in 1755). For the more realistic ellipsoidal earth, the 
problem is more complicated and has been considered by many famous 
mathematicians and scientists, including Newton in 1687, Clairaut in 1735, 
Legendre in 1806, Oriani in 1806, Bessel in 1825, Gauss in 1828, and Poincare in 
1905. 

In the 19" century, Weierstrass realized that there could be subtle problems 
involved as the Euler-Lagrange equation is the necessary but not sufficient 
condition, and thus some of the solution is not the minimum or maximum. The 
study of the existence of an extremal value of functionals results in the so-called 
direct method of the calculus of variation. The main contributors are Weierstrass, 
Schwarz, Poincare, and Hilbert and their works are based on functional analysis 
and topology. This topic is, however, out of the scope of the present study. 

In this chapter, we will discuss the calculus of variations and the associated 
Euler-Lagrange equation for the case of a single variable and the case of multi- 
independent and dependent variables. The idea of the Lagrange multiplier will also 
be discussed. 

For example, the brachistochrone problem, catenoid problem, Dido’s problem 
of isoperimeter, and geodesics will be considered as illustrations in the present 
chapter. Among these problems, we should mention the problem of isoperimetry 
(i.e., solving a problem of isoperimeter like the Dido problem) in particular. In the 
19th century, Belgian physicist J. Plateau experimented with soap film and 
conjectured that every nice closed wire loop bounds a soap film or minimal surface. 
This is referred to as the Plateau problem in the literature. This conjecture was 
subsequently proved by J. Douglas in 1931, who was awarded the Fields Medal on 
this achievement (Douglas, 1931). Another Fields Medal recipient, Enrico 
Bombieri, received the medal because of work on higher dimensional minimal 
surfaces. Thus, calculus is an important topic in mathematics. 


13.2 FUNCTIONAL 


A functional / is defined in terms of a function F, which is in turn a function of 
another function y and its derivatives: 


1(y)= [Fess y(x)."(x)lde (13.1) 


where x is the variable of the function y. The objective of the calculus of variation 
is to find what admissible functions y(x) will lead to a maximum or a minimum 
value of the functional / or so-called stationary. Physically, / can be an arc-length 
that we want to minimize, an energy function of a problem, the shortest time of 
travel, the maximum or minimum area, etc. More generally, the functional may 
also involve higher derivative terms, or involve more dependent variables. Such 
situations will be considered in later sections. 
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13.3 ANALOGOUS TO CALCULUS 


There is close resemblance between differential calculus and the calculus of 
variations. In particular, differential calculus always involves the determination of 
the point xo of a single variable function f(x) within a bounded domain such that the 
function f will achieve a maximum or minimum value. Mathematically, to find the 
maximum or minimum, we are seeking the solution of 


af =0 (13.2) 
dx X=Xy 
To ensure that the point xo with vanishing derivative is a maximum, we need to 
impose an additional condition of 


2 
EA), 2% (13.3) 


Similarly, the condition for a minimum is 
2 
at >0 (13.4) 
dx X=Xq 


In the calculus of variations, we are looking for a permissible function y(x) such 
that the following integral / that we called “functional” attains an extremal value 
(either a maximum or a minimum) 


b 
I(é)= I Fix, y(x),y' (x) ex (13.5) 
where 
V(x) = V(x) + EN(x) (13.6) 
y'(x) = y'(x) +€N"(x) (13.7) 


In a sense, the functional can be considered as the function of a function. The 
admissible function is 7(x) which will vanish at the end points as shown in Figure 
13.1. The necessary condition for the extremal to occur is that 
dl(é) 
dé e=0 
Note that there is a close resemblance between (13.2) and (13.8). Figure 13.1 
illustrates the similarity between calculus and the calculus of variations. 


=0 (13.8) 


13.4 EULER-LAGRANGE EQUATION 


If (13.8) is satisfied, we also said that the functional / is stationary. Using the chain 
rule and noting (13.6) and (13.7), we get 


a ef OE (13.9) 
de Jal dy dg Oy’ OE 


Differentiation of (13.5) and (13.6) with respect to € gives 
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oy oy’ __, 

> = 13.10 
an ee (13.10) 
Substitution of (13.10) into (13.9) results in 


dl a OF OF 


+ "Idx 13.11 
i la ay” (13.11) 
Applying the condition (13.8) gives 
sr = 
a =f OF pen! te=0 (13.12) 
dé|.g “11 V BV | 
F(x) 
y Maximum ; 
af <0 
dx 
Negative 
radient 


m(x) b 


Figure 13.1 Differential calculus versus the calculus of variations 


I(y +én) 


Figure 13.2 Variations of a functional 


Applying integration by parts to the second term on the right hand side of (13.12), 
we find 


dl 
dé 


a raed dv+[ Gil = 0 (13.13) 


a 


7 ne d OF OF 
é=0 
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However, we have imposed that the admissible function vanishes at the end points, 
and this yields 


n(a) =n(b) =0 (13.14) 
This gives a differential equation for F as 
Oe 6 (13.15) 
ay dx oy 


For simplicity, we will drop the bar in all y in (13.15) in subsequent presentation. 
This equation is called the Euler-Lagrange equation, Euler equation, or Lagrange 
equation. The more popular choice is the Euler-Lagrange equation. This equation 
can be recast in a more explicit form by noting 


f f fy fay F F , AF 

dx ax Gyde Oy d& ox oy oy 
Substituting f = OF/dy in (13.15), we find another form for the Euler-Lagrange 
equation as 


y" (13.16) 


WOR. OF OF OF 
y =) +y at ; = 
oy OyOy  Oxdy’ Oy 
This is an alternate form for the Euler-Lagrange equation given in (13.15), which 
may not be easy to solve. We should note that the Euler-Lagrange equation is only 
the necessary condition for the functional being stationary, but it is not the 
sufficient condition. Therefore, if we get more than one solution, we have to check 
whether it indeed gives the required stationary state and whether it is a maximum or 
a minimum, or even a saddle point. 
It is also possible to rewrite the Euler-Lagrange equation in yet another form, 
in addition to (13.15) and (13.17). In particular, we note that 


d ,OF ,a OF. OF _,, 
res a ee Seer cement 4 
dx ~ oy dx Oy’ Oy 
Next, we can subtract (13.18) from (13.16) (with fbeing replaced by F) to get 


0 (i347) 


(13.18) 


dF d_,OF,. OF OF , ,d_OF 
Yt) (13.19) 
dx dx” oy Ox oy dx Oy 
This can be rearranged as 
d ,OF, OF  ,OF d_OF 
ys =yl (|) (13.20) 
dx oy Ox oy dx oy 


However, the bracket term on the right of (13.20) is exactly zero in view of the 
Euler-Lagrange equation obtained in (13.15). Therefore, we end up with the 
following form of Euler-Lagrange equation: 


‘ {F-y' * = =0 (13.21) 
dx Oy’ Ox 
This is equivalent to (13.15) and (13.17). 
A special form of the Euler-Lagrange equation is called the Beltrami identity, 
which is equivalent to the Euler-Lagrange equation for the case that F’ is not an 
explicit function of x. In particular, we have 


F =F(y,y’) (13.22) 
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Consider the second term on the right hand side of (13.15) by using the chain rule, 
keeping in mind that F' does not depend on x 

dF. OF , OF 

~ (j= y+ Sy 

dx Oy" yoy" oy 
Substitution of (13.23) into (13.15) gives the special form of the Euler-Lagrange 
equation 


(13.23) 


OF OF , OF ,, 
yr yy 
ay yay"? ay! 
Note that this is exactly the same as (13.17) if we drop the third term in terms of the 


x derivative. Finally, to prove Beltrami’s identity, we consider the following 
function: 


(13.24) 


H= a —F (13.25) 
Differentiation of (13.25) with respect to x using the chain rule gives 
dH | OF yn, OF 2, OF , ny (OF y  F yy 
i By? Bye? ay? yy aw tar 
(13.26) 
= YK oF oe OF yr Fy 9 
dyéy’” ay?" ay 
The last identity of (13.26) is a consequence of (13.24). Thus, we must have 
oF (13.27) 
oy’ 


where C is an arbitrary constant. This is called the Beltrami identity. Note that the 
Beltrami identity can also be recovered by setting the last term in the alternative 
form of the Euler-Lagrange equation given in (13.21) to zero [1.e., F # F(x) or F = 
F(y, y') given in (13.22)]. 


13.5 DEGENERATE CASES OF EULER-LAGRANGE EQUATIONS 


In this section, we consider some special degenerate cases of the Euler-Lagrange 
equation. 


Case (i) F = F(y,y’) 


This is exactly the case of the Beltrami identity and we have 
OF 


oy" 


(13.28) 
where C is an arbitrary constant. 


Case (ii) F = F(x, y’) 
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From (13.15), we have the first term zero being zero and the Euler-Lagrange 
equation is reduced to 


eae -0 13.29 
a [ ay! (13.29) 
Integrating (13.29), we get the following form 
aa =C (13.30) 
oy 
where C is a constant. Integrating one more time, we obtain 
F=Cy'+ f(x) (13.31) 
Case (iii) F = F(y’) 
For this case, (13.17) gives 
OF 
ari =0 (13.32) 


In general, the second derivative of F' with respect to y' is not zero, and thus we 
have 


2 
ad =0 (13.33) 
dx 
Or, we have y being a straight line as 
y=Cx+C, (13.34) 


where C| and C) are constants. 
Case (iv) F = F(x, y) 


For this case, the second term on the left of (13.15) is zero, and thus we obtain 


IF 
OF 26 (13.35) 

oy 

Finally, we must have the special case of 
F =F(x) (13.36) 


13.6 FUNCTIONAL OF SEVERAL VARIABLES 


When there is more than one dependent variable, the functional may be formulated 
as 


b 
I={ F(x, y,z 5y',z' )dx (13.37) 
The corresponding Euler-Lagrange equations are 
F d_ OF 
i La (13.38) 
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)=0 (13.39) 
When there is more than one independent variable, the functional may be 
formulated as 
r={[ F(x, yuu, .u, dvdy (13.40) 
R J 


The corresponding Euler-Lagrange equations are 


OF 0 .0OF,. 0 OF 226 (13.41) 
Ou Ox Ou, Oy Ou, 


For a functional containing derivative of higher than first order, we have 


fz } ” F(x, yy, yee (13.42) 
Xo 


The corresponding Euler-Lagrange equations are 
2 


OF d_ OF. d° (OF 
( + 


oy dx dy" dx* dy" 
For the cases of several dependent and independent variables, the functional may 
be formulated as 


y=0 (13.43) 


[= {I F(X, YUU, U,,VsV,5V, )dxdy (13.44) 
) J 


The corresponding Euler-Lagrange equations are 
OF 0 .,0OF,. OOF 
(—) (—) =0 (13.45) 
Ou Ox Ou, Oy Ou, 
OF OOF, 0 (OF, _ 
Ov Ox Ov, Ody Ov, 
It is not difficult to extend these Euler-Lagrange equations to more variables and 
higher derivatives. 
A number of examples will be considered next. 


0 (13.46) 


13.7 CATENOID 


When a soap film is formed between two circular metal rings, the optimum shape 
of the soap film can be expressed in terms of hyperbolic cosine functions. This 
particular problem is called catenoid. It was first considered by Euler. The actual 
shape of the catenoid is shown in Figure 13.3. 

By symmetry, we put the origin of the coordinate system at the mid-section of 
the soap film, as shown in Figure 13.3. The surface area of the ring (shown as 
dotted lines in Figure 13.3) at a distance x from the origin can be formulated as: 

dA = 2m yds (13.47) 
Along this unknown curve, the length increment of the curve ds between the two 
rings can be evaluated as: 


(ds)? = (dx) + (dy? (13.48) 


Calculus of Variations 781 


ds 
<= = |i+é +e (13.49) 


Substitution of (13.49) into (13.47) and ee of dx from 0 to a gives 


This formula can be recast as: 


A=( Qmyfl + y dx (13.50) 
0 


where 2a is the distance between the two steel rings. In the calculus of variations, 
we are searching for an optimum function that gives a stationary functional. In the 
problem of a catenoid, we are looking for a function of the shape of the soap film 
such that the area of the soap film formed between the rings is a minimum. Thus, 
mathematically we expect the functional to be 


I=[oy Ly de (13.51) 


Figure 13.3 Mathematical formulation of the catenoid (photo on the right is reproduced from 
soapbubble.dk with permission) 


Comparing to (13.5), we obtain 


F(x,y,y')=yylt yy? (13.52) 


Differentiation of (13.52) respect to both y and y’ gives: 


OF yy’ 
[= (13.53) 
oy he yy" 
ae es as (13.54) 
oy 
Then, using (13.15) the Euler- —— a becomes 
yl+y F< je ]=0 (13.55) 


ht 
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Recall from (13.53) that 


IF 
—=finy' 13.56 
oy f(y.) (13.56) 
Using the chain rule of differentiation, we get 
d ae ke = dy 0 OF dy" (13.57) 


dx Oy’ Oy Oy’ dx * By Oy" dx 
Substitution of (13.53) into (13.57) gives 


0 OF dy y” 


(—) (13.58) 
oy Oy’ dx hs y? 
Ce ee (13.59) 
oy’ Oy’ dx (1+ y"*) 
Substitution of (13.58), (13.59), and (13.53) into (13.55) yields 
1-27 =0 (13.60) 
l+y’ 
This is equivalent to the following nonlinear second order ODE: 
1+ y'—yy"=0 (13.61) 


In an earlier chapter, we have introduced the method of reduction of order of 
differentiation. In particular, we introduced 


y=p (13.62) 
The differentiation of (13.62) gives 
(2 Oa (13.63) 
dx dy dx dy 
Thus, (13.61) can be reduced to a first order ODE as 


d 
1+ p? yp =0 (13.64) 
dy 
This first order ODE is clearly separable, and can be rearranged as: 
Pap J (13.65) 
l+p y 


It is straightforward to see that 
ld@+p’)_ dy 


(13.66) 
2 1+p? y 
This can be integrated immediately to give 
Cy fl+ p’ =y (13.67) 


Recalling the definition of p from (13.62), we get 


opie =y (13.68) 


Rearranging (13.68) gives 
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a @2y 2) (13.69) 
dx Cy 
This can be integrated as 
a = [dere (13.70) 
Cra 
Co 


It is clear that we can introduce the following change of variables: 
y=c,cosh@, dy=c,sinh6d0 (13.71) 


This change of variables is an obvious choice by noting the following identity of 
hyperbolic functions: 


cosh” 9-1 = sinh? @ (13.72) 
Substitution of (13.71) and (13.72) into (13.70) gives 
G= 2 (13.73) 
Cy Cy 


Combining (13.71) and (13.73) gives 
y=c, cosh(~ +) (13.74) 
Cy © 


As expected, we have two unknown constants for second order ODEs given 
(13.61). Let the boundary conditions for the soap film be given as 


y(0) = b, BAO) <4 (13.75) 
dx 
Substitution of (13.74) into the boundary conditions given in (13.75) gives 
Co =b, c, =0 (13.76) 
Finally, we have 
y= beosh() (13.77) 


We have just obtained the sectional profile of a catenoid of the soap film shown in 
Figure 13.3. 


Figure 13.4 Coordinate system for a 3-D surface of the catenoid given in (13.78)—(13.80) 
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For a three-dimensional shape, we can recast the solution as: 


x = boosh -cosu (13.78) 
y= beosh sin (13.79) 
Z=Vv (13.80) 


where x, y, z, u and v are defined in Figure 13.4. For this case, intuition does not 
work: the minimum soap film is in the shape of revolution of hyperbolic cosine, 
instead of a cylindrical surface. 


13.8 BRACHISTOCHRONE 


The brachistochrone was first formulated by Galileo in 1638, but he was unable to 
solve the problem. This problem was considered one of the founding problems in 
the calculus of variations. This problem was originally posed as a challenge to 
other mathematicians by Johann Bernoulli in 1696. The word “brachistochrone” is 
from Greek, and literally means “the least time.” The problem was solved 
independently by the Bernoulli brothers, Leibniz, and Newton. This problem 
investigates the optimum shape of a frictionless wire along which a bead will slide 
down with the shortest time, as shown in Figure 13.5. This problem was studied by 
Galileo Galilei in 1638 but he mistakenly got the answer as a quarter of a circle. As 
a side note, his last name is actually Galilei although most people just called him 
Galileo because it is how he referred to himself. 


y Q 


Figure 13.5 Brachistochrone problem of a frictionless sliding bead 


Since the sliding is frictionless, the energy of the bead must conserve. In particular, 
the kinetic energy gained in the sliding must come from the drop of potential 
energy from points P to Q: 


=m = mgy (13.81) 


Thus, the velocity can be evaluated as 


v=,)2gy (13.82) 


From (13.49) in the last section, we have 
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ae . 1+ 
dt Yat 
Combining (13.82) and (13.83), we get 


dt (ae d. (13.84) 
= xX . 
\2gy 


The sliding time for the bead can be evaluated by integrating (13.84) as 


re) Tx ae (13.85) 


Now we are searching for an optimum profile y(x) such that the functional T (in this 
case is the time of travel) being the minimum. Clearly, this is another problem that 
can be solved by using the calculus of variations. In particular, the integrand 
function F' can be identified as: 


Vv 


(13.83) 


FO, Y) Ae (13.86) 
b = J2ey . 


Recalling the Euler-Lagrange equation, we have 


]=0 (13.87) 


In particular, using (13.86) we have 


OF t aig? 
= (13.88) 
Cy = 2gy 2y 


d as é (<= dy | é oe 
dx oy’ Oy Oy’ dx Oy’ oy’ dx 
a , r (13.89) 
l , re i by 
\2ay 2yVJl+y” J2ey yl+y” (+y?y? 
Using (13.88) and (13.89) in (13.87), we have 
—Dy+y/7)4 7? = ay. =0 (13.90) 
l+y’ 
Equation (13.90) can be simplified as 
2yy"+1+y? =0 (13.91) 
Similar to the catenoid problem, we can reduce the order of the ODE by using 
Vv=p (13.92) 
In view of (13.92), we can reduce (13.91) to a first order ODE as 
ae ae =0 (13.93) 
dy 
This is a separable ODE and can be integrated readily to give 
In y =—In(1+ p*)+Inc (13.94) 


Taking the exponential function, we find 
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Wl+y"J=q 
This can be rearranged as 

Sa ia 

dx y 


To integrate (13.96), we introduce 
y=c,sin’?¢, dy=2c,singcosddd 
Finally, (13.96) is reduced to 


2 
dy |¢ ed @ = 6oeg 
dx \csin* ¢ 
Combining (13.97) and (13.98), we have 


2c, sin’ ddg = dx 
This can be integrated readily as 


x= 2p-sin2) +e, 


At point P, the initial condition of the sliding bead can be formulated as: 


x=0,t=0, y=0,¢=0 
Thus, we have c2 = 0. To simplify the presentation, we define 
0=2¢ 
Then, the solutions of x and y in terms of @are 


Cy . 
=—(d-sin0 
x 7 sin 0) 


C 
=—(1-cos@ 
y (1—cos 0) 


Finally, at Point O shown in Fig. 13.5, we have the condition 
O0=7,y=2a 


(13.95) 


(13.96) 


(13.97) 


(13.98) 


(13.99) 


(13.100) 


(13.101) 


(13.102) 


(13.103) 


(13.104) 


(13.105) 


The final profile of the brachistochrone can be expressed in a single parameter @ as 


x =a(6-sin@) 
y =a(l—cos@) 


Figure 13.6 The locus of a cycloid 
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x =a(@-sin@) (13.106) 
y =a(l1-cos6) (13.107) 
The profile of this parametric curve is also known as a cycloid. Physically, it is also 
the locus of a point on a rolling wheel, as illustrated in Figure 13.6. Note however 


from Figure 13.5 that the y-axis of the brachistochrone is going downward. Thus, the 
solution of the brachistochrone is actually an inverted cycloid. 


13.9 TAUTOCHRONE 


Although the tautochrone problem is not a problem that needs to be solved by using 
the calculus of variations, it should be considered together with the brachistochrone 
problem. In particular, Huygens in 1673 discovered that no matter where we start to 
slide the bead along the cycloid, it always takes the same amount of time for the 
bead to slide from the starting point to the bottom. This is indeed an amazing result. 
To show this, we differentiate both (13.103) and (13.104) to give 


dx 

— =a(l-cos0 13.108 

70 ( ) ( ) 
OY aang (13.109) 
do 


Using these results, we find the increment of the curve of the cycloid as 


Ca + ey =a°[(1-2cos0 +cos* 8) +sin” 0] 


(13.110) 
= 2a’ (1—cos 6) 
Recalling from (13.83), we have 
v= SF = Pay (13.111) 


The travel time can then be written 


[a2 2 = 
ae ds _ dx +dy" _ a,[2(1—cos@)d0 © 4 40 (13.112) 
§ 


\2ev Jey {2ga(1—cos 0) 


Since the vertical drop from the top of the cycloid to the bottom is 2a, as shown in 
Figure 13.5, we have the initial and final conditions from (13.94) as 
t=0, y=0, 0=0 (13.113) 
t=7T, y=2a, O=7 (13.114) 
Thus, (13.112) leads to the travel time from the top of the cycloid to the bottom 


7=[° fia Ie (13.115) 
oVeg g 


Next, we are considering the drop of the bead from any initial height, which is 
measured yo from the top of the cycloid as shown in Figure 13.7. The coordinate of 
the sliding motion is given by 


Vo = a(1—cos 6) (13.116) 
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The velocity is accordingly revised as 


v= 5 = Pay) (13.117) 


Note from (13.117) that we must have y > yo and at the starting point we have y = 
yo or v= 0. Combining (13.116) and (13.117) yields 


2q- /2(1—cos 6) 
t =|" arto) a TOY ap 030) 
% .J2ga(cos A — oe 4 \| cos Oy —cos@ 


Using the following identities for cos0 


fa) | _ 
£086) ~ 6080 = Yeos*(L)—cos"(S)], sin = a (13.119) 


we can convert (13.118) to 


eh oN (13.120) 


cos Joos? (0 /2)—cos? (0/2) /2)-—cos 26/2) 
To evaluate this integration, we introduce the following change of variables 
_ ey Se sin(@ / 2) do (13.121) 
cos(G / 2) 2cos(G / 2) 


The travel time is reduced to 


arto 
E=2 “| du (13.122) 
: iB " Vl-w? 


Finally, we can introduce another change of variables as 
u=sina (13.123) 


Equation (13.112) is reduced to 


m/2 
A 2) a= [en (13.124) 
g40 g 


This is precisely equal to (13.115). Therefore, we have established the validity of 
the tautochrone, which means “equal time” in Greek, with “tauto” for same and 
“chrone” for time. Figure 13.7 illustrates that although different beads are starting 
from various y, they all arrive the bottom at the same time. 


x x 


Figure 13.7 Illustration of the tautochrone problem at ¢ = 0 and t= ¢, 


Calculus of Variations 789 
13.10 LAGRANGE MULTIPLIER 


Sometimes we have to minimize a functional subject to certain constraints. This 
problem can also be considered by the calculus of variations by using the Lagrange 
multiplier. This technique was proposed by Lagrange in 1788. In particular, we 
want to consider the stationary value of the following functional: 


b 
f= | F(t,z,z)dt (13.125) 
which is subjected to a constraint that 
b 
I G(t,z,2)dt =C (13.126) 


where C is a constant. The superimposed “dot” implies a derivative taken with 
respect to t. We can formulate the functional in terms of A which is defined as: 


b b 
i= [F(@,2,2) +AG(6,2,2)]dt = | A(t,z,Z)at (13.127) 


where A is called the Lagrange multiplier and A is now the new Lagrangian. 
Following the same procedure of deriving the Euler-Lagrange equation shown in 
Section 13.4, we have 


an ce (13.128) 


We will apply the Lagrange multiplier technique to the isoperimetric problem in the 
next section. 


13.11. DIDO’S PROBLEM (ISOPERIMETER) 


According to legend, Dido arrived in Tunisia in 814 BC with her entourage after a 
power struggle with her brother at Tyre in Lebanon. She requested a piece of land 
and founded the city of Carthage. The land given to her could only be enclosed by 
a bull’s hide, so she smartly cut the hide into long thin strips and used it to embrace 
a circular piece of land. Dido eventually became the first queen of Carthage. 
Therefore, the optimum shape of a closed curve that can enclose the greatest area is 
a circle. 


Isoperimetric 


x 


Figure 13.8 Dido’s problem of greatest covered area 
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Therefore, the problem of Dido is to find the greatest area being enclosed by a 
closed curve of a fixed length. Recall from (13.48) that the length of the curve can 
be written as: 

(ds) = (dx)? +(dyy (13.129) 
Writing the increment along the curve ds in terms of a time parameter f: 


ds =|? + dt (13.130) 


where the superimposed dot implies differentiation with respect to ¢. Thus, the 
length of the perimeter can be expressed as 


L=qh i? +i? at=p (13.131) 


where p is a fixed constant. To express the area covered by the closed curve, we 
recall Green’s theorem that 


J (fax + gdy) = las OF pity (13.132) 
Taking the following ae of fand g 
y xX 
SS % = 13.133 
[a= 5e 2° 5 ( ) 
we have (13.132) becoming 
= =f (xdy — ydx) = aed (13.134) 


a 
Making a closed curve, we arrive at the following enclosed area 


[P< 2: 
Au xf (xj — yx)dt (13.135) 


The functional in (13.135) and the constraint (13.131) can be combined using the 
Lagrange multiplier as 


es ee ae 
I= Pac, x,y, pdt =P Oa yi) + Ay? +y" }dt (13.136) 


Therefore, for the present case of two dependent variables, we have the Euler- 
Lagrange equations as 


OA doa 
—-—[—]= 13.137 
ox aan! ( ) 
Oh Og (13.138) 
oy dt oy 
Using the definition of Lagrangian A defined in (13.136), we find 
Se (13.139) 
Ox 2 ete ox 2 
OB edgy o_o (13.140) 
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Substitution of (13.139) and (13.140) into (13.137) and (13.138) leads to the 
following Euler-Lagrange equations 
d,. 1 A3 


x 1. 
+ —y=0 13.141 
d = Ay ple 
_ 13.142 
dt I “Te + ‘2° 
Integrating with respect to ¢, we immediately obtain 

je (13.143) 

eae 
er (13.144) 

ay 


These two equations can be combined to give 

(x-a)’ +(y—by =27 (13.145) 
This is an equation of a circle with radius 4 at center (a,b) and thus physically the 
Lagrange multiplier is the radius. Using the constraint in (13.131), we have the 
Lagrange multiplier and the enclosed area being 


2 


i=, aa (13.146) 
Qn’ An 


As expected by Dido, the optimum shape is indeed a circle. 


13.12 GEODESICS 


In this section, we will present the formulation for finding the geodesics, or the 
shortest curve between two given points on a given surface. Mathematically, the 
equation of the three-dimensional surface can be expressed as: 

G(x, y,z) =0 (13.147) 


Introducing a time parameter ¢, the arc length between any two points a and b can 


be evaluated as 
b b 
r= S(&),2)at =| Jr ti? +22d¢ (13.148) 


where 
. ad . dy . a 
eee y= he Z= zr (13.149) 
The arc must be on the surface G = 0, thus we can impose a relation between the 
velocities of the variables. First, we can rewrite 
z= g(x(t), y(d)) (13.150) 
Differentiation of z in (13.150) gives 
Pag.ttg,y (13.151) 


Taking a second derivative of (13.151) with respect to x gives 
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a oO : . : . 
—=—(g,F4+ = 2X42, 13.152 
A a ByV) = Bg Xt SyV ( ) 
On the other hand, we observe that from the chain rule 
d . . 
Her) 7 Sak t Ba) (13.153) 
Comparison of (13.152) and (13.153) gives the following identity 
a d 
—=— 13.154 
a ai (Sy) ( ) 


If we follow the same procedure in obtaining (13.152) to (13.154) but taking 
differentiation with respect to y, we get - following identity 


Oz 
g 13.155 
a ad L(g ) ( ) 
Let us now rewrite our functional by eliminating the time derivative of z in (13.148) 
b b -: . - 9 
r= F(x, y, at = | P+ +(g,k+9,5) at (13.156) 


Referring to Section 13.6, the corresponding Euler-Lagrange equations for a two- 
variable system are 


OF d_ OF 
— -—(—)=0 13.157 
ode oes 
ae cai wer (13.158) 
Oy dt oy 
The first term on the left of (13.157) can be evaluated as 
OF of 0 : .. Of . . 
pee mane + = + 13.159 
ao ea oe 8yy) a (2.5% + 859) ( ) 
In view of (13.151), (13.159) can be simplified to 
OF of Oo ‘ . Of & 
ese 2 + = 13.160 
Be GE oe Bt OD) Be oe 


In parallel to this development, we can repeat the procedure for the first term of 
(13.158), and we eventually obtain 


OF _&Ff 0 
ay & a * 
For the second term of (13.157), we have 
fa eee ae 
Ox Ox OZ Ox Ox 
The last part of (13.162) is obtained in view of (13.151). Taking the time derivative 
of (13.162), we pp at 


+g,y)= a“ (13.161) 


(13.162) 


dO, dof ad ¥ d of, ad of d 
dt a “at at one a oe at Eee at © (13.163) 
sft CaS | 


dt Ox dt O& Oz Ox 
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The last line of (13.163) results from the substitution of (13.154). Finally, (13.160) 
and (13.163) can be substituted into the first Euler-Lagrange equation given in 
(13.157) to give 


d of of 
0 13.164 
rare )4 12. (2) ( ) 
On the other hand, we can deal with the second term of (13.158) in a similar 
manner; we get 
d OF. doa dg Of Oz 
££ DL), 44 Fe 
dt Oy dt oy dt @ Oz Oy 
Finally, combining (13.161) and (13.165) into the second Euler-Lagrange equation 
given in (13.158), we obtain 


(13.165) 


dof d of 


= 13.166 
dt GS ee dt Oz )= ( ) 
Now, we apply the most crucial step in our analysis in which we set 
a 
“(4)- At)G (13.167) 


where G = 0 has been defined as our a of consideration in (13.147) and A(A) 
plays the role of the Lagrange multiplier (we will show this in a short while). We 
note from definition (13.147) that 


G,dx + G,dy + G,dz =0 (13.168) 
which can be rewritten as 
dy dz 
G1 G, +G, “ =G, +G,g,=0 (13.169) 
: dx * dx 


Note that dy/dx = 0 since x and y are two independent variables. Or equivalently, 
we have 


2 —_ 7 13 . 170 
x G, ( ) 
In similar fashion, we can also have 
=-— (13.171) 
By =~ G 


Employing (13.170), (13.171) and (13.167) in the special form of Euler-Lagrange 
equations (13.164) and (13.166), we finally get 


d of 

a = At)G (13.172) 
of 

7 roe A(t)G, (13.173) 


By eliminating 4 from (13.167), (13.172) and (13.173), we have 
2 .0f are ee Ff) 
A(t) = ato = dt Oy _ dt é (13.174) 
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Finally, recalling our arc length problem defined in (13.148), we can simplify 
(13.172) to 
fi-f _f9-if _ ft-F 
Gf? Gf Gf? 
This is the final governing equation for our geodesic problem. 


(13.175) 


Example 13.1 Re-derive (13.174) by combining the Lagrange multiplier method 
discussed in Section 13.10 in conjunction with the functional of several variables 
discussed in Section 13.6. 


Solution: The functional with constraint can be formulated by combining (13.147) 
and (13.148) as: 


b 
I= {£65.24 ADGRO, vO, 2)}al 
(13.176) 
=[ AGiny.j.2.2d0 


where A(¢) is the Lagrange multiplier. 
Applying the Euler-Lagrange equation for three variables, we have 


eG 13,177 
ox dt i ( ) 


— -——[—]=0 (13.178) 


0 (13.179) 


Substitution of A defined in (13.176) into (13.177) to (13.179) yields 
406 _4 Fg 


- (13.180) 
Ox dt Ox 


A—-—[=]=0 (13.181) 
X 


A—-——[—]=0 13.182 
ax dita! ( ) 

Thus, we have 
a « a5 a 
don dt OY a 
Aj= 4 & -F OF 4 
G G G 


x yv Zz 
This completes the proof. Therefore, we see that the physical meaning of A(t) 
introduced in (13.167) actually plays the role of Lagrange multiplier. It links the 
original functional with the constraint of the problem to form the new functional for 
minimization. Note thar the Lagrange multiplier method is much simpler than the 
intuitive step employed in (13.167). 


(13.183) 
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Example 13.2 Find the shortest curve between two points 4 and B on a plane. The 
arc length is defined as 


1=[" Fo)de= |" tye (13.184) 
x x 


The problem is shown in Figure 13.9. 


y B=(x) y2) 


x 
Figure 13.9 Shortest curve between points A and B on a plane 


Solution: For this problem, the Lagrangian is identified as 


F(y"')=yl+y? (13.185) 


Oh, Se (13.186) 


oy oy fi+y? 


The Euler-Lagrange equation requires 


Thus, we have 


d y’ 
si a P| (13.187) 
ax h+y? 
Integration of (13.187) gives 
——, (13.188) 


fl+ yl” 
Rearranging (13.188), we obtain 


yl =eVl+y” (13.189) 


Squaring both sides and solving for y’, we get 


| 2 

Cc 

y =,/— > (13.190) 
L—¢* 


This can be integrated immediately to give 
2 
y= = =X +e, = mxtey (13.191) 
—¢ 


This is an equation of a straight line. Therefore, the shortest distance between two 
points is a straight line joining them, as expected. 
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However, if we want to minimize the surface area formed by the revolution of 
a curve between two points A and B shown in Figure 13.9, one is tempted to 
speculate that it must be a revolution of the straight line obtained here. Sometimes, 
we have to set aside our intuition and to rely on mathematics. In fact, we find that 
the answer is not a straight line but a hyperbolic cosine. This is the catenoid 
problem that we discussed in Section 13.7, and the minimal area of revolution is 
not formed by a straight line. 


Example 13.3 Consider the shortest curve between two points A and B on a sphere 
defined by: 

G(x,y,z)=x +y*+2°-r? =0 (13.192) 
where r is the radius of the sphere shown in Figure 13.10. This problem is also 
known as the Columbus problem because it is related to the shortest path in 
navigation on a voyage. 


Figure 13.10 Shortest curve between points A and B on a sphere 


Solution: Differentiating (13.192), we obtain 


Gz=2x, G,=%y. G=2e (13.193) 
Subsequently, (13.175) is reduced to 
fx-xf _fy-yf _f2z-7 (13.194) 


Daf? 2yf? 22f? 
Rearranging terms in (13.194), we get 
ft ee (13.195) 
ff  yx-xy yz-zy 


Note that 
d(yx — xy) 


- = Ye +X — yx xy = yx -—xy (13.196) 
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BOD) 3a (13.197) 
In view of these identities, (13.195) can be integrated as 
In(yx — xy) = In(yz - zy) +C (13.198) 
Taking an exponential function on both sides, we get 
yx — xy = C,(yz-zy) (13.199) 
This can be further rearranged as 
HUE oe (13.200) 
x-Qz yp 
Or equivalently it can be written as 
aloe ad (13.201) 
x-Cz y 
We can integrate both sides one more time to get 
In(x—C\z) =Iny+C* (13.202) 
Thus, the geodesics must be on the following plane 
x-C\z=Cyy (13.203) 


This is an equation for a plane passing through the origin. Thus, the geodesics must 
be on the intersection between the sphere and the plane through the origin as shown 
in Figure 13.11. This result was first obtained by Euler in 1755. This is the reason 
why the shortest path of flight from Hong Kong to Los Angeles is not flying over 
the Pacific Ocean via Hawaii but instead flying over Alaska. 


Figure 13.11 Great circle as the shortest curve between points A and B on a sphere 
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Example 13.4 Find the shortest curve between two points A and B on the surface of 
a circular cylinder of radius a. The cylindrical coordinate is (7,@z). 


Solution: The curve length on the surface of a circular cylinder is 

(ds)? =(dr)* +(rd0)? +(dzy (13.204) 
On the surface of the cylinder, we have r = a. Thus, we have dr = 0, and (13.204) 
becomes 


ds 2 2 dz 2 
—)° =a +(— 13.205 
( 7 my a” +( 7 D ( ) 
Using (13.205), we have the curve path 


ds = fa’ + (<0 (13.206) 


The curve length between two points A (a, @, 2:1) and B (a, @, zz) is 


_f% | dz.o 
s=J, la (yao (13.207) 


Clearly, we want to minimize the functional s for the shortest curve. Thus, we 


identify that 
2, (42,2 ' 
= +(—) =F 13.208 
{4 CP (z') ( ) 


Note that F is independent of z and 0, and thus we have case (iii) in Section 13.5 or 
the Euler-Lagrange equation becomes 


d*z 
==0 (13.209) 
do 
Or, the shortest curve is 
z=CO+C,, r=a (13.210) 


This is the equation of a circular helix. The problem and its solution in terms of the 
helix are illustrated in Figure 3.12. Problem 13.14 gives the formulas for z and s. 


a op 
eas 


Figure 13.12 Shortest curve between points A and B on a circular cylinder 
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13.13. CATENARY AS A SAGGING ROPE 


When a rope or chain is hanging between two supports, nature chooses the shape of 
the hanging rope by allowing its center of gravity at its lowest possible position. 
The shape is called catenary. The functional / in this case is the y-coordinate of the 
center of gravity together with the constraint G of the length of the hanging rope: 


[ove y?ax 


[yx] = ——— (13.211) 
[- 1+ y? dx 
Glvol= [ l+y?de=L (13.212) 


Note that the numerator is essentially the same as that of (13.52) of the catenoid 
problem. The denominator becomes a constant by virtue of the constraint given in 
(13.212). Thus, we are minimizing the numerator of (13.211) and the constraint in 
(13.212) by the Lagrange multiplier, and we have 


r=] yity? + Aafl + yd (13.213) 


In other words, we have 


A=yylty? +aVi+y? =(+ Dyl+y? (13.214) 
which is not a function x. This corresponds to Case (i) in Section 13.5 and thus the 
Euler-Lagrange equation given in (13.28) becomes 

Ae (13.215) 
oy 
Substitution of (13.214) into (13.215) gives 


(y+ A fl+y? -y'(v+4) 


Rearranging (13.216) gives 


’ 


a? 
l+y” 
(yt A) =k Jt y” (13.217) 


The following change of variables is introduced: 


y= 2 = sinks (13.218) 


xX 


(13.216) 


Using (13.218), (13.217) becomes 
(v+A) =k, cosht (13.219) 
Combining (13.218) and (13.219), we have 
_ dy _ k,sinhtdt 
y' sinht 


dx 


= kat (13.220) 


Integration of (13.220) gives 
x=kt+k, (13.221) 
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Therefore, back substitution of (13.221) into (13.219) yields the hyperbolic cosine 
function as the catenary: 

x—-k, 
ky, 
For the values of ki, k2, and A, we have to satisfy the boundary condition of the 

hanging rope. 

Alternatively, (13.222) can be obtained using a mechanics approach. In 
particular, Figure 13.13 shows a particular case of a hanging rope with two 
supports at different elevations and the mass per length of the rope is assumed as 
(kg/m) and the horizontal component of the tension in the rope is 7p. For this case, 
ky, ko, and A in (13.222) can also be determined with the boundary conditions given 
in Figure 13.13. 


(y +A) =k, cosh( ) (13.222) 


Figure 13.13 Boundary conditions of a hanging rope 


A small rope of length ds can be cut out as a free body and the forces applied on this 
segment are shown in Figure 13.14. The vertical and horizontal force equilibriums 
give 


(T + dT)sin(@+d6) =T sin@+ yds (13.223) 
(T + dT) cos(0+d@) =T cos@ (13.224) 
By neglecting the higher order terms, we have 
d(T sin @) = uds, d(T cos@) =0 (13.225) 
The second equation of (13.225) gives 
Ty) =T cos 0 (13.226) 


Physically, (13.226) shows that the horizontal component of the tension in the rope 
is a constant because there is no net horizontal force applied on this rope element. 
Then, substitution of (13.226) into the first of (13.225) gives 


2 Giese (13.227) 
ds 
Note that the slope is defined as 
mos” (13.228) 
dx 


Thus, we can rewrite (13.227) as 
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d a _h (13.229) 
ds dx Ty 
ty 
x+dx 
‘ T+daT 
ae é +40 
a 


Lids 
Figure 13.14 Shortest curve between points A and B on a circular cylinder 


This can be integrated with respect to s leading to 


a oe 6 (13.230) 
dx I 
On the other hand, we can sa the chain rule to  Y 
d_ dy. d,_dy 
= 13.231 
eae - ra i os Fe ae ( ) 
This can be rearranged to give 
2 
ie een al (13.232) 
dx? To dx 
In view of the following identity, 
ds dy 2 
—=,/l+(— 13.233 
es ( Pe, ( ) 
we can rewrite (13.232) as 
2 
EF ee (13.234) 


dx” Ty dx 
Since the differential equation does not depend on y and x, we can apply the 
standard rule of reduction of order as: 
_ ay 
dx 
Substitution of (13.235) into (13.234) results in 


(13.235) 
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@ 150 oe (13.236) 
Q | 1+ Dp 1) 
Integration on both sides gives 
sinh! p=“ x+C (13.237) 
1) 
Alternatively, this can be rewritten as 
dy wn fl 
— = p=sinh(—x+C 13.238 
rue ( 7 ) ( ) 


0 
The zero slope condition at the origin shown in Figure 13.13 gives C = 0. 
Integration with respect to x instantly gives 


ie 
y= cosh() +k (13.239) 
Hu 1 


The zero slope displacement at the origin shown in Figure 13.13 gives K = —To/u. 
Finally, we get 


T 
y=feosh(“)-1] (13.240) 
Hu 1 
Comparison of (13.240) with (13.222) gives 


ip 
are ky =0 (13.241) 


in (13.222). We can see that the Lagrange multiplier plays the role of normalized 
length in terms of rope tension divided by mass per length. 

The catenary appears naturally in nature in the form of a hanging spider web, 
of hanging rope, and of hanging chain. We have seen that it also forms the catenoid 
of soap film spreading over two rings. Many man-made structures were inspired by 
the catenary. The most vivid example is the inverted catenary of the Gateway Arch 
built in St. Louis, Missouri, USA. It stands 192 m above ground level. The vault at 
the Casa Mila, Barcelona, Spain and vault at Ctesiphon, Iraq are also inverted 
catenaries. The vault of Ctesiphon stands 37 m above ground level and is believed 
to be the tallest vault in a structure in the world. As a side note, the trademark of 
McDonald’s is also made of two inverted catenaries. 


13.14 NEWTON’S PROBLEM OF LEAST RESISTANCE 


13.14.1 Introduction 


In 1685, Newton included in his celebrated Principia the problem of minimum 
resistance on a solid of revolution moving in a rare gas modeled as non-viscous 
flow, regarding the optimum shape of the solid of revolution (Newton, 1685). 
Tacitly, Newton assumed that the section of the solid must be of convex shape and 
the body must be axisymmetric. The gas flow is assumed so rare and dispersed that 
resistance of the gas flow on the solid can be considered as particle impact. Thus, 
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gas flow is assumed as corpuscular flow. There are also no interactions between 
these globally evenly distributed-particles. The assumption of convexity of the 
solid surface ensures that there is only a single impact of these particles on the 
body. Newton claimed that this problem might be useful in ship design. One main 
problem is that there is no proof of Newton’s optimum shape reported in Principia. 
This problem was considered by Huygens and David Gregory. The rigorous proof 
of Newton’s solution was given in 1902 by Kneser. Nevertheless, it is commonly 
believed that Newton’s problem is one of the first problems of its type that 
prompted the development of the calculus of variations (isoperimetric problems are 
other examples of such problems). 

It turns out that, instead of its use in ship design, as originally proposed by 
Newton, Newton’s formulation was found applicable to bodies traveling at high 
supersonic speed in air, such as missiles. In particular, the so-called Newton’s 
cosine-square law for resistance (which will be discussed later in this section) was 
coincidentally obtained for the pressure coefficient when Riemann’s shock wave 
conditions are taken into consideration. Thus, Newton’s model was found 
applicable to supersonic flow. 


13.14.2 Newton’s Sine-Square or Cosine-Square Law of Resistance 


Newton formulated the impact-induced resistance on a solid with a spherical 
surface in terms of the resistance of the normal impact on a flat surface. It turns out 
that the formulation is general for any curved surface of revolution (Goldstine, 
1980). In particular, Figure 13.15 shows the impact force on a flat cylindrical 
surface as well as on a curved surface of revolution. The resistance on normal 
impact is assumed as f. The projection of the force falong the inclination of the line 
drawn from the center of the projectile is f cos@ A second projection of this 
inclined force on the curved surface of the projectile is 


fcos@sind /fcosé f CO80 
6 dx 
eeueseatuy f dy 
— JU, 
Cylinder 
with flat 
surface 


Figure 13.15 Illustration of the Newton’s cosine-square law 


fi =f cos’ O= f sin? 5 (13.242) 
| 


Physically, this is the force of impact on the curved surface along the flow 
direction. This is normally referred to as the sine-square law or cosine-square 
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resistance law of Newton. Note that the force component normal to the flow 
direction is 

ft, =f cos@sind (13.243) 
However, by symmetry this force perpendicular to the flow will be cancelled out. 
Therefore, the net resistance force on a segment of the curved surface is 


dF, = f cos” Ods (13.244) 
Therefore, the total resistance force on the curved surface is 
R= anf” ycos” Ody = 2af |” ycos* Ods (13.245) 
YI | 
From Figure 13.15, we see that 
d 
coso = (13.246) 
ds 
Consequently, (13.245) can be expressed as 
R=2a rf” non i dy (13.247) 
Note that the curved segment is given by 
(ds) = (dx)? +(dyy (13.248) 
This can be rewritten as 
Byp — (13.249) 
ds a ax y 
dy 
Substitution of (13.249) into (13.247) gives 
R= anf |” — (13.250) 
ma +(e fay 
dy 
By noting 
HD ae (13.251) 
dx 
we can recast (13.250) as 
R= anf |?» dx (13.252) 
x 1+ y" 


Another form of (13.250) can be formulated by assuming a parameter ¢ such that 
x=x(t), y=y(t) (13.253) 
With this parametric form, we have 


go gue, goles 2=* (13.254) 
dt dt dy y 
Substitution of (13.254) into (13.252) a 
“3 
R=20f(? at (13.255) 


a x +y° 
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Note that all (13.250), (13.252) and (13.255) are equivalent and all of them have 
been adopted in the literature. The next question is to minimize the resistance R, 
and we are going to see that finding its solution is not straightforward. 


13.14.3 Newton’s Resistance Law for Supersonic Flow on Solid 


Although Newton was thinking about ship design when he proposed the impact 
theory discussed in the last section, it was subsequently found applicable to 
consider the minimal drag on bodies of revolution at high supersonic speed when a 
shock wave was formed on the surface of the bodies as shown in Figure 13.16. 


Shock wave 


Figure 13.16 Resistance on supersonic flow 


According to Eggers et al. (1953), the pressure coefficient is defined in terms of 5 
as: 

c, =f» = sin? 5 (13.256) 

Yoo 

where p and q are the static and dynamic pressure and subscript “oo” implies values 
at infinity or far from the body of revolution. The angle 6 is the angle in the 
meridian plane between the free stream and the tangent to the body surface. When 
the curvature of the body is small in the stream direction, the hypersonic layer will 
be thin. Subsequently, (13.256) can also be used to estimate the pressure 
coefficient, and thus the pressure drag, on the surface of the body. This formula is 
generally acceptable for the case that the hypersonic similarity parameter K is 
greater than one: 


K=M,">1 (13.257) 


where d and / are the diameter and length of the body, and Mx is the Mach number 
at far field. When the pressure coefficient over the body is known, by neglecting 
the base drag at the far end, the pressure drag of a body can be integrated as: 


C i d/2 ! 
Kota O 279, [ Cpydy = 274, |, C,y'de (13.258) 


To simplify the presentation, we can also define a drag parameter as 


D= 
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D i 
Ip= =| C,yy'de 13.259 
D nd. 6 pyy ( ) 
Substitution of (13.256) into (13.259) gives 
! 
78 } sin? Syy'dx (13.260) 
0 
To find sind, we note from Figure 13.16 that 
(ds)? =(dx)’ + (dy), sind = - (13.261) 
s 
Combining these results, we get 
(2 
sin? 5 = —4xX_=_¥ (13.262) 


12 
dx 


Substitution of (13.262) into (13.260) gives 


l 2yy? 

ins I = dx (13.263) 
Ol+y 

This is of the exact mathematical form of Newton’s law of resistance that we found 

in (13.252). Thus, Newton’s law of resistance is found applicable to hypersonic 

flow, and thus it is useful for missile shape design. 


13.14.4 Eggers et al. (1953) Parameter Solution 


Eggers et al. (1953) proposed that the drag parameter be modified to allow for any 
finite region of flat nose of radius y; and of infinite slope at the front. Thus, 
(13.263) can be modified as 


12 3 
Ip=y2 +( a2) dx (13.264) 
Ol+y’ 
Eggers et al. (1953) proposed to consider three different cases of constraint 
conditions. We will, however, restrict our discussion to the case of a given length 
and base diameter (case (a) in their report). In particular, we have 
B 
pal : 
l+y’ 


Since the Lagrangian is not a function of variable x, the Euler-Lagrange equation is 
reduced to the Beltrami identity as: 
_OF 


ay! 


(13.265) 


y—-F=C, (13.266) 


Differentiation of (13.265) gives 
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OF  b6yy" 4yy" _ 2 
dy’ l+y? (ity?) (1+) 


t 14 
= Byy? + yy} 


, (13.267) 
2yy' rR 
== B+ 
Gy | 
Substitution of (13.267) into (13.266) leads to 
13 

mien =C (13.268) 
les) 


Note that we have allowed a non-pointed tip at the nose (such that (0) = y1), and 
the end points y; of the minimizing curve are not fixed yet. Thus, we have to 
impose another condition at the terminal points. Following from Courant and 
Hilbert, Eggers at al. (1953) imposed the following condition at y = 1: 


y 


d/2 
(0, y)) 


(0,—1) d/2 


Figure 13.17 The enlargement of the nose condition of the body of revolution 


OF dy 
oy’ dy 


( *) =< =2y——j5| = 0 (13.269) 
Thus, we find 

yO)=y =1 (13.270) 
The system of (13.268) and (13.270) can be solved in parametric form. In 
particular, we can rewrite (12.268) as 


12\2 
y= sate ae (13.271) 
4 oy 
Let us rewrite the first derivative of y as p 
Gdtpry dy _, 
= : =—= 13.272 
y= zi oe i ( ) 


Taking the total differential on both sides, we find 
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C, (l+p’ 3(1+ p? 
7 no = (13.273) 
_& U+p \p* -3) 
4 dp 
4 P 
Substitution of (13.273) into the second equation of (13.272) gives 
a 
pepe I SG, (13.274) 
Pp 4 Pp 
Integration of both sides leads to 
D702 
y= ( Ore MES) pace 
4 5 
P 
Cr,l 2 3 
= (—-+-p)ap +0, (13.275) 
Pp p 


‘34 i. 3 
=—l (in p+—-+—) +€ 
4 pap 


In summary, we have the coordinates of the curved surface in parametric form in 
terms of the derivative p as: 


C +e 
got p) 


13.276 
| = (13.276) 


x=“ (np+ +46, 
4 p 4p 


From the boundary condition at x = 0, we have 
yO)=y, p(0)=1 (13.277) 
Using these boundary conditions, we have 


=e fat) (13.278) 


Substitution of (13.278) into (13.276) finally yields 
fo 7 1+ p’)? 
x=“l(np+—>+—;--), y=4 B 
4 p 4p) 4 4p 
By referring to (13.270), we can start with p = 1 for x = 0. Figure 13.17 suggests 
that we should decrease p (less than 1) as both x and y increase. 

In order to give a plot of the curved surface on the meridian plane, we plot on 
Figure 13.18 the profile that gives the minimal resistance based on Newton’s 
impact theory for least resistance for the case of //d = 3. In addition, in Figure 
13.18, we also plot the following profile of a 3/4 power law: 


(13.279) 


te (13.280) 
21 
Numerical results show that the solution given by the least resistance of (13.279) 
(solid line) is virtually the same as that of (13.280) (dashed line). We will 
demonstrate in the next section that (13.280) is indeed an approximate solution of 
the minimal resistance problem. 
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2y/d 


Figure 13.18 Profile of the surface with minimal resistance and of 3/4 power law for /d =3 


13.14.5 Approximation by 3/4 Power Law 


Let us consider the following approximation of (13.263) that y ' is small compared 
to unity (i.e., y'<< 1) 


I 
<2) dx = 2] wy? dx (13.281) 
0 


That is, we will only minimize the numerator of (13.263). For such approximation, 
we have 


Fry? (13.282) 
Using this F’, we get 
= =3yy”, ca i (13.283) 
oy oy 
Differentiation of the first term with respect to x gives 
d OF d 2 
=—3yy" =3y'+6yy'y" 13.284 
no a ( ) 


Substitution of (13.283) and (13.284) into the Euler-Lagrange equation given in 
(13.15) gives 
OF d OF 


Oy dx oy 
To further integrate (13.285), we multiply it by y’ to get 


=-2(y"? +3yy'y") =0 (13.285) 
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. , id d 
PR P= Oy 8 (13.286) 
Integration gives 
yy =C (13.287) 
This can be rewritten as 
dy 1 
+ =—~C 13.288 
dx yl 1 ( ) 
Integration one more time yields 
y =(C,x+C,)*" (13.289) 
The boundary condition can be written as 
yv(0)=0, vD=d/2 (13.290) 
These boundary conditions give the conditions as: 
4/3 
C; aS , C,=0 (13.291) 
Finally, we get 
d /X.3/4 
= (2 13.292 
: ( ; ) ( ) 


This gives the approximation given in (13.280) and this gives an excellent 
approximation for the optimum shape with minimal resistance, as shown in Figure 
13.18. As reported by Eggers et al. (1953), experiments conducted in the 
supersonic wind tunnel at Ames Aeronautical Laboratory for objects satisfying the 
power law with n = | (cone), 3/4, 1/2, 1/4, and an Ogive (an object with a roundly 
tapered end as shown in Figure 13.19) show that the shape with n = 3/4 indeed 
gives the minimal resistance. The experiments were conducted for Mach numbers 
ranging from 2.73 to 6.28. Figure 13.19 shows the experimental results obtained by 
Eggers et al. (1953). Thus, Newton’s impact theory for minimal resistance is 
verified for a body of revolution in supersonic flow. 


13.15 SUMMARY AND FURTHER READING 


The calculus of variations is a classical mathematical topic in applied mathematics 
and engineering. The historical development of the calculus of variations has been 
reported in excellent detail by Bliss (1925, 1930) and by Goldstine (1980). The 
book by Goldstine (1980) covered many technical details of its development. 
Readers are referred to them for more discussion of the calculus of variations. 
There are many good books on the calculus of variations, including Forsyth (1960), 
Goldstine (1980), and Weinstock (1974), just to name a few. Simmons and Krantz 
(2007) provided a very good elementary introduction to the calculus of variations. 
The history of isoperimetric problems was covered by Fraser (1992). 

In the 1990s, Newton’s minimal resistance problems were revisited by many 
researchers for the cases in which the body of revolution is non-symmetric and 
concave, which allow multiple impacts of the particles. However, this new analysis 
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depends heavily on differential geometry which is out of the scope of the present 
chapter. 


Figure 13.19 Pressure foredrag Cop for various shapes of solid of revolution (adopted from 
Eggers et al., 1953). 


13.16 PROBLEMS 
Problem 13.1 Find the function )(x) that minimizes the following functional: 
1x) = [= y?)dv (13.293) 
Ans: 
y(x) = C, sinx + C, cos x (13.294) 
Problem 13.2 Find the function )(x) that minimizes the following functional: 
I(y) = [ x60"? -y?)e (13.295) 


Ans: 
y(x) = CJo(x) + C,%(x) (13.296) 
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Problem 13.3 Hamilton’s principle is the most general fundamental principle for 
analyzing rigid-body mechanics. It can be formulated using the calculus of 
variations. More specifically, there are n generalized coordinates in the Lagrangian 
defined in the functional: 


b 
1) = FL. q, Oo 4y(0.44 Oo Gn (ODA (13.297) 


where 

L(t, q(t), In). 4 (0), J, (0) =T -V (13.298) 
The kinetic energy and potential energy of the system of n generalized coordinates 
are normally referred to as T and V. Find 
(1) The Euler-Lagrange equation 
(1i) The Euler-Lagrange equation for the case that L is not a function of time 
(ii) Consider the special case of the Lagrangian that T is quadratic in generalized 
velocity and V is a function of generalized coordinates only 

T= DY ay (G15 On) 4d) (13.299) 
i=l j=l 


V=V(Qqhs5n) (13.300) 


Show that the constant in the Beltrami equation in (ii) is the negative sign of the total 
energy. 


Ans: 

(i) ee (126 i=1,2,...,.0 (13.301) 
0g, dt 04; 

. . OL 

ll L- —=C 13.302 

(it) da 2, ( ) 

(iii) C=-(T+V) (13.303) 


Problem 13.4 Find the Euler-Lagrange equation for a particle in a conservative 
force field (e.g., gravitational field) by considering the minimum of the functional 


: 1 2 +2 +2 
(yx) = fons + mi? + mx2) —V(x,,Xp4X3) dt (13.304) 
Ans: 
d.. 
ris edad where r = x,e; (13.305) 


Problem 13.5 Find the Euler-Lagrange equation for the following functional 
1 
1(y(2)) = [J 5 +05 yay (13.306) 


Ans: 
V-u =0 (13.307) 
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Problem 13.6 Reconsider the shortest curve on the sphere by using polar 
coordinates: 


x=RsinOcos¢, y=RsinOsing, z=Rcosd (13.308) 
(i) Show that the corresponding functional in polar form is 
IQ) = RI [1+sin? o2)"}""a0 (13.309) 


(ii) Show that the shortest curve is on a great circle. 


Problem 13.7 Find the Euler-Lagrange equation for the following functional for 
beam bending 


EI d’w. 


T(W(x)) = fe ay qw]dx (13.310) 
Ans: 
d‘w 
EI = g(x) (13.311) 
dx* 


Problem 13.8 Find the Euler-Lagrange equation for the following functional for a 
circular plate bending under ae loading g(r) and fixed at the edges: 


1 dw, dw dw _2q 


T(w(x)) = Dal roey +— res Me rs ee noe (13.312) 


Ans: 


4 3 2 
iW, 5a ld‘w | 1 dw_ qr (13.313) 
dr* de ordr r dr D 


Problem 13.9 Find the geodesics on a right circular cone of semi-vertical angle o 
(see Figure 13.20). It is given that the differential of an arc ds on a right circular 
cone is given by 

(ds)? =(dr)* +(rd0)” +(rsinad¢)y” (13.314) 
If the vertex of the cone is at the origin and the z-axis is the axis of the cone, the 
polar equation of the cone is 

O=a (13.315) 

The functional is the arc length defined by 


-f & a sin” wide (13.316) 


a = cos(¢sina +C;) (13.317) 
rsing 


Ans: 
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Shortest curve 
On cone surface 


Figure 13.20 Shortest curve on cone between two points 


Problem 13.10 In the text, we have considered the catenoid problem, a case of 
minimal surface for soap film. The catenoid problem is actually a one-dimensional 
problem because of the surface of revolution (see (13.47)). In this problem, we will 
consider the problem of minimal surface of soap film formed in a closed wire loop 
defined by z = u(x,y). In particular, the functional of minimal surface is 


1=[[ fitus tu dedy (13.318) 


(i) Referring to (13.41) for a two-variable case, show that the Euler-Lagrange 
equation can be written as 


Find the following: 


OU, OU, 
x) 4 ~)=0 13.319 
ax F? ox F? ( ) 
where F is defined as 
= fl+uz +u;, (13.320) 


(11) Show that if the first derivative of u is small, the Euler-Lagrange equation 
becomes the Laplace equation (i.e., membrane equilibrium of small deflection is 
governed by the Laplace equation). 


(111) Show that the Euler-Lagrange equation for the minimal surface can also be 
written as: 
UU, =0 (13.321) 


2 2 
(l+u;)u,, +(+u;)u,, —2u,UjUy, 


x 


(Note that this PDE is extremely difficult to solve.) 
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Problem 13.11 Continued from Problem 13.10, if the domain D is given by a 
rectangle of size a and b. Show that a helicoid defined by 


u(x, y) = tan7'(~) (13.322) 
x 


is a solution for the minimal surface in a rectangular domain. 


Problem 13.12 Fermat’s principle of least time is a powerful tool in studying 
geometric optics. The least time can be formulated as functional and the path of 
light can be solved by using the resulting Euler-Lagrange equation. Figure 13.21 
shows the formation of a mirage in desert areas. The travel time by light path is 


T= far=fo=—fnat=—f foal (13.323) 


where v is the velocity of light in air and c is the speed of light in a vacuum and n is 
the index of refraction. In desert areas, the ground surface is much hotter than the 
air above the ground. The refraction index approximately increases with height, as 
shown in Figure 13.21. This is the mirage phenomenon that occurs in desert areas. 
The functional in terms of the path of travel can be set as 


fe [fot + x ay (13.324) 


Refraction index 
ny) =n (l+ay) 


Figure 13.21 Mirage formation and the path of the light with least travel time 


(1) Show that Euler-Lagrange equation leads to 


dx C 
= (13.325) 
vy UPw-c 
(ii) Show the solution for y is 
1 
Fem la eee (13.326) 
a an C 


where xo is, as shown in Figure 13.17, the center of symmetry. 
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(iii) Find the unknown constant in terms of yo and show that the final solution is 

1 (a@yo+l a(x- x, 
LOAD oy MM) 

a a ayy +l 


y= ] (13.327) 


(iv) Show that for small a we can approximate the hyperbolic solution given in 
(13.327) as a parabola: 
A(X — Xq ‘lg 


= 13.328 
Y=Vo Way +1 ( ) 


Problem 13.13 Find the Euler-Lagrange equations for the following functional: 


b 
[ =| [F(t,x,x, y,y) + AG(t, x, x, y, y) lat 


; (13.329) 
- A(t, x, x, y, y)dt 
Ans: 
ae ae, (13.330) 
Ox dt Ox 
A A 
eh ect (13.331) 
oy dt oy 
Problem 13.14 Continue the calculation of Example 13.4. 
(i) Find the equation of the shortest path joining points A and B; 
(ii) Find the distance s between points 4 and B. 
Ans: 
pap, 2 (13.332) 
0, -4, 0, -G, 


s= fa? (0,-4) + -2" (13.333) 


CHAPTER FOURTEEN 


Variational and Related Methods 


14.1 INTRODUCTION 


The idea of using the variational principle (or the minimum energy hypothesis) can 
be traced back to Archimedes, Aristotle, and Galileo. Variational methods have 
been developed since the time of the development of the calculus of variations. The 
calculus of variations developed mainly by Euler and Lagrange aims to maximize 
or minimize some functionals, such as Dido’s problem of maximizing area, 
brachistochrone (path of quickest descent), tautochrone (optimum curve for sliding 
bead on wire), geodesics (shortest path on surface), and catenoid problems 
(minimum area of revolution of soap films). The reason why nature chooses to 
minimize or maximize certain quantities has puzzled the greatest philosophers, 
scientists, and mathematicians. In view of this, Fermat formulated his principle of 
least time for optics, Maupertuis formulated the principle of least action, and 
Hamilton formulated his principle (the path of any motion minimizes the integral of 
the difference between kinetic and potential energies over interval of time). In 1760 
and 1761, Lagrange developed the principle of virtual work and the Lagrange 
multiplier in the context of variational mechanics. The principle of least action had 
been discussed by Euler in 1744 for the case of column buckling, and by Leibniz in 
1705, although credit is normally given to Maupertuis. All these principles can be 
considered as some kind of variational principle. Hamilton’s principle is 
considered the most general of all, linking all phenomena in mechanics, optics, 
gravitation, electricity and magnetism, and quantum mechanics by a single integral 
(Kline, 1959). The development of the functional includes Legendre’s work on 
distinguishing maxima and minima, and Jacobi’s and Weierstrass’s work on 
existence of extrema in a functional. 

Our discussion in this chapter focuses on minimization or maximization of 
functionals and their relation and application to solving problems of differential 
equations. In the last century, variational methods have been associated with the 
formulation of differential equations for physical phenomena, and associated with 
approximation methods such as the finite element method. In solid mechanics, the 
Veubeke-Hu-Washizu (VHW) principle and Hellinger-Reissner (HR) principle are 
of fundamental importance in finite element formulation. Both of these are 
important variational principles in solid mechanics. The VHW variational principle 
was apparently formulated independently by Veubeke in 1951, by Hu in 1955, and 
by Washizu in 1955. Although both Veubeke and Washizu were visitors at MIT 
(Massachusetts Institute of Technology) in 1952, they never cited one another. The 
HR variational principle was studied by Hellinger in 1914 and by Reissner in 1950. 
The HR variational principle can be considered as a special case of the VHW 


818 Theory of Differential Equations in Engineering and Mechanics 


principle. Interestingly, Reissner was a professor at MIT when both Veubeke and 
Washizu were visiting. The starting functional normally possesses the physical 
meaning of some kind of energies. Alternatively, numerical methods can also be 
formulated in terms of integrals with the kernel being the product of the differential 
equation and some weighting functions, which is somewhat arbitrary as long as 
they are admissible kinetically. This method is known as the weighted residue 
method, which is closely related to the variational principle. The most notable 
weighted residue methods is the Galerkin method, whereas the most notable 
variational methods is the Rayleigh-Ritz method. Incidentally, both Galerkin and 
Ritz aimed to provide approximate solutions of problems of plate bending in 1915 
and 1908 respectively. Both beam and plate bending problems will be considered 
as examples. 

This chapter introduces the fundamental concepts and ideas of the variational 
principle and its associated methods. More references will be given in the summary 
section for more serious readers. 


14.2 VIRTUAL WORK PRINCIPLE 


The idea of virtual work can date back to the time of Bernoulli, and it is based on 

the conservation of energy. Implicitly, work done by frictional forces or other 

irreversible processes are neglected. Let us consider the equilibrium of a body V 
and the corresponding prescribed boundary conditions (S = Sa+S,): 

V-o+f=0, in V (14.1) 

o-n=T, onS,, u=u, onS, (14.2) 

where o, f, T and n are stress tensor, body force vector, traction vector, and unit 

normal vector to the surface of the body S. The superimposed bar are used to 


denote those given traction and displacement on the corresponding boundaries. The 
traction and displacement boundaries are denoted by Sc and S, respectively. 


Figure 14.1 A deformation body with volume V and with boundaries S, and S, 
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For two unrelated states of a deformable body, one is an arbitrary admissible stress 
o that satisfies equilibrium in V and traction boundary conditions on So, and the 
other is an arbitrary admissible displacement u* that satisfies a compatibility 
condition in V and displacement boundary conditions on S,. Note that the asterisk 
for the displacement field is used to emphasize that the stress (without asterisk) and 
displacement fields (with asterisk) are unrelated. These two states are unrelated and 
not necessary real (this is the reason why we call them virtual in the first place). We 
said that virtual work is done, if the admissible stress state has undergone an 
unrelated admissible displacement state. In this sense the work done is not 
necessarily real (or virtual). The external virtual work done by an external force 
(body force f and applied traction and displacement) on the body undergoing an 
admissible displacement field w is 


I, =[s-utav+ | T-uras+ | 7-ads (14.3) 
V Ss. S. 


The internal virtual work is the work done by the admissible stress, which is in 
equilibrium with the external applied force and traction, on the unrelated associated 
strain (1.e., resulting from u*): 


I, -| o:e*dV (14.4) 
V 


The principle of virtual work says that the external virtual work must be equal to 
the internal virtual work when the equilibrated forces and stresses undergo 
unrelated but consistent displacement and strain: 


fo: e*dV=[ f-u*dV + [ F-u*as + [T-ads (14.5) 
V S, S, 


ul 


V 


If the body is rigid, the internal virtual work will be zero. In the process of analysis, 
we never impose any constitutive response of the deformable body, and thus, the 
principle is valid regardless of the material behavior. The principle can also be 
applied to consider the case of large deformation as long as the appropriate stresses 
are used in the energy calculation in (14.5). The principle can also be extended to 
dynamic problems, if the inertia force is interpreted as body forces in (14.5). For 
the case of a dynamic system of rigid bodies, it is equivalent to the Lagrange- 
D’Alembert principle for dynamics. 

In applying (14.5), we may impose equilibrium of the “real” stresses and 
forces that are in equilibrium on a consistent virtual displacement and strain in the 
formulation. This will lead to a special case called the principle of virtual 
displacement, which will be discussed next. On the other hand, we may impose a 
consistent real displacement and strain on self-equilibrated virtual stresses and 
forces in the virtual work equation. This will lead to a special case called the 
principle of virtual traction. The principle of virtual work occupies a major place 
in the area of structural analysis, solid mechanics, and finite element analysis. 


14.3 VIRTUAL DISPLACEMENT PRINCIPLE 


If we impose self-equilibrated “real” stresses and forces on consistent virtual 
displacement and strain in (14.5), we have 
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u*=Ou, &*=068 (14.6) 
This virtual displacement must be zero on the displacement boundary S,. Applying 
this notation and constraint on the virtual displacement, we get 


fo: dedV =| f -dudV + | TF -duds (14.7) 
V 


V S, 


This is called the principle of virtual displacement. When the body is elastic, (14.7) 
is related to Castigliano’s first theorem. In other words, if a body is in equilibrium, 
the total virtual work done is zero. In structural mechanics, the principle of virtual 
displacement is normally used to find the real force or stress of a body. 


14.4 VIRTUAL TRACTION PRINCIPLE 


If we impose consistent “real” displacement and strain on equilibrated virtual 
stresses and forces in the virtual work principle, we have 


o=00, T=6T, f=6f (14.8) 

The virtual force is zero on the traction boundary So. 
[oo: exav=[5f-u*av + | 57 ads (14.9) 

Vv Vv s 


This will lead to a special case called the principle of virtual traction. This is also 
called the principle of complementary virtual work. When the body is elastic, 
(14.9) is related to Castigliano’s second theorem. The principle of virtual force is 
normally used to find the real displacement of a body. 

The principle of virtual work will be illustrated by example in a later section. 


14.5 HAMILTON’S PRINCIPLE 


As we remarked in the introduction, the Hamilton principle is very general in the 
sense that it unifies optics, mechanics, electricity, and magnetism by a single 
minimum principle. The following integral is defined in terms of kinetic energy T 
and potential energy V as: 


r=[" (T -V)dt (14.10) 


The principle states that / is at minimum for the path traversed by an object during 
its motion (including the path of light or other electromagnetic waves!) from time fo 
to time f;. For deformable solid, it can be rewritten as: 


a=[) (U(e,)+T -W)at (14.11) 


where W is the work done by external forces and U is the strain energy density. For 
static problems, the total potential energy for a deformable body can be defined as 
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Hl, = [U,) — fiu;}dV - | Tju,dS (14.12) 
A KY 


On the other hand, the total complementary energy for a deforming body is defined 
as 


M1. = [U.(o,)av - | Tazas (14.13) 
V 5, 


For solving equilibrium problems, we need to minimize the total potential energy 
or maximize the total complementary energy. These two functionals in terms of 
energies given in (14.12) and (1.13) are the basis for the variational principle in 
formulating differential equations of physical or mechanics problems. Hamilton’s 
principle will lead to an equation of motions whereas either total potential energy 
or total complementary energy will lead to equations of equilibrium. As 
demonstrated in Figure 2.10 of Chau (2013), these two energy functions are 
complementary to one another in the following sense: 

Il. +1, = oy, (14.14) 


When (14.12) or (14.13) is used in numerical analyses, such as the Rayleigh- 
Ritz method, Galerkin method, or finite element method, it is well known that both 
of them have their limitations. In particular, methods based on minimum total 
potential energy use displacement as the unknown and are more accurate for 
displacement prediction but less accurate for stress prediction (as they were 
calculated based on the numerically obtained displacement); whereas those based 
on maximum total complementary energy use stress as the unknown and are 
accurate for stress prediction but less accurate for displacement prediction. Trefftz 
in 1926 demonstrated that the upper bound of torsional rigidity could be found by 
using minimum potential energy, whereas the lower bound of torsional rigidity can 
be found by using maximum complementary energy. A similar idea was proposed 
by Prager and Synge in 1947 for determining the upper and lower bounds of elastic 
modulus by using the functional space concept. This method has been used 
successfully for finding the Young’s modulus for both cylindrical and rectangular 
specimens of arbitrary shape under compression with end constraints (Chau, 1997, 
1999a). In general, the minimum total potential energy formulation will lead to a 
lower bound for the displacement prediction (i.e., the numerical model appears to 
be stiffer than the actual system), whereas the maximum total complementary 
energy formulation will lead to an upper bound for the displacement prediction 
(1.e., the numerical model appears to be softer than the actual system). This general 
observation is illustrated in Figure 14.2 using the beam problem as an example. 

In finite element methods, the first one is normally referred as the 
displacement method whereas the second one is normally referred as the force 
method. In view of these limitations for both formulations, more general variational 
principles have been proposed. They are normally referred to as mixed variational 
principles because both displacements and stresses are unknowns of the problems. 
In the next two sections, we will consider two generalized (or mixed) variational 
principles; they are the Veubeke-Hu-Washizu (VHW) principle and the Hellinger- 
Reissner (HR) principle, which are more powerful in numerical analysis. These 
principles are the basis for such mixed or hybrid finite element methods. 
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However, the finite element method is a topic that cannot be covered in the 
present or later chapter. We will only briefly summarize the idea of the variational 
principle and give a simple example of the case of incompressible flow in Chapter 
15. 


Displacement g 
i“ 
x I orce ot elements 
‘method orl 
oe ae Upp bo 


i. a bound 
Displacement oll, =0 


method 
Number of finite elements V 


Figure 14.2 Illustration of the upper and lower displacements by force and displacement methods 


14.6 VEUBEKE-HU-WASHIZU PRINCIPLE 


In this section, we will consider a general variational principle called the Veubeke- 
Hu-Washizu principle. It is formulated for elastic bodies and aims for numerical 
methods, such as the finite element method. Let us consider the mathematical 
formulation for an elastic body V as: 


Oy +S; =0, inV (14.15) 
1 : 
Ey = 5 a +u,;;), mV (14.16) 
Oy =AyyEq, OC Ey =O q, mV (14.17) 


The first of these is an equilibrium equation, the second is a strain-displacement 
relation, and the third one is a constitutive law (or Hooke’s law). The boundary 
conditions are: 
u;=u;, onS, (14.18) 
T,=o0,n;=T,, onS, (14.19) 
The first of these is called an essential boundary condition (i.e., displacement 
boundary condition), and the second is called a natural boundary condition (i.e., 
traction boundary condition). The Veubeke-Hu-Washizu principle can be 
expressed as 
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= 1 
Tym = [UE )~ ft, - yy — 5,5 +H) DAV 


ae (14.20) 
-{ Tuas — | T,(u, ~ ii, )dS 
S S 


where independent functions subject to variations are oj, ¢j and uw; and, that is, 
there are 15 unknowns (note that both stress and strain are symmetric). In terms of 
the finite element method, stress, strain, and displacement are also independent. 
That is, we have relaxed the relations given in (14.15) to (14.18). Since both stress 
and displacement are unknowns, they are called the mixed variational principle. 
Comparing the VHW variational principle in (14.20) with the total potential 
energy in (14.12), we observe that there are two extra terms appearing in (14.20). 
The principle can be interpreted in the following form 
Tyan = [U6 ))~ Jia = Ayley — 5 4, #4. AY 
on (14.21) 
- | Zu,as | p,, -m yas 
Ss S, 


where both A, and p; are Lagrange multipliers and they are 


= C. DE (14.22) 


We can see that by adding these two constraints we are actually relaxing the strain- 
displacement relation and essential boundary condition. We allow the flexibility 
that the approximation is sought such that all (14.15) to (14.19) are satisfied 
approximately in a global sense (integrating over the whole body and whole 
boundary). By applying the Gauss theorem, we note that 


I 
Je (u,+u, AV = [oyu AV =-[oyud¥+ [ oynjuas 
V Vv V S,+5S, (14.23) 
= | oy, ;ujdV + } Tu,dS 

V Se tS 


Substitution of (14.23) into (14.20) gives 


Ty = | UE) 9484 -y,, + Adusav — | (1 - Tuas + | Tizds 
V S, S, 


o u 


(14.24) 
From (14.24), we see that the equilibrium equation will not be satisfied 
automatically by our approximate solution. 
The variational principle requires the functional of the VHW formulation to 
be stationary or 
Ollyyw =0 (14.25) 


where 6 denotes the variation. Taking variations of (14.20), we get 
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OU (é;) = 1 
ST yany = [S85 — Hid “Ley — 5, +4) 60 }AV 
V y 
l = 
+] oy (Ou, +5uj;) ode, 3dV — | Touas (14.26) 
V So 
-( [(u; —u;)n Oo, +o,n Ou; |dS 
Su 
By applying the Gauss theorem, we have 
[o,ou, av = -| oj, ;OujdV + | o;n ;Ou;dS (14.27) 
Vv Vv S545, 


Using this result, we can rewrite (14.26) as 


au 2 1 
ST yaw = | (— oFy 85 — (0, + FSU; +15 (up +Mps) ~ py AV 
V 


ff 


: (14.28) 
=| (T, -T,)du,dS - J (u; —i,)n 60,45 
Ss S 


In view of (14.25), we have the following Euler equations and boundary 
conditions: 


C= a inV (14.29) 
Oo; +f,=0, inV (14.30) 
Ey = Stu, +U; 7), inV (14.31) 
L=o4e=7). 05, (14.32) 
u;=u;, ons, (14.33) 


Therefore, we do not assume any of these five equations are satisfied in the VHW 
variational principle. 
In the next section, we will consider another mixed variational principle. 


14.7 HELLINGER-REISSNER PRINCIPLE 


Another variational principle in close relation to the VHW variational principle is 
called the Hellinger-Reissner (HR) principle. The functional of the HR principle is 


1 =z 7 = 
Typ = J elon +ujj)+ fujsdV — J Tuas — | Tu, —u,)dS (14.34) 
V 5 S 


where the independent functions subject to variations are oy, u; and 7;. Again the 
equation can be interpreted as total complementary energy by adding two 
constraints as: 
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1 2 = . 
T= fw. (1) 5 Aap, +My) + Su saV | Tu,ds — | p,(u;—i;)dS. (14.35) 
V Sg Su 


where the Lagrange multipliers are defined as 


Ay =F)» PT, (14.36) 


If we set Aj = 0 and p; = 0, we recover the total complementary energy given in 
(14.13). Using the Gauss theorem, we obtain another form of the HR functional: 

Tee [Ulo)+y,; + fru,}dV | (7, -T)u,ds [ nmas (14.37) 

Vv S S 


7 


Let us consider the corresponding Euler equation by taking the variations of (14.37): 
OTL yp =0 (14.38) 
The variation of (14.38) is 


Olly ale Fo OO +MFOy, 5 +(y,) + ()5u;}AV 
. 7 (14.39) 
7 [un oy + (yn, —T)dudS — | t,60yn dS 
Sy, 
By applying the Gauss theorem, we have 
[ud0,,,dV =-[u, 50,dV+ | wdo,n,d (14.40) 
V V 5,78, 
Substitution of (14.40) into (14.39) gives 
dT yp = J (eu, 80% +(0y,; + fr)Su;}dV 
(14.41) 


“Joon T)ou,ds - [a — u;)n60,dS 
S 


u 


Since the variations can be sien we require the following equation be satisfied: 


= 30,” inV (14.42) 
o,;,+f,=0, inV (14.43) 
T; = On; = =f, on So (14.44) 
u;=u;, ons, (14.45) 


We see that there is one less equation compared to the case of the VHW variational 
principle discussed in the previous section. In this sense, we can see that VHW is 
more general. 
We can substitute (14.14) into (14.34) and get the following identity: 
Te =A yyw (14.46) 
Therefore, the HR variational principle is consistent with the VHW variational 


principle. However, they are not the same since we need to impose one more 
condition (14.14) to link them. 
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14.8 RAYLEIGH-RITZ METHOD 


In this section, we introduce a method developed by Rayleigh in 1877 and by Ritz 
in 1908. In this approach, a finite number of approximations are assumed: 


U = Ug + >, a,.U,. (14.47) 
r=l 

v=v9+ >. bv, (14.48) 
r=l 

w= wy +> ¢,w, (14.49) 


Note that we have used the physical components u, v, and w to represent tensor 
notation uw, v2 and u3. However, these approximations must be selected in such a 
way that the essential boundary condition is satisfied identically by choosing 
Uj =U, YW=V, Wo=w, ons, (14.50) 
u,=0, v,=0, w,=0, (r=1,2,...,2) 0n S, (14.51) 
Note, however, we do not need to satisfy the traction boundary condition imposed 
by (14.19). Substitution of this approximation into (14.12) and let us express the 
total potential energy in terms of the unknown constants a,, b,, and c,. Next we can 
set the variation to zero as 


oll, =0 (14.52) 


We use this condition to seek for the optimum values of the constants a,, b,, and c; 
(r = 1,2,...,2). Therefore, the variables in the formulation are those constants given 
in (14.47) to (14.49). Thus, (14.52) implies 
orl, en, orl, 
=0 =0, =0 (14.53) 
Oa ob 


We can see that we have exactly 31 equations from (14.53) for the 3” unknowns. 
Solving for theses constants, we find the approximations assumed in (14.47) to 
(14.49). This procedure is called the Rayleigh-Ritz method. Note that the Rayleigh- 
Ritz approach is only an approximate method. Alternatively, we can also substitute 
(14.47) to (14.49) into the virtual work principle with virtual displacements as 


ou = Ysa, » OV= > 6b, , OW= > 66», (14.54) 
r=l r=l r=l 


Equation (14.7) leads to the following 3n equations: 


’ 
r 


OG, Oty Ot, = = 
: = = : - dS = 14. 
by =f (Get Bett Folie + | (Te -TeduydS=0 (14.55) 


y. 


M Phy Pip. « hie G dV +| (T,-T,v,dS=0 (14.56 
: - +f jv.dV + ,-T, = . 
af Gott Et inay +] -Tds=0 (1456) 


OT 1. = =] 
N, } Ow + se + F)w,aV + | C, —T,)w,dS =0 (14.57) 
Vv Ox ov Oz a 
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where the stresses are 


Ou “, Ou VE 
226, 4 ag *4— 14.58 
Om { Ox 24 ox l- ay? ( ) 
Ov, VE. 
» 2265 “415, i} (14.59) 
= 1-2v 
Vey 
= 264 M+ 8 re (14.60) 
pe Ou ,& a ye 5 oe ‘te OW (14.61) 
ox a a Mar OZ 
Cy = Gite, My - Ca Le: (14.62) 
Oy Ox “> oy ox 
G(s a (14.63) 


“ , Ho ow, 
=G + . 
oe —— 


The 3n equations from ae are found to be identical to (14.55) to (14.57). Thus, 
the Rayleigh-Ritz method is equivalent to the virtual work formulation. 


cay 14.64 
a y (14.64) 


Example 14.1 Find the system of equations for the coefficients a, for the following 
Rayleigh-Ritz approximation 


x l-x “ . knx 
=—y, +— y+ sin —— 14.65 
Yn =7 Vt ——Yo 2% i ( ) 
for the functional: 
I 
ILy]= J Le? +4? +2 fiax (14.66) 


with boundary conditions: 
VO)=y%, VO= YH (14.67) 


Solution: Note that the assumed y, given in (14.65) automatically satisfies the 
essential boundary condition given in (14.67). Substitution of (14.65) into the 
functional (14.66) gives 
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n 3 


kh k h _ kax .h 
Ity,]=> Daca f [p = cos “cos “+gsin “sin “|ax 
k=l h=l 


ykn yok krx xy, I-x . knx 
AQ a flo 7 P cos j ql + Yo) Sin —— 


fool (14.68) 
x” [lax 
+f sin ytc+ [toe eo 22004 gH yg 
x(1 
ae re 71 4 yd 
The unknown coefficients can a found by the following conditions: 
OE ay a =0,°-, a =0 (14.69) 
Oa, da, da, 


Thus, we have 


0= dal, go HO con cos MEX gsi KE sin 


yn yohn hrx xy, I-x . Ax 
+ cos + + sin —— 14.70 
3 [tea 27 00s A + g(t + yp) sin (14.70) 


+f sin “eds 


where k = 1,2,3,...,1. Once the functions p and g are given, integrations can be 
conducted and the unknown constants can be determined analytically. 


14.9 WEIGHTED RESIDUE METHOD 


Consider a differential equation given in a symbolic form as: 

Lu) = f (14.71) 
We normally cannot satisfy the equation pointwise (if we can, we actually have the 
analytic solution). The weighted residue method is an approximate technique that 
requires the differential equation be satisfied in a global sense. That is, the integral 
of this differential equation multiplying by an arbitrary weighting function is zero. 
Mathematically, it is 


[ie@ — flw,dV =0 (14.72) 
V 


where w; is the weighting function and the approximate function is assumed in 
series form similar to that in the Rayleigh-Ritz method 


it = dp +d (14.73) 
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We are searching for some approximate functions that satisfy (14.72). More 
discussions of the weighted residue method in the context of finite difference will 
be given in Chapter 15. For examples, the following cases are of particular interest. 


14.9.1 Least Square Method 


The least square method of Gauss is recovered if the weighting function equals the 
error function 

vi = Lla(g)\—S (14.74) 
where the approximation of the unknown has been defined in (14.73). This least 
square of the error concept has been found very powerful in fitting data to a straight 
line (or so-called linear regression). 


14.9.2 Point Collocation Method 


The point collocation method is recovered if the weighting function equals the 
Dirac delta function 

y; =0(x-x;) (14.75) 
The point collocation method is a standard procedure in solving boundary integral 
equations (e.g., Ho and Chau, 1999). 


14.9.3 Petrov-Galerkin Method 


The weighted residue method is also known as the Petrov-Galerkin method if the 
weighting function does not equal the fundamental function ¢ defined in (14.73) 


y, #0, (14.76) 
The method is more general than the Galerkin method. 


14.9.4 Galerkin Method 


The Galerkin method is recovered if the weighting function equals the fundamental 
function used in (14.73): 


y, =9¢, (14.77) 


The Galerkin method is a powerful method because it can be shown that it 
originates from the principle of virtual work. It will be further discussed in the next 
section. 


14.10 GALERKIN METHOD 


In this section, we will discuss the Galerkin method in more details. Let us consider 
the two-dimensional cases such that an approximation for w can be expressed as 
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T(x») = 9%) (14.78) 
i=l 
where the unknown constants c; are to be determined. Note that @; needs to satisfy 
both essential and natural boundary conditions (or displacement and traction 
boundary conditions in the case of solid mechanics). For the case of f = 0, 
substitution of (14.78) into (14.72) gives 


[L@e wiv = [Leo wis aedy=0 (14.79) 
V V i=l 

If we pick the approximate function defined in (14.78) as the weighting function, 

we have 


[LQ ee, oi ydxdy = 0 (14.80) 
V i=l 

This is called the Galerkin method and it can be shown that it is the same as the 
principle of virtual displacements. For second order PDEs, it can be applied to 
elliptic, hyperbolic, and parabolic types. In the Rayleigh-Ritz method, we need the 
functional of the problem, however we do not need that in the Galerkin method. All 
we need is the differential equation of the problem. Strictly speaking, the Galerkin 
method is not a variational method since we do not need a functional. For physical 
problems in which the energy function exists, Galerkin is similar to the Rayleigh- 
Ritz method except for the choice of the approximate functions in (14.78). The 
trying functions in the Rayleigh-Ritz method need only to satisfy the essential (or 
displacement) boundary condition, but not the natural (or traction) boundary 
condition; whereas the trying functions in the Galerkin method needs to satisfy both 
essential and natural boundary conditions. This is the main limitation of the 
Galerkin method. 


Example 14.2 Reconsider the minimization problem given in Example 14.1 and 
derive the Galerkin method for it: 


= i 2 2, 
ILy]= J Le? +4? +2 fix (14.81) 


with boundary conditions: 


VO=y%, VDO=Y (14.82) 
Use the following approximation: 
[= 

YET MATA Vo tay tagw, ++ dW, + (14.83) 


Solution: Substitution of (14.83) into (14.81) and differentiation the functional with 
respect to the unknown constants a; (i = 1,2,...) gives 
ol u 
<= 2° [(py’)w,, + gw, + fv, Jd = 0 (14.84) 
ay 
Note that this is the same as the procedure of the Rayleigh-Ritz method. In 
obtaining (14.84), we have used the following identities: 
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ey =w, yy =W, (14.85) 
Ca, Ca, 
The first term in (14.84) can be evaluated by integration by parts as: 
: I . : I I U : rr 
[Cr mar= | Cry 400,) =py'vnl-[ er) de (14.86) 
To remove the boundary terms, we must impose 
w, (0) = w, (J) =0 (14.87) 
Thus, (14.84) becomes 
i 
[++ S)w,dx=0 (14.88) 
This is precisely the Galerkin method for the following differential equation: 
—(py')'+ q+ fy =0 (14.89) 


Through this example, we show that the Rayleigh-Ritz method and the Galerkin 
method is equivalent if the boundary conditions given in (14.82) are identically 
satisfied. 


We have shown that for the particular functional given in (14.81), the Raleigh-Ritz 
method and Galerkin method are equivalent if the boundary conditions given in 
(14.82) are identically satisfied. In fact, we can formulate this observation more 
generally. Putting in a general term, we consider the following boundary value 
problem of a differential equation: 


L[u]=0 (14.90) 
subject to the following boundary conditions: 
u(a)=y (14.91) 
Bluj=¢ (14.92) 
where L and B are given differential operators. The Galerkin method can be cast as: 
U =U +A,W +aWy +... (14.93) 
such that 
uy(a)=y (14.94) 
Blu l= ¢ (14.95) 
w, (a) =0 (14.96) 
Bw, ]=0 (14.97) 
The coefficients for a;, i = 1,2,3,... can be determined from: 
le L{ulw,dV =0 (14.98) 


where V is the domain of the differential equation being defined. 

The Galerkin method can also be shown as equivalent to the principle of 
virtual work. In particular, for the more general functional for the two-dimensional 
case the functional can be cast as: 


T[uj= [Fl ytstesttegseoly tyy sells ]dtdy (14.99) 


832 Theory of Differential Equations in Engineering and Mechanics 


Using the calculus of variations, we can obtain the first variation symbolically as 
si= | Llu]ou dxdy = 0 (14.100) 
V 


Note that (14.98) is equivalent to the principle of virtual work. We see that if du is 
replaced by the fundamental function w,, the Galerkin method is therefore 
consistent with the principle of virtual work. It is a direct method for finding the 
stationary value of the functional (14.99). However, we do not need a functional of 
the problem in the Galerkin formulation and we only need the Euler differential 
equation of the functional. Therefore, it is more general than the Rayleigh-Ritz 
method and is applicable to problems in which a functional does not exist. We will 
further illustrate this idea using the following example. 


Example 14.3 Solve the following cantilever beam problem shown in Figure 14.3 
using the principle of virtual displacement together with the Galerkin method. 
The boundary conditions at x = 0 are 
w(0)=0, w’'(0)=0, (14.101) 
You can assume the Euler-Bernoulli beam theory and the following two term trial 
functions: 


W= ax? + ax° (14.102) 


Figure 14.3 A cantilever beam subject to two concentrated forces 


Solution: Note that by choosing (14.102), both of the boundary conditions given 
(14.101) are satisfied. In a sense, both essential and natural boundary conditions are 
satisfied, and thus it is a feasible choice for the trying functions of the Galerkin 
method. It is also the reason why we need a trying function of a power cube. In 
addition, we cannot add a linear term of x in (14.102), if we do, the second 
boundary condition of (14.101) will not be satisfied. 

The bending moment corresponding to the deflection given in (14.102) is 
(Timoshenko, 1956) 


M =-Elw" =-EI (2a, + 6apx) (14.103) 
The bending stress induced by this moment is (Timoshenko, 1956) 
o= “es (14.104) 


The nonzero internal virtual work is 


W, = | o:5e.dedyde (14.105) 
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Using Euler-Bernoulli beam theory, we have the bending strain proportional to z 
measured from the neutral surface (mid-surface) of the beam: 

&, =—2zn(x) (14.106) 
where 77(x) is an unknown function to be determined. Substitution of (14.106) and 
(14.104) into (14.105) gives: 


_f Mz __f71 2 Oe is 
W, = | zdn)dxdydz = —( 74 2"dydz)Mndx=—| ° Méndx (14.107) 


Figure 14.4 shows the more general case of external virtual work due to distributed 
load, concentrated load, and concentrated moment. The external virtual work due to 
virtual deflection dw is 


Wy = i q(x)Swdx + P.5w(21)+[-M bw'(21)] (14.108) 


where the minus sign in front of the moment and rotation product term indicate that 
the convention of moment is opposite to the positive slope. 
For the present case shown in Figure 14.3, we have 
W,, =-Péw(l)- Péw(21) (14.109) 
Using the principle of virtual displacement, we balance the internal and external 
virtual work to give 


21 
- I, M6éndx = —PSw(I)— Péw(2l) (14.110) 


Z,w(x) 


Figure 14.4 The general case of a beam subject to distributed load, point load, and concentrated 
moment 


Taking the virtual displacement of (14.102), we obtain 
Ow =6a,x° + 5ayx° (14.111) 


Using Hooke’s law to relate (14.104) and (14.106), we get 
2 
M =-51* =-£Iy (14.112) 
dx 


Thus, we have 
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d°Sw 
on = (14.113) 
dx* 
Substitution of (14.111) into (14.113) gives 
d° Sw 
on = ee = 26a, + 60a,x (14.114) 


Substituting (14.111) and (14.114) into (14.110) gives 
21 2 
(-2{ Max + 5PI”)8a, + (-6f Mxdx +9PI)5a, =0 (14.115) 
0 0 
This implies the following two equations to be satisfied: 


21 
-2( Max +5PP? =0 (14.116) 
0 


21 
-6| Mxdx+9PE = 0 (14.117) 
0 


Substitution of (14.103) into (14.116) and (14.117) and integration of the resulting 
equations gives 


Pl 
8a, + 24a,/ = -5 — 14.118 
1 2 EI ( ) 
Pl 
24a, + 96a,1 = —9 — 14.119 
ay ay EI ( ) 


Solving for the unknown constants, we find 
11 Pl _1Pl 


a= , a 14.120 
‘8 EI’? 4 EI 
From (14.102), we have the approximation as: 
W(x) = De eae (14.121) 
8 El 4 El 
Substitution of (14.121) into (14.103) gives 
M(x) = =( 1/-6x)P (14.122) 


For the present problem, it can be solved exactly by taking the moment by cutting a 
free body of the beam, and the resulting moment can be plugged into (14.103) to 
solve for the deflection. The final exact solutions for the deflection and bending 
moment are found as: 

M(x)=(3/-2x)P, O0<x<l 


14.123 
=(2/-x)P, I<x<2l ( ) 
w(x) Soe, O0<x</ 
e , pp Bp (14.124) 
Ga 4522). geee 
EI 6 2 6 


The details of this evaluation will be left as an exercise for readers. The 
comparison of the results of the Galerkin method and the exact solution is given in 
Figure 14.5. 
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Exact 


w( xpi 
pe ii SH RRIRSERE Ss Galerkin method 


| 


Figure 14.5 The deflection of a cantilever beam subject to two point forces shown in Figure 14.3. 
The solid and dotted lines are the exact solution and the Galerkin approximation 


The comparison of the exact solution given in (14.123) with the Galerkin 
approximation given in (14.122) is plotted in Figure 14.6. It can be seen that the 
approximation for deflection shown in Figure 14.5 is much better than that of the 
bending moment as shown in Figure 14.6. Actually, this illustrates a very general 
observation that the virtual displacement method (the present case) gives a more 
accurate approximation for the deflection in (14.121), whereas the virtual force 
method will give a better approximation for moment. In the present case, the 
bending moment is evaluated as the second derivative of deflection given in 
(14.121), and this indirect evaluation always gives a less accurate prediction. 

Mix) * 

PI 


SS 9s=3e=ear"" Galerkin method 


Ww 


Exact 


ri) xii 
Figure 14.6 The comparison of bending moment of a cantilever beam subject to two point forces 


shown in Figure 14.3 


In the present example, we consider a fixed end cantilever beam subject to 
concentrated loads that can be solved quite easily because we want to compare the 
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approximation of the Galerkin method with the exact solution. More generally, for 
more complicated mechanics problems where an exact solution is not possible, an 
approximation like the Galerkin method will be found very useful to provide the 
first order approximation and gives insight into the solution of the problem. 


Example 14.4 Consider the two-dimensional stress analysis shown in Figure 14.7. 
Find the stress state in the plate by the Galerkin method. 


Figure 14.7 Rectangular plate subject to parabolic pulling from two ends 


Solution: As shown in Chau (2013), this kind of plane stress problem can be solved 
by using Airy’s stress function: 
a a Ce 
ee 

oy “ax Oyox 

The stress function satisfies the biharmonic equation 
4 4 4 

a 2 A PE ay (14.126) 
oy OvOx" Ox 


The boundary conditions can be prescribed as: 


(14.125) 


O7rx 


2 2. 2, 
e =F 7), Oe Sh aj (14.127) 
oy b Ovox 
2 2 
~f =0, — =0, y=+4b (14.128) 
IX VOX 


We choose the following fundamental function: 


1 ly? 
b=5 Ty (1 ; 3 )+ (x? -a?)? (7 —B?)P? (a tax? +a,’ +...) (14.129) 
Differentiating (14.129) with respect to x and y, we have 


2 
>, vy 2 2)2/2,2 42 2 2 
thay Oe ey OPO Naan aay) (14.130) 


#2a3(x? —a?)?(y? =)? +16a,y"(y? =? (x? =a?) 
bey = 1Oxy° —a°)\(y* —b° Yay tay” + azy” +...) (14.131) 
+8xya9(x? —a?)° (9? -B*) +8190? ay? BP 
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Pee = 4X7 —a? )(y? —b°Y (a, + ax" +05y* +...) 
416x745 (x? oF = y + 2a, (x* avy (y” =p ¥ 
It is obvious that all boundary conditions given in (14.127) and (14.128) are 


satisfied identically. It is a matter of art, however, to identify the feasible form of 
the fundamental solution given in (12.129). Using (14.100), we have 


(14.132) 


a eb 
} J (vip)dpdrdy = 0 (14.133) 
The variation of the fundamental function is 
Ob = (x? aay (9? =p) (da, + 6ax° + day" +...) (14.134) 


To further simplify the problem, we only retain a). Substitution of (14.130) to 
(14.132) into (14.133) gives 


ore 2 422 2 Dvrq 2 32 2 es ly 
[J 240 BP Pay +32Gx? —a°)(3y? —b ay + 24(x7 — a? Jay ~ 0} eee 


x(x? —a’)°(y? —b’) dxdy =0 
Therefore, conducting integration, we have 


64,4 256 64 
j= ib (— eh 4— or ee 14.136 
=H/t ce re P yt ( ) 
This gives the approximate stress field: 
2 
Onn = Tl 25) +400 vy Gy -B)4, (14.137) 
Oy, =4(y? —B’) Bx? —a")a, (14.138) 
Oy, =—16(x" — a” (y* —b*)xya, (14.139) 


For the special case of a square plate a = b, we have 


ie 
a, = 0.042532 (14.140) 
b 


14.11 KANTOROVICH’S METHOD 


In this section, we introduce a method closely related to the Galerkin method by 
considering the more complicated problems of plate bending. Consider the case of a 
rectangular plate subject to two-dimensional bending, as shown in Figure 14.8. The 
plate has fixed supports on y = +8, and is free on x= +a. The governing equation of 
the problem is found to be a nonhomogeneous biharmonic equation (Timoshenko 
and Woinowsky-Krieger, 1959): 


Otu Otu xz Otu _ g(x,y) 


VV-u =Viu =—+ 
at aya? ayt COD 


(14.141) 


where D is 
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3 
D= —— (14.142) 
12(1-v~) 
The boundary conditions are 
u=0, On 6, on y=+b (14.143) 
oy 


We assume the following fundamental function: 
U=P(V) AO) + ANS) +--+ On, fn) (14.144) 


For the special case that the loading is symmetric with respect to y, we have u being 
an even function of y. One such function is 


A Y=Or BY Yh? (14.145) 
As a first approximation, we consider the following term: 
(x,y) =(y7 —b°Y f(x) (14.146) 


where fis an unknown function to be determined. Note that this method is not exactly 
the Galerkin method since we have not specified the boundary conditions on x = +a. 
The function f will be determined analytically. Thus, the current method is a semi- 
Galerkin method. This method was first proposed in Kantorovich and Krylov (1964) 
and was called Kantorovich’s method by Reiss (1965). 


Figure 14.8 Rectangular plate with two free edges and two fixed support edges 


To proceed with the first approximation, we have 


a(y=(0" -b’Y (14.147) 
Using the Galerkin method, we have 
b 
} vii —Deaiyddy =6 (14.148) 


Substitution of (14.147) and (14.146) into (14.148) gives 
b 
| (247 42012? =467) f’ +(y' =o" yf ale b*?)?dy=0 (14.149) 
-b 


Integration gives the following results: 


b 
[2407 0? =" (14.150) 
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12 
(ie 8(3y? -b?)(y? —B?)? dy = -2 28? (14.151) 
105 
256 
—b’)* dy =—— 0? 14.152 
(i (y? —B’)*dy = 6 ( ) 
Substitution of (14.150) to (14.152) into (14.149) gives 
12 12 
a Bf) a 50 f"4 =o" f =p) (14.153) 
where 
b Xx, 
diane (14.154) 
For the homogeneous case, we have the exponential form for fas: 
f=e* (14.155) 
The characteristic equation of the homogeneous form of (14.153) is 
BOO yg pye a IPG (14.156) 
315 105 5 
The roots of A in (14.156) are: 
A,23,4 =" (4a £if), a=2.075, B=1.143 (14.157) 


The general solution becomes 


F(x) = 4, cosh(a~)cos(B » + A, cosh(a ? sin(B p 
b (14.158) 
: Me gk ; x x 
+B, sinh(@ D> sin(B re + B, sinh(a D> cos(P D> + f(x) 
For the special case of uniform distributed load, we have 
b 16 
q 7,2 p2\2 q 
=| +()’-b*)dy=—+ 14.159 
qi(x) - Ydy= 24 (14.159) 
The particular solution becomes 
128 BS BS 
= 14.160 
wie ae ( ) 
Thus, we have 
q 
et 14.161 
24D ( ) 
For the case of a symmetric ae condition on x = +a and x = —a, we must 


have u(x,y) and f(x) an even function of x. Thus, (14.158) is reduced to 
x x : pe x 1 q 

= Acosh(a — —)+ Bsinh + 14.162 

f(x) = Acos (ao) s ko sine) yD ( ) 


We further suppose that for the case of fixed end supports on x = ta, we have 
f(#a) = f'(£a) =0 (14.163) 
Substitution of (14.162) into (14.163) gives 


a a . a. a l gq 
Acosh(a— —)+ Bsinh =0 14.164 
cos ie noe) Si re aT) a ( ) 
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A sinh(a 7) cost > i cosh(@ si ra) 


B 
b 
This provides two equations for two unknowns A and B and the solutions are 


_n lg pue2t4 


(14.165) 


a a... a : a a 
+B F cosh(@ Pe + —sinh(@ ee re 0 


; (14.166) 
% 24D Yq 24D 
where 
Yo = Bsinhayucoshayu+ asin Bucos Bu (14.167) 
y, =-(acoshaysin But Psinh aucos Bu) (14.168) 
Y, = asinh aucos Bu — 6 coshaysin Bu (14.169) 
a 

== 14.170 
Has ( ) 

For an infinitely long plate, we have a > « and pt > ©, we have 
in(x,y) = L(y) -5?? (14.171) 


24 D 
This equals the exact solution. For a square plate np =1 and v = 0.3, we have the 
solution at the center as 


: 1 2b)" 
(0,0) =0.479 64 = 0.01362 20) (14.172) 
24 D Eh? 
The exact solution for this case of a square plate is 
Umax = 0.0138 426)" (14.173) 
Eh 


Thus, the approximation differs only by 1.3% from the exact solution. Referring to 
Timoshenko and Woinowsky-Krieger (1959), we can also find the maximum 
bending stress in the plate accordingly (p. 42 of Timoshenko and Woinowsky- 
Krieger, 1959) 


6M 6M, 
On) itax = a (Oy max = os (14.174) 
The bending moment can be found from deflection as 
Ou O7u O-u Cru 
M, =-D(—+v—), M, =-D(—,+v—) (14.175) 
Ox oy : oy Ox 
Using these expressions, we find 
g(2b)" q(2b)° 
(O,.)max = 0-140 ye (O,, max = 0.138 72 (14.176) 
The exact solution for the bending moment is 
g(2b)” 
(Oy max = (Py max = 9.137 2 (14.177) 


Thus, the error of bending moment by the semi-Galerkin method (or Kantorovich 
method) is about 1~2%. 
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14.12 ©>FUNCTIONAL FOR BIHARMONIC EQUATION 


As remarked in the introduction, both the Rayleigh-Ritz and Galerkin methods were 
originally motivated by plate bending problems. In this section, we will consider the 
following functional form for the plate bending problem: 
9. <2: Ou Oru Ou 2 
U ={[ [(V2u)? -200-v) (eS?) 9 faded 
Ox" Gx" — éxdy (14.178) 


Ou 
-2 p(s)uds + 2) _m(s) — 


where distributed load g divided by D is denoted by f (= q/D), point force on the 
boundary by p, and concentrated moment on the boundary by m. Physically, the 
functional is the total energy minus the external work done as: 


u=|[ (av-du,)-v, (14.179) 
where 

2. 2. 2 

Fe fy pe +M, ay +2M,, Si (14.180) 
20 ae By” ” Oydx 

dU, =uqdxdy (14.181) 

Ou 
— as 14.182 
U; 2l p(syuds +2) m(s) ds (14.182) 


Substitution of the following definitions of bending and twisting moments into 
(14.180) and the result into (14.179) gives (14.178): 


2 2 2 7) 
M, = pe Sige +). ede ave *) (14.183) 
Ox Oy : Oy Ox 
Oru 
M == PU=V) 5 (14.184) 
x 
The calculus of variations requires 
quran) _6 (14.185) 
da a, 
That is, we have to substitute 
u<utan (14.186) 


into (14.178) and differentiate it with respect to a. More specifically, we find 


<u +an)) = vu)" + 2aV7uV72n +.07(V2n)7] 


(14.187) 
=2V*uV?n +2a(V°ny 
d O&(u+an) (uta do an, a a 
a) Oa OEE g Ty E gy 
da ox ox da ex ox” éy oy 
(14.188) 
_ Puan  eudrn an On 


- a 
oy? ax? Ax? dy" ax? dy’ 
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d O(utan). d , uy Ou. On. » On.» 
de Gee ae a Gap? 
: (14.189) 
= Hy 28) 4 20 2? 


OxOy Ox a OxOy 
Setting a = 0 into (14.187) to (14.189) and substituting the results into (14.178) 
gives 


2 2 
[[ even 1 CEN, Ou On 7 ou on 


2 ay? dy? Gx? —s Oxy Axdy 


-2) p(s)nds if 2f ms) has =0 


)—2 fn|dxdy 


(14.190) 
This equation can be further be simplified using the following identity 


2, 2 2 2 2 
VuV7n = Vue 7 n,  OV°u 67 , 2,8 n , Ov u On 


dy" ax? Ox Ox Ox? oy oy 
ae? On On OV7u nowt én Vu OPV u 
ay? ax x a sy: 8 ay? 
Ae ~~ 7 a . (14.191) 
+ + =([—(V~u sa Vru | 
Cae ae ae ( A a ay 
42 av? vy a owt ate ae 
oy Oy ax? dy" 
Applying (14.191), we have 
{| V7uV7y dxdy = {| outa? ea 
A A Ox ox oy oy 
(14.192) 


ad, OVeu, 0, OV-u aVvru  OVeu 

d d) 
WLR OG Oty Ose ee | a tae e 
The first and second terms on the right hand side of (14.192) can be reduced to 


{| ev ey. © py | Veta ey 04.193) 
A Ox ox oy oy va ox ov 


a, 8V7u, a, OV7u eV-u , OV7u 
+ dy = d d. 14.194 
ROBO Ge = | ny a 4.194 


Using these results, we finally get 


I, V7uV"n dxdy = If, nV *udedy+ | vaus sds fas n 


In addition, it is straightforward to show that 


“ds (14.195) 
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Ou on, Oud’n . Ou dn 
z If, 2 ) dxdy 


ax? Gy? ar ax? OxOy Oxdy 
{| 0 0°u dn d'u On, 0 Oudn Ou On 
A Ox dy" Ox Oydx dy’ Oy ax” Oy dyOx Ox 
where (14.196) is the second term in the first integral of (14.190). It can further be 
reduced to 


(14.196) 
)]dxdy 


=e én Ou any, et én Ou On 
Tr @y? Ox Oydx Oy " ax” Oy Ayax Ax 

Note also that by the chain rule we have 
On _ On es | ON On _ on on 


y,, lds (14.197) 


: oa (14.198) 
Ox Os Ox On Ox Os On 
On _ On Os Rs ee : (14.199) 
Oy Os Oy Onoy Os”” On 
The directional cosines are given as: 
x, =cos(s,x), y, =cos(s, y) (14.200) 
x, =cos(n,x), y, =cos(n, y) (14.201) 
Substitution of (14.198) and (14.199) into (14.197) gives 
a x24 Ou 2 Ou on 
2 
r'3y ar x In ~2XnVn Bye aa 
, (14.202) 
2u Ou ,On 
— or x, X, +(x —ds 
a VsVn — éy 2: ( Vn V5, "Sox as 
Applying integration by parts to the last term, we have 
u Ou Ou .0n 
ae VsVn dy aD Xs Xn (X5Vn VsXn) Byax! as 
(14.203) 


2 


=f 6 ou SO ay oy 
lr és ox Ta Vs¥n oy? Vn Vs n dyex 


Using (14.195), (14.202), and (14.203), we obtain the following formula for 
(14.190): 


{I (V4u—4 ydxdy +] (Mieminn a -{ [P(u) + p(s)Inds (14.204) 
A D ie on is 


where 
M(u)= V-u (1 wie xe 4+ *u ae: ‘u ) (14.205) 
_ Xn a 2 Vn n¥n aya : 


a) (& u Ou Ou 
Xx, +(x x 14.206 
as os 2 VsVn ay 2 sn ( sVn — Vs oe a ( ) 


P(u)= ev =u+(1-v) 


Therefore, the Euler equation of the problem is 
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Vase (14.207) 
D 
and the boundary condition on J” 
M(u)+m(s) =0 (14.208) 
P(u)+ p(s) =0 (14.209) 


We can see that the variational method provides a systematic approach to derive the 
governing equation of plate bending given in (14.207). For the case of a built-in or 
fixed support, we have boundary conditions as 


“ 


xo 230. on (14.210) 

On 

Employing the Galerkin method, we have 
=> aK % (14.211) 

k=1 


In the context of numerical analysis, the so-called strong form requires 


u(x, y)— Dag (%¥) 


k=1 


<é (14.212) 


where ¢ is the error control. This is a rather strong point-wise requirement. The 
weak form, however, relaxes this requirement to 


If [u(x, v)—- YM (x, y)|dxdy <¢ (14.213) 
k=l 


The required accuracy is satisfied in a global sense in the weak form. According to 
the previous section, the Galerkin method requires: 


} | [v4u, -Lo,dxdy = 0 (14.214) 
A De 
where s = 1,2,..., n and 
(X,Y) = >) Pe (%Y) (14.215) 
k=l 


14.13. VIBRATIONS OF CIRCULAR PLATES 


For simplicity, let us consider the axisymmetric natural vibrations of circular plates 
with built-in edges (see Figure 14.9). In this case, we have 


@ 1a alee 
dr? Yr ar dr’ “rdr 
where J is the square of the normalized vibration frequency: 
_120-v*)pa* 
EP 

where is the circular frequency of vibrations and p is the density of the plate. 


Viu—Au =( 


)-Au=0 (14.216) 


A (14.217) 
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Built-in edges 


Figure 14.9 Axisymmetric vibrations of circular plates of radius a with built-in edges 


The built-in edge requires 
Ou 
Fees 
r=a or 
The following fundamental solutions can be shown to satisfy the built-in conditions: 
Uu, = QP, (r) +yQy (r) +. + GnPn (r) 


=0 (14.218) 


r=a 


2 2 2 (14.219) 
= a,(1-)? +a,(1-4)? +...44,0-5)™" 
a a a 
Substitution of (14.219) into the following Galerkin method gives: 
I } (V4u,, — Au, )o,dr =0 (14.220) 
A 
Taking the first two terms in (14.219) we have 
192 Aa‘ 144 Aa* 
a +a =0 14.221 
iC 9° 5 ) + ay ( - = ) ( ) 
144 Aa* 96 Aa’ 
a +a =0 14.222 
iC 5 Se )+ ay ( aa ) ( ) 
The characteristic equation becomes 
(Aa*y? at +193536 =0 (14.223) 
The smallest root of (14.223) is 
AS ws (14.224) 
a 
With this value of 1, we find 
0.455a, —1.4a, =0 (14.225) 
The approximation of the deflection in (14.219) is 


2 9 
uy = a[(1-—)? + 0.3250-—)*] (14.226) 
a a 
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14.14 SUMMARY 


In this chapter, we have presented an overview of variational methods. Because of 
its relation with the Galerkin method, the related principles of virtual work, virtual 
displacement, and virtual traction are introduced. The more general Hamilton 
principle, Veubeke-Hu-Washizu principle, and Hellinger-Reissner principle are 
reviewed in view of their fundamental importance in numerical analysis, such as the 
finite element method. Approximate techniques related to the variational principle 
are introduced in Sections 14.8-14.10, including the Rayleigh-Ritz method, 
weighted residue method, and Galerkin method. A semi-Galerkin type method 
known as Kantorovich’s method, a term coined by Reiss (1965), is introduced using 
plate bending problems as an example in Section 14.11. In essence, for problems 
governed by partial differential equations, the Galerkin method is used to 
approximate one of the variables whereas the other one is solved analytically. 
Therefore, it is a semi-Galerkin type or semi-analytic technique. Functional 
formulation for plate bending is considered in Section 14.12 before the vibrations 
of circular plates are considered. 

There are a number of good books on the variational method, including Mura 
and Koya (1992), Washizu (1982), Kantorovich and Krylov (1964), Reiss (1965), 
and Reddy (2002). There are also more specialized methods similar to the Galerkin 
method, such as the Trefftz method. In this method, the approximate solution is 
selected such that the governing equation is exactly satisfied and the boundary 
conditions are satisfied approximately in a variational sense. However, it is not easy 
to find the approximation that satisfies the governing equation, and thus the Trefftz 
method is not discussed in the present chapter. 


14.15 PROBLEMS 


Problem 14.1 Consider the two-dimensional stress analysis shown in Figure 14.7. 
Find the stress state in the plate by the Galerkin method using the following 
approximation of Airy’s stress function: 


1 ly 
b= Ty 5 pte ay (y? Bb?) (a, +ayx? + a5y" +...) (14.227) 


(i) Show that the constants a1, a2, and a3 are governed by 


64 25667 645% >, 64 64b4 >, 646° 646° (e 
a,(—4 = q)+a,a (4 qt aa’( 7+ a ae 
7 49 a Ta 77 49q 49 a Tl a a'b 
(14.228) 
64 645 5,192 25667 192 5% > 646° 645° T, 
a t + Aya + t + Aa t = 
er 7 ® CB We@e 7 ® : GF a 77 » ab? 
(14.229) 
64 645° >, 64 6454 >,192 b* 256b* 192 b° (e 
a,(—4 q)+aya (—4 7) +a,a° ( at - a= 5 
7 lle 711 71a Ta Tat 14° ad 
(14.230) 


(11) For the case of square plates, show that 
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Ti T, 

a, = 0.040402, a) =a; = 0.011742 (14.231) 
a a 

Problem 14.2 Find the deflection of the cantilever beam problem shown in Figure 

14.10 using the principle of virtual displacement together with the Galerkin method. 


W(x) =-——x" +——x (14.232) 


Problem 14.3 Find the deflection of the cantilever beam problem shown in Figure 
14.11 using the principle of virtual displacement together with the Galerkin method. 


Figure 14.11 A cantilever beam subject to a concentrated moment 


w(x) = sag 


14.233 
2EI ( ) 


Problem 14.4 Find the deflection of the cantilever beam problem shown in Figure 
14.12 using the principle of virtual displacement together with Galerkin method. 


Ans: 


w(x) = ~ | qopi+4M)x yo 2 (14.234) 
SEI 12 El 


Figure 14.12 A cantilever beam subject to two concentrated forces and a concentrated moment 
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Problem 14.5 Show that the functional of the following Sturm-Liouville problem: 


Fag eat oars =0 (14.235) 
dx dx 
y'(a)=0, y'(b)=0 (14.236) 
is 
f= af 2)? -q)y? — Ary? Jax (14.237) 
a dx 


Hint: Find Euler’s equation of this function. 


Problem 14.6 Show that the functional of the Laplace equation: 
V’y =0 (14.238) 


a OY.2 _ OW.r 
J all a) ta) lane (14.239) 


Hint: Find Euler’s equation of this function. 


Problem 14.7 Following the procedure used in Section 14.12 for plate bending, 
show that for functional: 


EI f! d*y.5 ! 
Ity]= I dx I dd 14.240 
Ly] 5 a 91% ( ) 
we have, by substituting y = ut+an, 
di(ut+an)) _» (14.241) 
da a=0 
The stationary value of the functional requires 
1 q? 2 l 
Er| (hy Dar qndx =0 (14.242) 
0 dx? dx? 0 


Problem 14.8 Show that Euler’s equation for the function given in (14.243) is the 
Euler-Bernoulli beam theory: 
q? y 
ares —q(x)=0 (14.243) 


Problem 14.9 Consider the case of a simply supported beam subject to a uniform 
distributed load shown in Figure 14.13. The essential boundary conditions are 
u(0) =u(/) =0 (14.244) 
whereas the natural boundary conditions are 
u"(0) =u"(1) =0 (14.245) 
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Use the Rayleigh-Ritz method by adopting the following approximation: 
u(x) = c, (x) + co, (x) (14.246) 
where 
&(x)=x7-x), (x)= x*(I-x). (14.247) 
Note that the approximation given in (14.247) only satisfies the essential boundary 
condition. Find the Rayleigh-Ritz approximation. 


q(x) = Jo 


Figure 14.13 A simply-supported beam subject to uniform distributed load 


Ans: 


qol* x? 
ie, 14.248 
m= 24EI- 1 PP ( 


Problem 14.10 Find Euler’s equation of the following functional for membrane 
deflection: 


1 Ou Ou 
Mul =[f 7S? +} - fuldeay (14.249) 
A2 Ox ov 
where 7 is the tension in the membrane and fis the applied load on the membrane. 
The boundary condition is 
u=0, onl (14.250) 
Ans: 
Ou Ou 
aaa a (14.251) 


Problem 14.11 Further simplify (14.205) and (14.206) by referring to the following 
diagram: 


(1) Referring to Figure 14.14, show that 
Xy = Vy =n, =Cos(t,x), Y, =X, =n, =cos(n, y) (14.252) 


(ii) Show that (14.205) and (14.206) can further be simplified to 


2 2 2 2 
uta) = Coy 8 4 Thy P8 20--v)nyny ZH (14289 
Ox oy oy Ox Oyox 
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a 92 a2 2 2 
P(u)= 2 Vut— (2 faye ~)n,n, ‘ec us ie ~)n,n, 
On ds ax? ay *” ay’ ax? ‘ 
if (14.254) 
oa 
+(l-v)(n2 —n2 
(l—v)(n, maa ] 


Figure 14.14 The relation between directional cosines between normal and tangent 


Problem 14.12 Show that along a straight edge along x = a the boundary conditions 
given in (14.208) and (14.209) become 


2 2 3 3 
CE Fag: idle v) EE sp (14.255) 
Ox oy ox OxOy 


CHAPTER FIFTEEN 


Finite Difference and Numerical Methods 


15.1 INTRODUCTION 


The origin of the finite difference method (FDM) probably traces back to the time 
of Leibniz and Euler (e.g., Euler’s method in 1768), and subsequently evolved into 
different techniques (e.g., Runge-Kutta method). The FDM became more 
established after 1928 after the Courant-Friedrichs-Lewy (CFL) stability condition 
was derived for hyperbolic type partial differential equations (Courant et al. 1928). 
Its day-to-day application, of course, starts with the popularization of computers, 
especially personal computers. Although for solid mechanics and structural analysis 
the emergence of the finite element method in 1960 took over the role of finite 
difference in numerical analysis, in the area of fluid mechanics, the finite difference 
method remains a popular choice. 

There are many different kinds of finite difference schemes. In general, it can 
be classified into explicit and implicit finite difference schemes. For differential 
equations in time variables, the explicit methods naturally suggest that the unknown 
function of the next time step can be expressed explicitly in terms of the numerical 
results of the previous time steps, whereas in the implicit methods the unknown 
function of the next time step cannot be expressed explicitly in terms of the 
numerical results of the previous time steps. In structural dynamics, the most 
popular methods of finite difference for second order differential equations are the 
Wilson 0 method and the Newmark B method (Bathe, 1982). These methods will 
be discussed in Section 15.3. Multi-step methods, such as the Adams-Bashforth 
method and Adams-Moulton method, and the predictor-corrector method, use more 
than just the time step before the current step, but the solutions from the last few 
time steps. Polynomials of different orders are used to fit the first derivative in the 
differential equations. The predictor-corrector method combines the explicit 
together with the implicit method. 

A major topic in numerical integration for time dependent nonlinear systems 
is the Newton-Raphson method. For searching roots in nonlinear equations, the 
Newton-Raphson method has been proposed. For structural dynamics problems, 
the stiffness matrix of the numerical model needed to be evaluation at each iteration 
step within a time step of integration. Therefore, it is computationally very 
demanding, especially for a large system (1.e., stiffness matrix of very large size). 
There are various modified versions of the Newton-Raphson method. If we use the 
initial stiffness of the model in all time steps as well as in iteration steps, this 
method is called the initial stress method. Clearly, this scheme will not converge 
very fast or may not converge at all. The modified Newton-Raphson method takes 
the stiffness matrix from the last time step during the whole process of iterations 
within a time step. Yet, the convergence may not converge fast enough for some 
highly nonlinear problems. Therefore, it has been proposed that the stiffness matrix 
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should be updated once the convergence is not up to expectation within the 
iterative steps within a time step. This leads to the so-called quasi-Newton and 
BFGS methods. If the stiffness matrix is formulated between the starting point of 
each iteration process and the current iterative point, the method is called fixed- 
point iteration or the secant method. For mechanics problems, the control 
algorithms for nonlinear problems are very important. They are displacement, load, 
and arc-length controls, and pros and cons of these algorithms will be discussed. 

In the final sections, we will illustrate the application of the finite difference 
method as well as the finite element method to the incompressible potential flow 
problems. 


15.2 FINITE DIFFERENCE FOR FIRST ORDER ODE 


Let us consider the following first order ODE in this section: 


= IY) (15.1) 


More generally, we can extend this first ODE to a system of n coupled first order 
ODEs as: 


CZ Ky)=F0 (15.2) 


where C is a matrix of size nxn and y and fare nx1 vectors. The linearized form of 
(15.2) can be written as 


D Ky 
Co + Ky=f(0) (15.3) 


All kinds of finite difference schemes can be formulated by starting with Taylor 
series expansion: 


At d’y 
3! dt 


Ar? d*y 
it, 2! dt” 
In the following subsections, we will apply the general mean value theorem (see 


Article 150 of Hardy, 1944) to (15.4) in arriving at different types of finite 
difference schemes. 


dy 
dt 


Vitus) = Vt, + At) = yt,) + At +... (15.4) 


t=t, t=t, 


15.2.1 Forward Difference (Euler) Method 


If we evaluate the second derivative at an appropriate point, say ¢,+@At, all higher 
derivative terms in the Taylor series expansion can be dropped 
dy M At? d’y 
2 

dt tty 2 dt t=t, +O,At 

where 0 < @ < 1. This is called the general mean value theorem or Taylor theorem 
(e.g., Article 150 of Hardy, 1944). To simplify the following presentations, we 
should adopt the following notation: 


VG) = Vn+t (15.6) 


(15.5) 


VWOns) = VU, +At) = y(t, ) + At 
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With this notation, (15.5) can be rearranged as: 
dy|  _Ynut Yn _ At d?y 
dt|,_, At 2 dt’ 


(15.7) 


t=, +O,At 
If we drop the last term, we get the following forward difference approximation or 
the Euler’s finite difference scheme as: 


dy wont Yn (15.8) 
at | At 


Comparing (15.7) and (15.8), it is clear that the error is proportional to At. We say 
that this error is of the order At or known as the order of the error O(A?) (e.g., 
Erdelyi, 1956; Bleistein and Handelsman, 1986). 
Substitution of (15.8) into (15.1) yields 

Vn+i =Nn f Cia =t)F tata) (15.9) 
Therefore, once we know y, and ¢,, we can find y+: from (15.9). Since the 
unknown is on the left hand side only, it is an explicit finite difference scheme. 
Similarly, we can substitute (15.8) into (15.3) to get 


Cc Cc 

+(-—+K)y. = 15.10 
At Vast ( At Wn ti ( ) 
When C is a diagonal matrix (i.e., all off-diagonal terms are zeros), the unknown 


vector yn+1 can be expressed explicitly as: 


2 GC 
Vast = ACK nt Su} (15.11) 


However, when C is non-diagonal, (15.10) is not explicit but implicit. That is, we 
need to solve the matrix equation to find yn+1. 


15.2.2. Backward Difference (Euler) Method 

Alternatively, we can evaluate the function at a previous step ¢,-1 or substitute —Az, 
for At, in (15.5) to get 

dy 
dt 


At? d’y 
ee dE 


n 


W(t, 1) = VE, — At) = v(t, )— At (15.12) 


t=t,, -0,At 
where 0< @& <1. By rearranging (15.12), we obtain the first derivative term as 


dy|  _Yn-Yp1 , At d’y 


dt At 2 dt 


(15.13) 


=, t=, —O,At 
Therefore, by dropping the second term on the right of (15.13), we have the 
backward difference scheme as 


won Yn-1 (15.14) 


It is clear from (15.13) and (15.14) that the error is again of O(Af) which is the same 
as that of Euler’s forward difference scheme. 
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Substitution of (15.14) into (15.1) yields 


Vast = Vn t Gast —t)F Ena Vn) (15.15) 
Therefore, once we know y, and t,, we can find yn+1 from (15.15). Unless the 
function fis given as a simple function such that y,+; can be solved explicitly from 
(15.15), the backward difference scheme is an implicit method. Similarly, we can 
substitute (15.14) into (15.3) to get 


Cc Cc 
pat Fen et Pee (15.16) 


Even when C is a diagonal matrix (i.e., all off-diagonal terms are zeros), it is 
unlikely that K will be diagonal. If this is the case, the original problem (15.3) is 
totally uncoupled. That is, each component of vector y can be determined 
separately. Therefore, the backward difference scheme is implicit. 


15.2.3. Central Difference (Crank-Nicholson) Method 


The idea of applying the general mean value theorem can be extended to consider 
higher order derivative terms in (15.4). In particular, we can apply the general 
mean value theorem to the third derivative as: 


d rae Pe 
ee ee es (15.17) 
it, dt t=t, dt t=t,, +O,At 
d At? d? At da? 
Yn = Iq Ate : » (15.18) 
dt |,_, 2 dt 6 dt 
t=t, t=, —O,At 


forsome 0< 63<1 and 0< @&< 1. Subtracting (15.18) from (15.17) we have 


3 
a (15.19) 


t=t, -O,At 


3 3 
fv d°y 
my | ae 


n 


dy 
dt 


Vans ~ Yn = 2At 


t=t, +O,At 


Dropping the higher order term, we obtain the central difference scheme for the 
first derivative as 


dy wd ntl ~Sn-1 (15.20) 

dt 2At 
When we apply this central difference scheme to the heat conduction problem, it is 
also known as the Crank-Nicholson method (Zienkiewicz, 1977). The first 
derivative in (15.20) is expressed in terms of the solution at time ¢,+At and t,—At, 
and thus it is natural to approximate the right hand side of (15.1) as 

F nav» Ynvi) + fn» Yn) 

bats y) — +1 +1 ; 1 1 


By using (15.21), the finite difference will remain a two-level scheme. 
By applying (15.20) and (15.21) to (15.1), we have 


1 
Vay = Vn-at 2 (at ~ Cyl Lf (that > Yn) + z Gia >Vn-l | (15.22) 


t=t, 


(15.21) 
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Since y,+1 appears implicitly on the right of (15.22), it is an implicit method. 
Following a similar two-level scheme for y and f, we have 


1 1 
v= 5 nt fe Vrs) ’ f > 5 Sot + Sv) (15.23) 
Thus, substitution of (15.23) and (15.20) into (15.3) results in 
C «K C OK 1 
+ + + = + 15.24 
ore , War +¢ oh Wnt 5 (fit Fn) ( ) 


Following the remarks made earlier, (15.24) is clearly an implicit finite difference 
scheme. 

The central difference scheme can sometimes appear in a slightly different 
form. For example, we can expand the Taylor series expansion using half time step 
Ad/2 as: 


At d At? d? At’ d° 
Ynatt? = Int S|. eae a (15.25) 
2 dtl, sty 8 dt t=t, 8 dt t=t, +O,At/2 
At dy At” d*y At’ dby 
Vn-/2 = Vn 5) A (15.26) 
2 dt\,,, 8 dt me 48 dt t=t,-O,At/2 
The difference between (15.25) and (15.26) is 
dy At’ | a? y d° y 
Ynais2 ~ Yn-1/2 = At Ht 48 | ae - (15.27) 
tty, i t=t, +O,At/2 t=t,-O,At/2 
Now, the first derivative of y becomes 
dy wl ntl/2 7 Yn-1/2 (15.28) 
dt At 


t=t, 
Using a two-level scheme similar to (15.21), we find that (15.1) can be evaluated in 
a slightly different form 


1 
Vnt/2 = Yn-1/2 + 2 (tna1/2 ~ trad S tear > Vnsi2) + SACRE Yn) | (15.29) 


Similarly, the central difference scheme for (15.3) becomes 


C K C K 1 
ea 5 Wns2 +¢ re + 5 nar = 5 (fia + Fnsii2) (15.30) 
Alternatively, we can also shift half of the time step to rewrite (15.30) as 
C K C K 1 
+ + 7 15.31 
oe 2 nai ( At 2 Wn 2 S; Fai) ( ) 


This is the Crank-Nicholson (or central difference) scheme given in Zienkiewicz 
and Morgan (1983). 

The interpretation of these three finite difference schemes can be seen in Fig. 
15.1. The actual slope BD at time ¢, can be represented by slope AB in the 
backward finite difference scheme, by slope BC in the forward finite difference 
scheme, and by AC in the central finite difference scheme. 
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y(t) 


Figure 15.1 Interpretation of the first derivative using the forward, central, and 
backward finite difference schemes 


15.2.4 Weighted Residue Approach for Finite Difference Scheme 


As shown by Zienkiewicz and Morgan (1983), all of the forward, central, and 
backward finite difference schemes can be recovered as a special case of a more 
general formulation using the weighted residue approach, which has been 
introduced in Section 14.9. 

In particular, we can adopt a time discretization using linear shape functions 
as shown in Fig. 15.2. In particular, the unknown function y can be approximated 
by 


Nt=1-T Ni, =T 


1 


L* 


element [0] 
0 l 2 3 4 


Lao it & & & * 


Figure 15.2 Time discretization using nodal values with interpolation function 


VY I= DV InNn (15.32) 


m=1 
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If we define the time element 1 between times ¢, and ¢,+1, the approximation for the 
unknown function y can be rewritten as 


yry =i, + Pa (15.33) 
where 
Ni =1-T, Ni =T (15.34) 
t-t 
T= 2, At, =tai-t 15.35 
At = (T39>) 


By formulating the weighted residue of the approximation of (15.3) when (15.32) is 
substituted into it, we have 
I Sha KF-L0 [dt =O, (15.36) 
0 dt 
for n = 0,1,2,...00, and W,, is called the weighting function. Applying different forms 
of the weighting function will result in different kinds of numerical methods. For 
the case that W,, is only nonzero within the n element, (15.36) can be simplified to 


ie oA 50 |i =. (15.37) 
In view of (15.34), the time derivative of (15.33) becomes 
OY a Va lwil (15.38) 
dt At At 


n n 


Substitution of (15.33) and (15.38) into (15.37) and change of variables from ¢ to T 
leads to 


1 1 1 1 
Eft [Ir rar vm e| Sf gare [gona », 
At, 20 0 At, 20 0 


1 
=[ FG, +TAt, )W,,dT 
0 


(15.39) 
Equation (15.39) can be simplified as 
Cc Cc : 
—+y,K +4-——+(1-y, )K = 15.40 
‘¢ Vn ae | At, ( Vand ly, Si, ( ) 


where 
l 1 . l 1 
His ) W,TaT | | W.dT, f,= i f(t, +TAt, W, dT if W,dT (15.41) 
0 0 0 0 
If the applied force fis smooth, we can also approximate the forcing term by node 
values by using the same interpolation or shape function given in (15.33) as 
S(t, +TAt,) © LyNn (1) + fr Naa() (15.42) 
Thus, we can approximate (15.3) as 


Cc C ; ; 
+r |e +|-Ss0-ram |, =(1-y,)f, +% Sat (15.43) 


n n 
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15.2.5 Dirac Delta Function and Point Collocation Method 


The simplest type of weighted residue formulation is obtained by using the Dirac 
delta function for the weighting function. This method has been mentioned in 
Section 14.9.2. of Chapter 14. This type of numerical technique is also known as 
the point collocation method (e.g., Section 2.2.1 of Zienkiewicz and Morgan, 1983; 
Section 1.4.1 of Brebbia et al., 1983). For example, the point collocation method 
has been found useful in investigating stress concentration due to rivet loading in a 
finite strip (see Ho and Chau, 1999). 
That is, we can assume 
W,, =O(T -8) (15.44) 


where the Dirac delta function 6 can be defined as 


0 T#0 
O(T -0@)= 15.45 
( ) lim - T=0 ( ) 
AT>0 AT 


We will summarize some properties of the Dirac delta function here without 
discussing the distribution theory as we did in Section 8.11. This function tends to 
infinity at T = @ but however its integral over the entire domain is finite and 
defined by 


[, @- ear =i (15.46) 


The following sifting property of the Dirac delta function is most useful for our 
discussion in this chapter: 


[, mmeoc _0)aT = h(0) (15.47) 


where @> 0 (named by van der Pol). Before we continue to consider (15.43) by 
using the Dirac function given in (15.44), an informal discussion of Dirac delta will 
be informative. 

This delta function was motivated by its application in quantum mechanics 
and was proposed by Dirac to deal with jump properties of physical quantities 
(Dirac, 1947). Dirac shared the 1933 Nobel Prize in physics with Schrédinger. In 
civil engineering and mechanics, they include impulsive force, concentrated force 
(in contrast to distributed force), concentrated moment in beams and structures, 
heat source and dipole in heat conduction, fluid sources (such as point sink or point 
source as in Chau, 1996 and Kanok-Nukulchai and Chau, 1990 in fluid-infiltrated 
solids) or fluid dipole (see Rudnicki, 1986; Chau, 2013), point charges, dipoles, 
and surface layers in electrostatics. However, if we restrict to our basic assumptions 
on the existence of continuous, smooth, differentiable functions, the Dirac delta 
function is certainly not qualified to even be called a “function” in the normal 
mathematical sense. Dirac was aware of its limitation and called it an “improper 
function,” and he recommended its use in mathematical analysis provided that no 
inconsistency would follow from its usage. That is, once the solution of a physical 
problem is solved by adopting the Dirac delta function, it should be subject to 
classical mathematical analysis to show rigorously that it does satisfy all the 
conditions posed in the original formulation of the problem. Strictly speaking, all 


Finite Difference and Numerical Methods 859 


procedures or formulas derived for differentiable functions cannot be applied to 
deal with the Dirac delta function unless we can generalize the concept of function 
to include “strange” functions like the Dirac function. Because of this reason, a 
whole new area of mathematics appeared and it is now called “generalized 
functions” or “theory of distribution” (Chapter 9 of Sneddon, 1972; Chapter 2 of 
Stakgold, 1979). This is a classic example of how mathematical development is 
motivated by physical problems. 

The subject of generalized functions or theory of distribution was first 
considered by Bochner in 1932 and Sobolev in 1936, but it was the work of 
Schwartz in the 1940s that put the generalized functions on a firm foundation. Later 
contributors include Gelfand and Shilov (1964) and Zemanian (1965). 

The idea of the theory of distribution has been covered in Section 8.11 and is 
briefly summarized here. First, a so-called “support” (or range) around the Dirac 
function was defined. Then, some very smooth testing functions with rapid descent 
were defined such that they are differentiable within the support. At the same time, 
these testing functions are required to be absolutely integrable over the domain of 
the variable. Outside the support, these testing functions would vanish identically. 
The functional space of these admissible testing functions is called the Schwartz 
space. Rules of mathematical operations are formally defined in this Schwartz 
space. In general, many distributed forms of the Dirac delta functions can be found 
and each mathematical form also consists of infinite sequences of such testing 
functions (Sneddon, 1972). Although these admissible testing functions in 
Schwartz space are very smooth, their functional (i.e., testing function that satisfies 
the finite integration requirement) can describe the “wild” nature of the Dirac delta 
function. 

Here we list some admissible choices for the Dirac delta functions using 
Schwartz’s theory of distribution (Stakgold, 1979): 

t 


6(x)= bone ey (15.48) 
ot sat 
5(x) = es red (15.49) 
. tet (4x 
5(x) = ie, re (15.50) 
O(a) = lim aa (15.51) 
-|xP-/4¢ 
5(x) =e Gay? (15.52) 


Note that some of these delta sequences have been introduced in Chapter 8 in a 
slightly different manner (see (8.197), (8.208) and (8.209)). If ¢ is closer to 0° or R 
is closer to 00, the steeper these functions are and the more they resemble the actual 
delta function. Stakgold (1979) showed that these functions can be put into a 
general form as a theorem: 
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fal) =—— f (2) (15.53) 
a a 


where a > 0 and {fc{x)} is a delta family as a > 0 provided that fis a nonnegative 
locally integrable function satisfying 


” fax =1 (15.54) 
Or equivalently, by setting & = 1/a we have 
S, (x) =k" f (kx) (15.55) 
where {Sx (x)} 1s a delta sequence k — oo. Thus, we have 
5(x)= lim f,(x), d(x) = lim S;(x) (15.56) 
a->0 k>o 
Another type of Dirac delta sequence is (Stakgold, 1979) 
Stine (15.57) 
Roo JX 
_ 32 
tim 8-2) afc 
d(x) =4 °°" | (1- x?) de (15.58) 
-l 
0 |x| 21 
2 
1 ( E r (A a7 
6(O™) =4 51 24 14+r* —2rcosd (15.59) 
0 \9| >a 
k : 
lim 1 em sees | E 7 
Six at pa 2 2msin(, x) (15.60) 
0 |x| >a 


Note that (15.57) and (15.60) have been covered in Section 8.11.2; whereas, 
(15.59) appears in the Poisson integral in potential theory (see Section 9.7.6). 


Example 15.1 Construct a delta function sequence for the following function: 


s)=—- 5 (15.61) 
y +X 


where x is the variable and y is a parameter. 


Solutions: 
Let us make the following observation of this function. We note that when x = 0, 


g(x)= e (15.62) 
y 


We note that when y > 0, 


Finite Difference and Numerical Methods 861 


g(x)~ (15.63) 


x 
Figure 15.3 plots g(x) versus x for three different values of y. As shown, the smaller 


is y the steeper is the function. As expected, the Dirac delta function type is 
expected as y > 0. 


Note that 


| a = : wo '{ (15.64) 
yot+x y 


: in (Gea ee", yet 
| = —tan1(~)", =)? - (15.65) 


Te: 
Led d Er ee elisa" p24 
y y 
2 
(5-4 (15.66) 
oy +x? |y| 
Thus, we can define a testing function as 
2 
1 1 1 
f(x)= ue 1 (15.67) 


ay? +x") [yay +x*) |v] al +(e/|y)7] 
such that (15.54) is satisfied identically. The last part of (15.67) is clearly in the 


structural form of (15.53) and (15.56), as remarked by Stakgold (1979). Therefore, 
we have the delta function as 


O(x) = tim £@) = : 


lim u 5 
boy] afl+(x/|y)7] 


(15.68) 


Thus, we also have 
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. y 
lim ; 5 


= lim g(x) = 72. 5(x) = asgn(y)d(x) (15.69) 
yooxy* +y" — y>0 y 


where sgn (x) is defined as the sign of x 
1 x>0 


sgn(x) = 

en(x) e x<0 
It should be noted that this sign function is also a kind of generalized function 
similar to that of the Dirac delta function (Sneddon, 1972). 


(15.70) 


Substitution of (15.44) into (15.43) yields 


Cc Cc , , 
[Eo +04 ‘| -S aoe |, = (-O)f, +O fi (15.71) 


When we specify 0= 0, we obtain the forward (Euler) difference scheme 


Cc Cc 
+(-—_+K) yp. = 15.72 
At, Viti ( At, Wn Sin ( ) 


This is the same as that given in (15.10). When we specify @= 1/2, we obtain the 
central (Crank-Nicholson) difference scheme 


C K C OK 1 
Ge pt aT pte eo) (15.73) 


At 7 


n 


This is the same as that given in (15.31). When we specify 0 = 1, we obtain the 
backward (Euler) difference scheme 


Cc Cc 
Gp Ont a" = Tes (13.74) 


This is the same as that given in (15.6). Thus, all finite difference schemes can be 
interpreted as the weighted residue method with the appropriate delta function as 
the weighting function. 


15.2.6 Stability Condition of 2-Level Scheme 
Let us consider the case without the forcing function (i.e., f= 0) in (15.71). 


Cc Cc 
—+0K +|-——+(1-0)K =0 15.75 
|< Jona At ( ) jp ( ) 


To consider the stability of the 2-level finite difference scheme, we first consider 
the eigenvalue of the original ODE given in (15.3) by seeking a solution of the 
form 
y=ae™ (15.76) 

Thus, we have 

(AC + K)a=0 (15.77) 
For physical problems, the matrices C and K are normally positive definite. In other 
words, we have for any nonzero vector x 
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x’ Cx>0, x’ Kx >0 (15.78) 

For such cases, there are infinite distinct eigenvalues and all of them are negative 
and real. The eigenvalue problem becomes 

det(AC + K) =0 (15.79) 


Let the eigenvalues and eigenvectors of (15.79) be Am and @n(m =1,2,...,M). Using 
the concept of modal decomposition, we can express the nodal unknown in terms 
of the eigenvectors as: 


M M 
Vn = > Bate > Vn = se (15.80) 
m=1 m=1 
where MM is the dimension of the unknown vector y. This model analysis closely 
resembles the eigenfunction expansion discussed in Chapter 10. 
Substitution of (15.80) into (15.75) gives 


m m 
n n 


I 
[Loc ae f-£ +(1-0 rand ya, =0 (15.81) 


Note, however, that the eigenvector a» satisfies 


An Ca.,, + Ka, = 0 (15.82) 
Rewriting (15.82) in another form, we obtain 
C'Ka,, =—An Gy (15.83) 
Substitution of (15.83) into (15.81) results in 
1 1 
—-O,, ay yn"! =[—+(1-O)A, ae, 15.84 
OF m) ‘m)m Ue ( ) ml ‘m)m ( ) 
Therefore, the coefficient of the n+1 mode can be expressed in terms of that of the n 
mode as: 
— + (1 - A)Am 
= Win (15.85) 
—-A 
Gp Mn) 


To ensure a converged solution for the finite difference solution given in (15.80), we 
must have 


| Sie (15.86) 
for m = 1,2, ..., M. Therefore, we require that 
1/At,+0-@)A 
l<[ n+ ) my) <i (15.87) 


as the condition of stability. However, if we want to converge monotonically to the 
true solution, the modal participation factor has the same sign at each time level n 
and (15.85) yields 
ve Lat, +1 O)Ay 
y, : 1/ At, —H,, 


]>0 (15.88) 
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If (15.88) is not satisfied, the solution will be oscillating around the true solution. 

Therefore, the 2-level finite difference scheme will be stable and free of oscillation 

if 

1/At, +U-94)/ 

pee ny <l (15.89) 
1/At, —@A, 


m 


0<[ 


Recall that we use different 9 for different finite difference schemes. Thus, for 
difference methods with a fixed @ (15.89) imposes the corresponding maximum 
time step that can be used such that numerical instability can be avoided. 


Example 15.2 Use the finite difference method to consider the special case of a 
single ODE with initial condition of y = 0 at ¢ = 0, and determine time step to 
ensure numerical stability: 


2 aZy=0 (15.90) 
Solutions: 
The eigenvalue equation of (15.79) becomes 
det(kA +2) =0 (15.91) 
That is, the eigenvalue is A= —A /k. Thus, (15.84) is simplified to 
Goto Dna tg td-OF)4=0 (15.92) 


The stability condition (15.87) becomes 
1/At, —(1-O)A/k 
1/At, +O /k 
Since 2 /k > 0 and 0 < @< 1, the second inequality of (15.93) is automatically 
satisfied. The first inequality of (15.93) can be simplified to 


(15.93) 


(1-20) AAt, < 2k (15.94) 
Therefore, the finite difference method is unconditionally stable if 
O21/2 (15.95) 


Since 0 =1 for the backward Euler difference method and 6 =1/2 for the central 
(Crank-Nicholson) method, these schemes are unconditionally stable. If 1/2 >= 0, 
we have conditional stability if the time step satisfies the following condition 


AAt, < = (15.96) 
(1-20) 
The numerical solution will be stable and free from oscillation if 
1/At, -(1-O)A/k 
/At, ~( a ( >0 (15.97) 
1/At, +OA/k 
This condition can be simplified to 
(1-0) AAt, /k <1 (15.98) 


Therefore, the forward (Euler) difference scheme may not converge if a “big” time 
step is used. The regions of stability are illustrated in Figure 15.4. 
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_ 4 / 
7At, cat 9= 1/2 frowo=1 


Oscillation 


No oscillation 


0) 0. 1 
a 0 


Figure 15.4 Regions of stability and no oscillations 


15.2.7. Courant-Friedrichs-Lewy Condition 


The propagation of mechanical disturbances in solids is of profound importance in 
many disciplines including physical sciences and engineering. In these kinds of 
problems, the loading or disturbance is applied at such a “fast” rate that the effect 
of inertia cannot be ignored. These loadings are normally described as suddenly 
applied. 

Numerical integration of the wave signal cannot violate the causality of the 
arrival of wave signals. In particular, in Chapter 9 we saw that the method of 
characteristics leads to the concept of the domain of dependence. This concept 
leads to the so-called CFL condition (or Courant-Friedrichs-Lewy condition) 
discussed in this section. 

Let us consider the simplest wave equation: 


Uy = CU, (15.99) 
with initial conditions: 
u(x,0)= f(x), u,(x,0) = g(x) (15.100) 
Introduce a new notation for the solution at time ¢ = ¢, and x = x; as: 
nj =U,» x;) (15.101) 
where 
t, =nAt, x; = jAx (15.102) 


Using the second order centered difference scheme, the wave equation can be 
approximated as 


Un+j — QU, j zi Un-l,j =? Un, [+1 —2u, ; FUy jy 


(At)? (Ax)? 


(15.103) 
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We can rearrange this to give 


cAt .9 cAt. > 
Un; = 2[1— es Wy j rece (Uy js +My ja)—Upaj (15.104) 


Therefore, (15.104) suggests e Unt fam On Unj+k Where k = 0, +1 and upy-1;. 
Then, reapplying (15.104) to the solution at time step n—1, we have that wn+1/ 
depends on wp-1,;+4 where k = 0,£1, +2. Further reapplying (15.104) again, we have 
that wn+1 depends on w,-1,;+4 where k = 0,+1, +2, +3. Eventually, we can trace back 
all the way down to the initial time 7 = 0, such that w,+1, depends on 

{Uo j1¢0 & =0,41,12,..., tn} (15.105) 


In other words, the solution depends on a certain domain of the initial data. These 
initial data are known as the numerical domain of dependence. As shown in 
Chapter 9, the characteristics of (15.99) are the two curves: 

Ct x= et =x, (15.106) 

Cl: x+ct=x; (15.107) 
where x; is any point on the initial curve defined in (15.105). Figure 15.5 shows 
two choices of the initial numerical domain of dependence. The numerical domain 
of dependence on the left includes the characteristics (dashed lines), whereas the 
numerical domain of dependence on the right does not include the characteristics. 
Thus, the left one may converge to the actual solution whereas the one on the right 
can never converge to the exact solution because parts of the initial data have not 
been employed in the numerical integration. This can be summarized as: 


Mapes ee (15.108) 
Ax 
At 
Unstable: c—>1 (15.109) 
Ax 
:x—ct = const. C_:x-ct=const. 


f 


Ax 
x ——t=const. 
x1 const At 


Figure 15.5 The importance of including the domain of dependence 


This is called the Courant-Friedrichs-Lewy criterion for the stability of numerical 
integration for the hyperbolic type of PDE. Therefore, this criterion only applies to 
wave type propagation of information. Once the spatial discretization is selected, 
the time step must satisfy the following inequality: 


Stable : es (15.110) 
c 
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15.2.8 von Neumann Test 


The von Neumann test can be used to determine whether a finite difference scheme 
is stable (i.e., converges to the exact solution) or unstable (i.e., does not converge 
to the exact solution). 
Let us consider the following diffusion equation: 
Uu, = C,U,, (15.111) 


Applying the following finite difference to the derivative terms, we have 


Uu, =F [u(xt+k)—ux,0)] (15.112) 
Uy. = lsh) 2a) tule) (15.113) 

To simplify the notation, we introduce 
Um» = U(mAx, nAt) (15.114) 

Then, (15.112) and (15.113) becomes 

1 
u,= Fy ima = Unn | (15.115) 

1 
Ux = tn n — Un n FU al (15.116) 
x (Ax)? +1, i 1, 


Thus, we can substitute these results into the diffusion equation given in (15.111) 
and rearrange this result as 


c,At 
Un n+t Unn + (any? Unsin Qn n Hl ig) (15.117) 
We now seek an exponential solution as 
imO ind 
Unn=ee€ (15.118) 


Substitution of (15.118) into (15.117) gives 


5 - - ‘ Cc. At 4 . : é ye , 
ind pi(at)4 = ein? gina pow (gerne _ ei? gina + elm Ne gine (15. | 19) 


(Axy’ 
Thus, 8 and 4 resemble spatial and temporal variables. This can be simplified to 
. At _; ; At 
el =14 2 [el 246°] =1- "2 ~ cos) (15.120) 
Ax) 
This can further be rewritten as 
; 4c,A 
fain ¥ sin?(2) (15.121) 
(Ax) 2 
Note further that 
cilRe(A)+ilm(Ay] _ LiRe(4),-Im() (15.122) 
For a large time variable A, we have 
Im(A)>0>e™™ +0, asd<>oo (15.123) 


Equivalently, we also have 
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“1 <1= stable (15.124) 


This is because the solution will converge to zero as suggested by (15.118). 
Imposing condition (15.124) onto (15. Ny we obtain 


4 At Ac, At 
Osi ng ys <2 (15.125) 
(Ax) (Ax)? 
Thus, the von Neumann test gives the condition 
At 21 
eS (15.126) 
(Ax)" 2 
Consequently, the von Neumann test leads to the stability condition of 
ia? 
At< col (15.127) 
26 


2 
If the time step is not set according to (15.127), the numerical scheme will not 
converge to the exact solution. 

Note, however, that a numerical scheme being stable does not necessarily 
imply that the answer must be accurate. If the time step is taken to be too small, the 
round-off error may accumulate in the large number of operations. It is because all 
computers can only retain a finite number of digits in each numerical calculation. If 
the number of steps becomes unnecessary large, the error from round-off 
accumulation may eventually degrade a “good” numerical scheme. Similar to the 
asymptotic series expansion, we should not be too greedy on the accuracy. For this 
reason, modern numerical codes normally allow the time step to vary depending on 
the error control scheme such that a larger time step is used whenever possible and 
a very small step size only where necessary. 


15.3 FINITE DIFFERENCE FOR SECOND ORDER ODE 


We now consider the second order ODE, which repeatedly appears in dynamics 
formulation using force equilibrium of mechanical systems. In particular, consider 
the following linear system of ODEs: 
2 
M “C4. ka= fi) (15.128) 
dt dt 
For the time variable, we can discretize using a trial function in time as: 


a~a=) a"N,,(0) (15.129) 
m=1 
where JN, has to be at least of degree two. Figure 15.6 shows a three-node quadratic 
element in time. 
As illustrated in Figure 15.6, an approximation for a depends on the value of 
three time nodes as: 


a=a" Ni t+a°" NS tae"? NB (15.130) 


where the approximation functions are 
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TA-T . TA+T 
NI, =) Wn =1-7, Mo = S2 as.tzy 
Pa Non ae Nonst Non42 
1 
t ” wt eeesanenenenesnnenente” t : aie acetachsn digs ua saaseeeee™ bon 2 
2 i, 2n+l Ie is 
Node 0 12 3 4 5 6 7 
O—_®—_O—_®—_O0—_-®—_-O—_-—_- 
Time 4 4 bf th ty ts bk ty 
Element | 0 | 1 | 2 | 3 
Figure 15.6 Discretization of time in nodes and elements 
with 
pat Bet (15.132) 
At,, 
As shown in Figure 15.6, we have 
At, ~ bon42 lonwt ~ bons Lon (15.133) 
It is straightforward to see that at time fz, 
Pre=l, N3n =1, None =0, N3n+2 =0; (15.124) 
at time fi 
T=0, Nj, =0, Non =L Now =93 (15.135) 
and at time f2)+2 
f=, Non =0, N3nsl = 0, N3n42 =1,. (15.136) 
Note that the differentiation of the trial functions given in (15.131) leads to 
1 1 
n ——+T n n a+P 
dN3,, _ 2 AN3na1 2T AN3n9 = 2 (15 137) 
dt At, | dt At, df At, 
d°N3, 1 d Nia 2 dN as i (15 138) 
ad? AR edt? A?’ dt? AP , 
Using these approximations, we have 
2 
ui a = M( I az" 2 qn +g?) (15.139) 


Ate At Ate 


dt’ 
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da . ; ve 2T ; ae 
C _ C( az" a ee (15.140) 
dt At, At, At, 
Ka K( a T) a" +(1 T? Ja 2n+1 een) (15.141) 
Using the weighted residual Lane we have 
% 2 
| ies C+ da. Ka— f (NW, dt =0 (15.142) 
0 dt dt 


With the time discretization shown in Figure 15.6, we can break down the time 
integration into segments: 


bons2 kg a da 
| [Ma +co + Ka—f (t)W,dt=0, n=0,1,2,.. (15.143) 
bn t? 


In view of (15.139) to (15.141) and the definition of (15.132), the integration in 
(15.143) leads to the consideration of the following integrations: 


boyny2 ' 
[ G+ T Maat = At,y{ W,dT re 
on -1 
tc : 
{ an, d= Atl, W, dT (15.145) 
by - 
[ Qn+2 fi< r W, He ij Qn+2 is G +T) 215 T+T)]W,,dt 
m boy (15.146) 


=At,{—+yv-2 W,,aT 
Gt7-2Bf WM, 


(ag a1 pw. at= } aah rat 
tm 2 tm 2 (15.147) 
= At, {-1+ AL Mar 
iad A 1 
W, dt = ) [—-(<+T)+—T(1+T)W,dt 
le 2 2 2 (15.148) 
=A1, (5-74 B}) WaT 


iC | T) 


2n 


bons bone 
[P° Carwae= fP"-2e TD Wae 
fo fm (15.149) 
= At, {1 ane 27} WaT 
zs 


where 


1 
[ Genmar 
yor2 (15.150) 
[ mar 
-1 
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1 
{ T(1+ Dy de 
p= 2 (15.151) 
[ mar 
-1 
Substitution of these results into (15.143) gives 
a”? (M+ yAt,C + BAt K) 
+a°"* "2M + (1-2y)At,C + G -~28+y)At? K] (15.152) 
+a>"[M —(1-y)At,C + GS + B-y)AtK]= f"At 
where 
1 
} Slbnn ba TAt,, W,,aT 
—_— (15.153) 


1 
[ mat 
-l 


Note that the actual value of B and y depend on the weighting function that we used 
in (15.142) and (15.143). The formulation given in (15.152) is related to the 
Newmark method, which will be introduced in a later section. If we used the same 
interpolation function for the forcing term, we would have 


a = -" Nz, go Noa TT Nes (15.154) 
It is straightforward to see that 
n n 1 n 1 n 
P= BFPO HS 2B ENS HS + B- (15.155) 


In the next two sections, we will introduce the two most popular methods in 
structural dynamics, namely the Wilson 0 method and Newmark 8 method. 


15.3.1 Wilson 0 Method 


The Wilson @ method is an extension of the linear acceleration method. According 
to Figure 15.7, the acceleration at a time t+r can be expressed by linear 
extrapolation as: 


ue =u ‘ogc =") (15.156) 
t 


where @= 1. The Wilson @ method is an implicit method because the stiffness 
matrix K has to be evaluated at the unknown displacement Ut“. This method is 
unconditionally stable for 92 1.37, but in actual numerical calculation, = 1.4 is 
normally used. The idea of the Wilson @ method is to allow a bigger time step but 
at the same time the numerical scheme remains stable. The acceleration is assumed 
to be linear from ¢ to #+@At. When @ = 1, it reduces to the linear acceleration 
scheme. 
Integration of (15.156) gives the velocity as well as the displacement as: 
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ue =U! Gre Coe ar (15.157) 
3 : 
Ut =u! Urs sO'e fae _U') (15.158) 


yee 


t t+At t+OAt 
Figure 15.7 Sketch showing the Wilson 6 method 


Substitution of t = OAt into (15.157) and (15.158) gives 
Ut! 0 4 Ur + — (GPP a0) (15.159) 


ute = U' +U'GAt 


+o ore ». 2U’') (15.160) 


Solving for the velocity and acceleration in terms of the displacement at t = +-0At 
we obtain 


- 6 z = 
pia _ 8 is uy AOU (15.161) 
- OAt 
pir x 2 (yttan _ U')-20' — OAL it (15.162) 
The equilibrium is considered at time t+ GAt as: 
MU 4 cu& 4 Kym a ee (15.163) 


Since the acceleration is assumed to be linear, the force vector must also be linear 
as 


| i ae as an (15.164) 
Substitution of (15.161) and (15.162) into (15.163) gives an equation for 
displacement U'*”, This solution can then be substituted into (15.161) and 
(15.162) to obtain the acceleration and velocity. 
Before we consider the Newmark 8 method in the next section, we first set 0 
= 1 in (15.160) and obtain the special case of linear acceleration: 
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ut yt +a +50 \A0" (15.165) 


15.3.2 Newmark B Method 


The Newmark $ method can be formulated by starting from Taylor series 
expansion: 


2 3 - 
yt at At Gt AD ye OO (15.166) 
1! 2! 3! 
For small At, we can estimate the rate of acceleration as 
ae yi _ U' 
Us eo (15.167) 
t 
Substitution of (15.167) into (15.166) gives 
ut’ =U + atu! HANS yu +o - (15.168) 


Note that we have only retained the third order term in the Taylor series expansion. 
The idea of the Newmark B method is to modify the factor 1/6 in (15.168) to B 
such that all higher order terms in Taylor series expansion can be neglected. In 
particular, we have the Newmark B method as 


ut -yt +a! Han)? 5— 620" purty (15.169) 
Similarly, we can also use Taylor series expansion for the velocity as 
2 oF 
ut’ -U' +A ie. (15.170) 


Again for small At, we can substitute the rate of acceleration given in (15.167) into 
(15.170) as 


ut ot 4 Ar SS (eas =U" Ls (15.171) 
This can be rewritten as 
fyttae fre tans )b! Oe. (15.172) 


Using the same idea, we can replace 1/2 by y such that all higher order terms can be 
dropped: 
Ut” 2U' + At(l-yyU! +yU™} (15.173) 
Newmark originally proposed to use B = 1/4 and y = 1/2. Consider the special case 
that 
B=1/6, y=l/2. (15.174) 


Substitution of these values into (15.166) gives 


ut — ut +t (a0)? 50" oe") (15.175) 
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Note that (15.175) is exactly the same as (15.165) obtained in the last section, 
which is for linear acceleration. Thus, the special case given in (15.174) for the 
Newmark B method gives the linear acceleration method. Note also that for B = 0, 
it can be seen from (15.169) that the Newmark B method becomes an explicit 
scheme. 

Thus, in conclusion, the Wilson 8 method with 0 = | is equivalent to the 
Newmark B method with B = 1/6, and y = 1/2. In addition, both the Wilson and 
Newmark methods are exact if the acceleration in the problem is constant (this is 
clearly not true for the case of seismic loads). We should note that the Wilson 0 
method and the Newmark 8 method are the most popular finite difference methods 
in time integration in the area of structural dynamics. All dynamic finite element 
programs include the options of using either of these methods. 

Prof. Wilson was a professor at University of California at Berkeley and 
made significant contributions to nonlinear analyses of structures under seismic 
loadings, whereas Prof. Newmark was a professor at University of Illinois at 
Urbana Champaign and made significant contributions to seismic analysis of 
structures. In geotechnical engineering, Newmark chart was developed to estimate 
the vertical stress increment under arbitrary surface loading and Newmark sliding 
block model was developed to investigate seismic slope stability. 


15.3.3. Stability Condition of 3-Level Scheme 


We now return to study the stability condition of the 3-level scheme discussed in 
Section 15.3. In particular, a special form of (15.128) with no damping and no 
forcing term is considered 
2 
MoS + Ka =0 (15.176) 
t 


The corresponding numerical scheme given in (15.152) is simplified to 
(1+ BAthM'K)a?"** +[-21 + eo —~2B+y)AteM' Ka" 
; a (15.177) 
nee. G + B-y)At2M"'K]a”" =0 
We first return to the original system and consider the natural frequency of 
vibrations of the system. That is, we seek for a solution of the form: 


a = acos(at — 0) (15.178) 
Substitution of (15.178) into (15.176) leads to 
(-o°M + K)a=0 (15.179) 
It is clear that if the mass matrix M is invertible, we can rewrite (15.179) as 
(M'K-o Da=0 (15.180) 
This is the well-known eigenvalue problem. For nonzero vector @, we require 
det(M 1K — oI) =0 (15.181) 


If the size of the matrices M and K is MxM, the solution can be expressed as: 
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M 
a=) a,,CO8(@p,t — 5) (15.182) 
m=1 
where the eigenvector is normalized as 
a Ma,, =1 (15.183) 
a Ma,, =0, m#l (15.184) 
a] Ka,,=0, m#l (15.185) 
We can express the numerical approximation vector a” as: 
M 
a” => in On (15.186) 
m=1 


Substitution of (15.186) into (15.177) gives 
(1+ Pato} wil"? +[-24(-28+ Nagoa lye" 


m 


F (15.187) 
+{1+ (St B-y)At?a2 ]y" =0 


For numerical calculations, we can further define: 
yor = Aun (15.188) 
Using this particular form, (15.187) is reduced to 
1 1 
(1+ Ato? )u? +[-2+ (528+ yAte ao |u+[l+ GC B-y)At@2 |=0 (15.189) 


There are two roots for 1, namely tt and py. We can see that u is in general 
complex. If t1; and Ly are complex and the modulus is less than 1, we have stability 
criteria as 


Stable: ets. t=1,2 (15.190) 
Stable and undamped: [24 =1;. 21,2 (15.191) 
Stable and artificially damped: | u4,| <i, a=1,2 (15.192) 


The roots of 1 can be expressed as: 


= = o)\2— 
pen g)ty¥2-g) -40+h) (15.193) 


2 
where 
Canezar — (2=pezar 
po —_ fa — (15.194) 
(1+ Ba; At; ) (1+ Bo; At; 
Using these definitions, we find 
2-(1-26+)e2M? 144-74 Pozar 
2-g-—_2 << , =—2 _ (15.195) 
(1 + BO, At, ) qd + BO, At, ) 


It is straightforward to see that 


Ja) = fu =V1+h (15.196) 
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Substitution of (15.196) into (15.190) to (15.192) leads to 
-1<h<0 (15.197) 
This leads to 
1 1 
It is clear that Newmark’s original proposal of B = 1/4 and y = 1/2 will lead to an 
unconditional stable solution. The linear acceleration method is also unconditional 
stable. 


15.4 MULTI-STEP METHOD 


So far, our discussion has been focused on the numerical scheme that the current 
time step depends only on the solution of the last time step. This is known as the 
one-step method. It is natural to ask whether we can estimate the solution at the 
current time step as a function of previous time steps (i.e., more than just the last 
time step). This technique is referred as the multi-step method. Consider a first 
order ODE as 


d 
afb Wl) =¥o (15.199) 
Assume that an approximation of y is given as: 
y(t) =o) (15.200) 
The numerical solution can be written as: 
Cnt 
Abyss) Aly) =|" Hat (15.201) 


15.4.1 Adams-Bashforth Method 


The Adams-Bashforth method assumes a polynomial form of @’ in (15.201) and 
carries the explicit integration on the right hand side of (15.201). For example, if a 
second order function is assumed for ¢', we have 
P(t)h=¢'(0 =At+B (15.202) 
First, we can evaluate the constants A and B by considering the equations at two 
points (tn, Vn) and (tn-1, Vn-1): 
At, + B= f (tas Yn) = Sn (15.203) 
Ab, + B= f G1sYna) = faa (15.204) 
This provides a system of two equations for two unknowns and the solutions are 
A= ip = Ja = ti Si 


t= h 


n n-l 
Tina tna Sntn -l 
h 


where h is the time step. Thus, substitution of (15.205) and (15.206) into (15.202) 
gives 


(15.205) 


B= (15.206) 
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ttl A 
"Ode = J (Ae, + Bld =F (ips 3) + Bllgat bn) 


ty 


tn ai Tis t = Jul n- 
= nat = Gna +t) + te h (tat t,) 
; ; (15.207) 
fay Ey = 2b Lo =Gag atl, 
= (esta Maly 5, (Hata 
= ohh =n 


Substitution of this result into (15.201) gives the following second order Adams- 
Bashforth finite difference scheme: 


3h 1 
Ynet = In +> Pe - Mf, (15.208) 


This is an explicit scheme as the unknown only appears on the left hand side and no 
iteration is needed. The truncation error is proportional to h?. If we use only the 
constant term in (15.202) and follow the same procedure, we will arrive at Euler’s 
forward difference method that we discussed in Section 15.2.1. Following a similar 
idea, we can extend the analysis to higher orders. For example, if we employ the 
results of previous steps to fit an approximation of third order, 


P,(t)= AP + Be +Ct+D (15.209) 
we have the following fourth = Adams-Bashforth formula 
Vn+ = 6554, - =.) +37 n-2 =O 53) (15.210) 


Again, this is an explicit eel scheme. 


15.4.2 Adams-Moulton Method 


There is another variation of such derivation and it leads to the Adams-Moulton 
scheme. The only difference is that instead of using the previous solution in fitting 
the power law for the differential form, one may use the unknown solution in fitting 
the power law. More specifically, for the second order case we have 

P(t)=¢'(H) =at+B (15.211) 
First, we can evaluate the constants by considering the equations at the current 
point (fn, yn) and the unknown point (f1+1, yn+1): 


at, + B= f (tis In) = Sn (15.212) 
Abn) + B = F byt Pret) = Fe (15.213) 
Thus, solving we have 
= fusi~ In (15.214) 
h 
B= Sub, n+l ; = Saat n (15.215) 


Following exactly the same procedure, we obtain the second order Adams-Moulton 
formula: 
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1 1_. 
Vast = Vn ran +5 Pasi nse Yast) (15.216) 


This formulation is an implicit finite difference scheme as the unknown also 
appears on the right hand side of (15.216). If we take simply the constant term in 
(15.211), we arrive at Euler’s backward method. This idea can easily be extended 
to the case of higher orders (such as the one given in (15.209)), and we can obtain 
the fourth order Adams-Moulton formula 


h : 
Mn+ = In + Ofnet 119 Sn — 5 Ina + n-2) (15.217) 


Again, this is an implicit numerical scheme. Moulton was an American astronomer 
who derived this formula during World War I when he worked on the ballistics 
trajectories for the US Army. Moulton also published a number of books related to 
astronomy, including celestial mechanics (Moulton, 1914). 

Since the implicit method is more complicated (and iterations is needed), one 
may ask why it is being proposed and used in the first place. It turns out that some 
differentiation equations are stiff, in a way that a much smaller step size is needed 
for stability than for the accuracy requirement. For such problems, the backward or 
implicit scheme is found stable independent of the time step whereas the time step 
in the explicit method must be constrained to arrive at a stable solution (e.g., 
compare Example 15.2). 


15.4.3 Predictor-Corrector Method 


A popular method is to combine the explicit method and the implicit method. In 
particular, we approximate ¢’ by polynomials passing through several previous 
points and possibly also passing through the current point ¢,:1. The evaluation of 
the integral in (15.201), in general, leads to the following form: 
Vn+t =Nn + h(Qo frst F af, + A fy + a3fy,-2 + +) (15.218) 

If a is zero, we have the explicit method; otherwise, we have the implicit method. 
To solve (15.218), we can use either functional iteration or Newton’s method. For 
the iteration, we have to make an initial guess for y,+1 and substitute it into the right 
hand side of (15.218) to get an updated value of yn. If the change of the updated 
value is too large, we can repeat the iteration process. To get an initial guess, we 
can use the explicit method, such as the Adams-Bashforth method. This is the 
predictor step. It is essentially an extrapolation of previous data points. Once we 
have the predicted value, we can use it to interpolate the derivative term to get a 
corrected result. This is the corrector step. The comparison of the predicted and 
corrected values provides information on the local truncation error and this can 
lead to error control and to adjusting step size. 

More specifically, a popular predictor-corrector method is the Adams- 
Bashforth-Moulton scheme. For example, we can use the third order Adams- 
Bashforth as the predictor equation: 


h 
Vntt =Vn ee -16f,4+5f,->) (15.219) 


Once the prediction for yn+1 is made, we can use the following third order Adams- 
Moulton method as the corrector equation: 
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h : : 
Vn+t = Vn +5 Osan +8f, = faa) (15.220) 


Note that with the help of the predictor equation in (15.219) the formula given in 
(15.220) is actually an explicit scheme. This is the beauty of this method. That is, 
we don’t have to solve the nonlinear implicit equation. If the predicted and 
corrected solutions differ too much, we can repeat the iteration process using 
(15.220). If after a number of iterations the accuracy is still not up to our 
expectation, we may consider reducing the step size instead of continuing our 
iteration process. Note, however, that this predictor-corrector scheme is essentially 
an explicit scheme and, thus, the strong stability property of the implicit methods is 
lost and should be used for stiff ODEs. 


15.4.4 Backward Differentiation Formula 


An alternative to the Adams type of numerical scheme is to assume a polynomial 
for the unknown y instead as for y’ in the Adams approach. In particular, we 
assume in the backward differentiation formula: 

PO=¢t=at+P (15.221) 
First, we can evaluate the constants by considering the equations at the current 
point (¢,, Vn) and the unknown point (fn+1, Va+1): 


at, +B=y, (15.222) 
Anat B= Va (15.223) 
Thus, solving we have 
aq = 2a In as of (15.224) 
However, since we have 
RO=2= fons Dns)> (15.225) 
combining (15.225) and (12.224), we obtain 
Vn+l = Vn thf nits Yast) = Yn +hfns (15.226) 


This is an implicit method, and equals (15.15) or the Euler backward difference 
method. This method can be extended easily to higher order polynomials. In 
particular, the fourth order backward differentiation formula is 


1 
Vat = 55 48 = 36),-1 a 16y,-5 = ¥ic8 + 12hf (ty 41> Vint | (15.227) 


This method has been found useful in solving stiff ODEs. 


Example 15.3 Consider the solution of the following “stiff? ODE by using Euler’s 
forward and backward schemes, and the fourth order backward differentiation 
formula: 

a =-C (15.228) 
ames 


where c is a very large number. 


880 Theory of Differential Equations in Engineering and Mechanics 


Solutions: Using Euler’s forward difference scheme, we have 


Ynvt = Vn tlh = Vn — PY n = ¥,A-ch) (15.229) 


If the absolute value of (1—c/) is larger than 1, the finite difference will not be 
unstable in the sense that y  o as n — o. In other words, for a stable solution we 
need to control the time step / as 


hee (15.230) 


c 
For example, if c = 10000, we need to have a time step of less than 0.0002 just for 
the stability of the solution. 

If we apply (15.226) or the Euler backward difference scheme, we get 


Yn+l = Ln —chy, 41 (15.231) 
Solving for yn+1, we obtain 
Yn 
= 15.232 
ant l+ch ( ) 


We can see that no matter how large c is, the implicit scheme is stable for all time 
steps of h. 
Using the fourth order backward differentiation scheme, we get 


1 
Ve = 48y, -36y,_,+16y,_) —3y,_ 15.233 
ont TPS Vn n-l Vn-2 y 3| ( ) 


Again, we see that (15.233) is unconditional stable. Therefore, the implicit 
backward differentiation formula is a good choice for solving stiff ODEs. 


15.5 RUNGE-KUTTA METHOD 


People often save the best for the last. We now discuss one of the most popular and 
successful finite difference schemes, which is known as the Runge-Kutta method. It 
was originally proposed by Runge in 1895 and was extended to solve systems of 
equations in 1901 by Kutta. Runge was well known for his work on spectroscopy 
and Kutta was famous for his airfoil theory. 

The idea behind the Runge-Kutta method is to increase the order of accuracy 
without reducing the time step. In particular, we consider the following first order 
ODE: 


7 = f(t, Vv) (15.234) 


The Taylor series expansion of the solution at time step n+1 about that at time step 
nis: 
h W df. ww ida’*f. h' af 
=y,+ sss 15.235 
Yn =Nn The TAs 3! dt n at ap oe ( ) 


where the time step is 

h=t 
The idea of the second order Runge-Kutta method is to look for the constants a and 
b in the following formula: 


t (15.236) 


n+l 'n 
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Yaad =n + Alaf (ty, ¥n) + Of nar Kn 1+ OCH?) (15.237) 
where 
Kin =n thf (t,.¥n) (15.238) 


We want to find the values of a and b such that the accuracy of the finite difference 
scheme is of accuracy O(h*) instead of just O(h). In a sense, we are using the slope 
of y’at both initial point (¢,, yn) and final point (t)+1, vat) in (15.237); and we want 
to adjust their corresponding weightings such that the accuracy would be one order 
higher. Note that for a = 0 and b = 1 we recover the Euler backward difference 
scheme and for a = 1 and b = 0 we recover the Euler forward difference scheme. 
Physically, k,1 is the first estimation of the solution of y,+1 using the Euler forward 
difference scheme. In essence, (15.237) is an explicit scheme. 
Note by definition that the slope at the end point is 


dj 
Iasi) = S(t, +h, y, +i.) = ip ne n (15.239) 
Substitution of (15.239) into (15.237) gives 
d 
Ynst =u HLAP Cy) +Bfy + BHD), +O) 


(15.240) 
= 2 3 
Se ATO) LTO A et ONE) 
Comparison of (15.240) with the Taylor series expansion (15.235) leads to 
1 
=b=— 15.241 
a 5 ( ) 
Finally, we obtain the second order Runge-Kutta method 
1 1 
Vn = Yn + AL S(t, Vn) + sf nok mI + O(n’) (15.242) 


This is also known as the trapezoidal rule. We note that we can raise the order 
accuracy of the finite difference scheme by an order of magnitude, if we are willing 
to evaluate the slope (i.e., the right hand side of (15.234)) twice. 

This idea can be extended to higher orders; fourth order Runge-Kutta method 
is by far most popular and defined by the following approximation: 


Vast = Vn + hl af (t, Vn) a7 Of tnsij2>Km ) ay of (tnsi/2> ky2) + Of (laaas ky3)] a O(h’) 


(15.243) 
where 

kent = Yn tf (tao Vn) (15.244) 

kya =n tA (trati2>Kna) (15.245) 

kins =Yn tf Cnsv2>Kn2) (15.246) 


In general, for m-th order Runge-Kutta method, we have 
Kn =i thf bisVa) 
Kacpy = Yn t+ hF Enstiaoknip-y)s 2S p<m (15.247) 
Kentm’ = Vat hf tna > Kntm-ty) 

Following the same idea, we can expand the slopes in (15.243) as 
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h h 
F nasa kn) a Itt; a 2 Vn + 2 Fi) = ie + 7 ( 4, (15.248) 
h h 
SF (tasts2>Kn2) = Sith a zen + 5 Ff nsvi2> md} 
. tt, (15.249) 
= tnt 2 alla + 2 apm 
SF (tnatkn3) = S(t, +h - oa k,2)) 
dj (15.250) 
= fy WE, + Se OD), 
Substitution of (15.248) to (15.250) ae (18: sa cans to 
Ynat =n hl ly + BUS +S (LY) +e + Ua (PM 
hd h df 5 
+dtf, +e eS aw lie 5 7 W)k]+O) 
(15.251) 
This can be rewritten as 
Ynst = Nn +h{a+b+c+d}f, +h tf OG +. 2 4d} 
(15.252) 


+P ( hceh + hh 26 yo 


Comparison of (15.251) a the ees series expansion in (15.235) gives four 
equations for four unknowns: 


at+b+c+d=1 (15.253) 
ae as (15.254) 
2 2 2! 
oa (15.255) 
4 2 3! 
d 1 
sae 15.256 
4 A! ( ) 
This system of equations can be solved easily and the solutions are 
jae _—— (15.257) 
6 3 


Finally, we obtain the popular fourth order Runge-Kutta method: 


h 
Vana = Yn + AL lun) + 2F Cnsri2kn) + 2S Cnsrir> bua) + frais) + OUP) 


(15.258) 
This is called the fourth order in the sense that the order of error is O(h°). 
Physically, the first slope in the bracket term on the right of (15.258) is the slope 
evaluated at the initial point, the second term is the first approximation of the slope 
evaluated at the mid-point, the third term is the second approximation of the slope 
evaluated at the mid-point, and finally the fourth term is the slope evaluated at the 
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end point. There is also a clear symmetry in the coefficients of a, b, c and d. Thus, 
we keep the time step as /, but the order of error becomes O(h°) instead of O(h?). 
The price to pay is that we have to evaluate the right hand side of (15.234) four 
times. 

Without going into the details, we quote the following Runge-Kutta formulas 
for different orders. The third order Runge-Kutta formula is 


h 
Vn+t = Vn + lf Cnn) + f rsa Kn) +f (triton) + O(n*) (15.259) 
The fifth order Runge-Kutta formula is 
h 
Vn+t = Nn + jsf ln In) +5 f trsi2>Km) +5 f(traia>Kn2) 


(15.260) 
+3f Oneisasns)+ fCnsiokng + OO") 
The sixth order Runge-Kutta formula is 
h 
=V, tA Gy Vn) FISS tras Kn) t+ 1S P Gita Ky 
Yn+i = Yn aut nod ) ig +1/2 1) f( +1/2 2) (15.261) 


+9 fF (thst > ky3) Af (thr 2 kn4) + F (bua > kins ) + O(h’) 
For the proof of these formulas, we refer to the problems at the back of the chapter. 
We note that there is no more symmetry in the coefficients for scheme beyond the 
fourth order. 
Let us consider the general k-th order Runge-Kutta method as: 


Vi Vee Ala f(t, Vn) + anf (ty41/2> Kn) + asf (thas2> kno ) +... 
+n iF bn41/22%n(n—2)) + Inf Cnt kn(n—r) + och") 
Following the procedure in obtaining (15.253) to (15.256), it can be shown that the 


coefficients in (15.262) satisfy the following system of k equations for the k 
unknowns: 


(15.262) 


Ay Fy +z +g Heseeccccsceees +a, ,+a, =1 
U2 sy ck 2 fol g =— 
2 2 2 2, 2! 
hein Ou. 2 ne gens 
a - 2. 
aA Apay a, 1 (15.263) 
kA yk k-2 7 kl 
an-| + ay _ 1 
gn-2 gn (n = 1)! 
Ay _ 1 
gn-2 ~ n! 


The last equation gives the constant a, instantly. The second to last equation gives 
dn-1, and so on. The solution of these equations can be compiled as: 
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gn-2 
Gs (15.264) 

n!} 

yn-2 

an) => — 24, (15.265) 

(n-1)! 

ak 

ay "op oe Ay jae Ay, 2Sk<n-2, nz (15.266) 


a, =1-a, -—a,_) —...- a (15.267) 
Using (15.264) and (15.265), we can first find the last two coefficients, then use 
(15.266) to find the other coefficients. The last coefficient must be found by 
(15.267) as suggested by the first equation of (15.263). Although we give the 
solution scheme for generating the higher order Runge-Kutta method in (15.264) to 
(15.267), the Runge-Kutta higher than 4th order is seldom used because a higher 
order scheme does not necessarily give a more accurate result (Press et al., 1992). 
The Runge-Kutta method (especially the fourth order method) has been used 
extensively by researchers. Although it may not be the most efficient numerical 
scheme, it is very stable and reliable especially when those with an adaptive step 
size algorithm are incorporated in the numerical code. For example, for the same 
accuracy the predictor-corrector technique may be more efficient. The fifth order 
Runge-Kutta formula can be embedded in the algorithm to estimate the plausible 
error at a certain step for the fourth order Runge-Kutta method. The step size can 
then be lengthened or shortened according to the estimated error. Section 16.2 of 
Press et al. (1992) gives a more detailed discussion of this error control algorithm. 
In short, for fourth order Runge-Kutta method the error estimate A can be 
determined by comparing the fourth order prediction (say yn+i in (15.260)) with the 
fifth order Runge-Kutta method (say *,+1) as: 


5 

A= Vast —V* nu A (15.268) 
Since we are using the fourth order method, we are expecting the error is 
proportional to /°. Suppose that the error estimate at the current step with step size 
h, is A; and the required error is Ao with a corresponding projected step size ho. 
Then, we must have 
Ao js (% 
A, h, 
If the current estimated error is smaller than the requirement, we can have Aj/Ao < 
1. Thus, we can extrapolate to increase the step size to: 


( ; (15.269) 


A 
In =hy(—%)"” (15.270) 
A 


If the estimated error is larger than the requirement, we have Aj/Ao > 1. We can use 
the same formula given in (15.270) to reduce the step size. However, due to the 
inherit uncertainty in the error estimations, Press et al. (1992) proposed a more 
conservative approach. In particular, it was proposed that a larger exponent index 
is used (say 1/4 instead of 1/5) for the case of reducing the step size. In summary, 
we have (Press et al., 1992): 
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A 
hy = hoy, Ay =A, 
1 
(15.271) 
= (224, Ay <A, 
A, 


This approach is also equivalent to assuming that the error is actually proportional 
to h* (more conservative thinking). 

In solving the highly nonlinear landslide model based on state- and velocity- 
dependent friction law, Chau (1995, 1996b) found that the fourth order Runge- 
Kutta method with adaptive step size algorithm recommended by Press et al. 
(1992) was very reliable. 


15.6 NEWTON-RAPHSON AND RELATED METHODS 


In this section, we consider the Newton-Raphson iteration method for the nonlinear 
problem. This method was developed by Isaac Newton in 1671 but it was not 
published until 1736, while essentially the same method was published by Joseph 
Raphson in 1690. The currently adopted version of the Newton-Raphson method is 
actually due to Raphson, which is simpler than Newton’s version. The method was 
originally developed for searching the roots of an equation. In addition, some 
related methods are also introduced, namely the initial stress method, modified 
Newton-Raphson method, and quasi-Newton or BFGS method. 


15.6.1 Newton-Raphson Method 


Let us illustrate the idea by considering the following second order ODE, which is 
obtained for nonlinear undamped structural dynamic problems: 

2 

Moo + fU)-RO=0 (15.272) 

t 
where U is the displacement vector for the dynamic system, M is the mass matrix, f 
is the restoring force vector, and R is the external applied force vector on the 
system. Clearly, the nonlinearity in (15.272) comes from the nonlinear restoring 
force f. For simplicity, we focus on the static problem and assume that we have the 
following solution at time ¢ 


F(U') = f(U')-R (1) =0 (15.273) 


where U' denotes the actual solution at any time. Considering the Taylor series 
expansion of F at U"“ about an approximation after i—1 iteration, we have 


aF(U') 
aU 


F(U‘"') = F(U‘,) + (eet at ie... (15.274) 
ial i-l 
we 


Assuming the external force R is not a function of displacement U, we have 
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SF) wr -op 
OU Luis 

(15.275) 
|=] AU w= Say — RO 

OU Jui 

Note that the superscript indicates the current time whereas the subscript denotes 
the iteration number within the time step. The right hand side of (15.275) is the 
unbalanced force at the current iteration (in general not zero) and this term must be 
calculated exactly. The tangent stiffness matrix is denoted by [0f/0U], which is 
evaluated based on the displacement at the new time step and from the last iteration 
step i-1. If we use K as the tangent stiffness matrix, we can write (15.275) as 


=] AU y= KG AU y= Sy — RO (15.276) 
OU Jus 


This system is used to solve for the displacement increment, and with this the 
updated displacement after the i-th iteration is calculated as 
t+At t+At 
Un =U Gay +AU Gy (15.277) 
This iteration process starts with the stiffness matrix, restoring force vector, and 
displacement vector of the last time step as follows: 


Kiy =K' (15.278) 
fo -=f' (15.279) 
Ug =U’ (15.280) 


Equations (15.276) and (15.277) constitute the Newton-Raphson method, and it is 
illustrated in Figure 15.8. We can see from (15.276) that the tangent stiffness 
matrix needs to be updated and solved (or inverted) at each iteration. For large 
systems and highly nonlinear problems, it is the most time consuming process for 
the method. 


SU) 

Unbalanced force 
A t+At 

fi 


t 


Aut” 
(dl) t+At 


Figure 15.8 Ilustration of the Newton-Raphson method 
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15.6.2 Initial Stress Method 


In view of the computationally demanding effort in the Newton-Raphson method, 
several variations of the Newton-Raphson method have been proposed. The 
simplest and fastest one is to use the stiffness matrix of the system before the 
loading is applied. That is, we replace (15.276) by 


RMU p= fey -R™ (15.281) 


This is known as the initial stress method. If the system of equations results from 
the finite element method (FEM), it is the linearized response about the initial 
configuration of the FEM model. For highly nonlinear problems, the convergence 
of the method is slow and sometimes the solution may even diverge. 


15.6.3 Modified Newton-Raphson Method 


The modified Newton-Raphson method lies somewhere between the initial stress 
method and the full iteration method of Newton-Raphson and is called modified 
Newton-Raphson. It uses the tangent stiffness matrix from the last time step 
throughout the iteration process: 


K' AU y= fin ROY = ARG) (15.282) 


The modified Newton-Raphson method is less computationally demanding but then 
converges slower than the Newton-Raphson method. Physically, (15.282) gives the 
displacement vector increment at the i-th iteration to balance the difference 
between the internal restoring force vector and the external applied force vector at 
the (i-1)-th iteration. The modified Newton-Raphson method is illustrated in 
Figure 15.9. As expected, the modified Newton-Raphson method converges much 
slower than those in Figure 15.8, but, however, we do not need to solve the tangent 
stiffness at every iteration step. Thus, the modified Newton-Raphson is, in general, 
faster than the full Newton-Raphson method. 


SU) 

Unbalanced force 
1+At t+At 

Soo) —R 


Figure 15.9 Illustration of the modified Newton-Raphson method 
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15.6.4 Quasi-Newton and BFGS Method 


The shortcoming of the modified Newton-Raphson method leads further to the 
development that we should update the tangent stiffness as required by the accuracy 
as well as the degree of nonlinearity of the problems. This variation is called the 
quasi-Newton method. In this method, the tangent stiffness is updated according to 
whether a certain error control condition is satisfied. For example, the following 
criterion is used for the so-called BFGS method: 


5) Bii 
(Pi 
Cy “lait g = Fe (15.283) 
6) Ki Bi 
where 
Gi) =U” — UGS) (15.284) 
Buy = ARG — ARG (15.285) 


Whenever the condition number defined in (15.283) is larger than the prescribed 
number & (or (15.283) is violated), an update of the tangent stiffness is performed. 
This BFGS method is illustrated in Figure 15.10. 

Therefore, in a sense this is an error-driven scheme for updating the tangent 
stiffness. This particular version of the quasi-Newton method is known as the 
BFGS_ (Broyden-Fletcher-Goldfarb-Shanno) method, which was developed 
independently by Broyden, Fletcher, Goldfarb, and Shanno all in 1970 (see Press et 
al., 1992). It is believed that the quasi-Newton method is most efficient, and is also 
widely used in hill-climbing types of unconstrained optimization problems. 


R 
fU) 


Unbalanced force 
t+At t+At 
Soy ~R® 


t+At 
Ki) 


U' ae t+At 7 
a AU) 


Figure 15.10 Illustration of the BFGS or quasi-Newton method 


15.6.5 Secant or Fixed Point Iteration Method 


There is another variation the called fixed point iteration or secant method. The 
stiffness is formed between the fixed initial point and the current point, and thus is 
a secant stiffness. The unbalanced force vector can be formulated: 
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=] Ve pideed = Uo] 
vis Wie) 


(15.286) 


|E| avast Re 
oU t+At t+At 
U5) Yo) 


) 
This secant method is illustrated in Figure 15.11. 


R 
fU) 
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t+At t+At 
Soy ~R® 


AUTt™ 
(1) t+At 
AUG) 


Figure 15.11 Illustration of the secant or fixed point method 


All Newton-Raphson type methods may or may not converge to the actual 
solution, depending on whether the function has a quadratic Taylor expansion near 
the solution point and on how far the initial point of iteration is from the true 
solution. In general, there is no single nonlinear algorithm that can guarantee 
convergence for every nonlinear problem. In particular, Figure 15.12 shows some 
situations in which at least one of the above nonlinear iterative methods breaks 
down. 


15.6.6 Convergence Criteria 


Regarding the convergence criteria for the iterations of the Newton-Raphson type 
methods, we can have at least three different approaches. 


(1) Displacement error control 


The increment of displacement after the current iteration step can be compared to 
the current value of the displacement. If the contribution in the displacement 
increment is less than a prescribed error, iteration can be stopped. The current 
value of displacement is regarded as the solution for the current time step. For 
example, we have 


[av 


2 


7 SS ép (15.287) 
t+At 
lav, 2 
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(a) Newton-Raphson breaks down 


‘N 


(c) Secant method breaks down 


Figure 15.12 Pitfall of some softening behavior of a nonlinear system 


where the norm of the displacement vector is defined as 


1/2 
lel, =| Set (15.288) 
i=l 


This definition is also known as the Euclidean norm. The dimension of the vector is 
denoted by 7 in (15.288). However, we should not be too greedy about this choice 
and its value should also depend on the current time step that we are using. As 
illustrated in Figure 15.13, it shows the case of a stiffening system and for such 
stiffening, the response displacement convergence criterion will fail. In view of this 
limitation, load error control is introduced. 
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Stiffening 


Figure 15.13 Stiffening behavior of a nonlinear system 


(ii) Load error control 


When a nonlinear system becomes excessively stiff with the increase of the 
displacement, displacement increment will be small but the magnitude or norm of 
the unbalanced force vector can remain large. It is sometimes more meaningful to 
control the error in the unbalanced force: 


ns _ Rite 
(i) 


a (15.289) 


Ir’ _ Rita 


2 
This load convergence criterion is not effective for the case in which a very soft 
response of the nonlinear system becomes apparent with the progressive 
displacement. A typical value of ¢, = 0.0001 can normally be prescribed; for 
example, a truss undergoing a large deformation due to elastic-plastic behavior of 
the material. In this case, we should use the displacement control convergence 
criterion. Figure 15.14 shows such a case of a softening system and for which the 
load convergence criterion will fail. 


R fied _R 


— +} 


Softening AU; 


U 


Figure 15.14 Softening behavior of a nonlinear system 
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(iii) Energy error control 


For highly nonlinear and complicated models of composite structures, it is 
sometimes difficult to pre-determine whether the structure is getting softer and 
stiffer under loading. Therefore, the third convergence criterion based on energy 
change in the system due to the unbalanced force at each iteration can be used as an 
error control criterion. It is because energy change is involved both displacement 
increment and unbalanced force: 


AUR” fo =ROY) 
AUG (f' =k) 


where & is a prescribed error. 


<ép (15.290) 


15.7 CONTROL ALGORITHMS FOR NONLINEAR ODE 


The static nonlinear system considered in (15.273) is first rewritten in the following 
form: 

fU)-R= f(U)-AR=0 (15.291) 
where A is the load parameter. Tacitly, we have assumed that the loading is of the 
proportional type. That is, the ratio between different loading components of R for 
each degree of freedom of the displacement vector U is constant, and this is the 
reason why we can extract a common loading parameter A in (15.291). We can also 
rewrite this vector equation in scalar form as: 


R' fW) 
RR 
The load control algorithm can be unified with the displacement control algorithm 
in the following constraint: 


yy BU -UPY + 6,50) -A°Y =e (15.293) 
k=l 
where /7; is a controlling parameter for each component of displacement or loading, 
and c is the size of the loading control. Clearly the unit of 4; (where i = 1,2,..., 1) 
differs from that of ,+1. The superscript for U denotes the degree of freedom and 
the subscript denotes the loading step number. A number of special cases are 
considered next. 


~A=0, or fU)-A=0 (15.292) 


15.7.1 Displacement Control 


For the special case 4; = 1 and all other A; = 0 (i # 7), we may select the most 
dominant displacement U; as the controlling parameter (typically at the degree of 
freedom subjected to the largest external load): 


Ui -U' =c (15.294) 


Displacement control may fail for the case of a very stiff system, and this is 
illustrated in Figure 15.15. 
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Figure 15.15 Stiff system in which displacement control fails and arc-length control may also fail 


15.7.2. Load Control 


For the special case f,+1 = 1 and all other £;= 0 (i # n+1), we select the loading 
step as the nonlinear parameter in searching for the next equilibrium solution. 
Mathematically, it can be expressed as: 

A=. =c (15.295) 
This type of loading control parameter is the most commonly employed approach 
in numerical analysis. Figure 15.16 illustrates the case of snap through buckling of 
cylindrical shells in which load control fails. 


Snap through 


AR ARy U 


Load control fails 


Figure 15.16 Snap through buckling problem in which load control fails 


15.7.3. Arc-Length Control 


If we set all £;= 1 (where 7 = 1,2,...,1+1), we have the arc-length control algorithm. 
We select the loading step as the nonlinear parameter in searching for the next 
equilibrium solution. Mathematically, it can be expressed as: 
VU Sy eG =) =e (15.296) 
k=l 
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Figure 15.15 shows that if c is too large, the arc-length control may also fail. In 
short, there is no single algorithm that works for all nonlinear problems, if the 
loading step is not controlled properly. 


5.8 FINITE ELEMENT METHOD FOR LAPLACE EQUATION 


The finite element method (FEM) is a numerical technique for finding approximate 
solutions to boundary value problems. It uses either the weighted residue method 
(e.g., Galerkin method) or the variational methods via the use of the calculus of 
variations (e.g., Rayleigh-Ritz method) to minimize an error function (in the case of 
the weighted residue approach) or a functional (energy in the case of variational 
approach) in a global sense and produces a stable solution. FEM divides the origin 
domain into many small finite sub-domains, named finite elements, to approximate 
a larger domain. The origin of FEM is not so straightforward. As summarized by 
Oden (1990), it probably dates back to the time of Hrennikoff in 1941 and Courant 
in 1943. However, this idea was not further pursued then since computers were still 
largely unavailable in 1940s. Decades later, the term “finite element method” was 
coined by the renowned structural and earthquake engineer Ray Clough in 1960 
when he was involved in the design of the wings of the Boeing 747 airplane 
(Clough, 1980). The first finite element book was The Finite Element Method in 
Structural and Continuum Mechanics by O.C. Zienkiewicz in 1967. 

In this section, we will illustrate the main concepts of FEM using the Laplace 
equation. In particular, for potential flow around a circular cylinder shown in 
Figure 15.17, we can formulate the problem as: 


V-y =0 (15.297) 

w=w,, onS, (@=1,2,3) (15.298) 

OY gy: on Sy; OU on Ss (15.299) 
On On 


where y is the stream function. As shown in Figure 15.17, the surface of the 
circular cylinder is impermeable, and both the upper and lower boundaries are also 
impermeable. 

In view of symmetry with respect to both vertical and horizontal axes, we can 
consider a quarter of the domain. This quarter domain is further discretized into 
elements (in this case triangular elements). Within each element, we assume there 
is a fixed and assumed variations of the streamline function y between the nodal 
values of the element: 


3 
VOY) = NY, = MOY +N + NGOs (15.300) 
i=l 
where N; (i = 1,2,3) are called shape functions. The nodal values of y; are the 
unknowns. At node 1, we have N = | whereas the others’ shape functions are zero, 
similar to other shape functions. Without going into detail, it was found that 


1 
Ni (x, y) = aA — X32) + (V2 — V3 )x + (%3 — XQ) y] (15.301) 
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Figure 15.17 Finite element model for potential flow around a cylinder in a channel 


1 
Ny(x,y) = 5A a — X13) + (V3 — YW )x+ (1 — X53) y] (15.302) 


1 
N3(x,y)= ae = XV )+ (WY — V2 )X+ (OX —H)V] (15.303) 


where A is the area of the triangular element and (x;, y;) are the coordinates of the 7 
node in the element. Physically, this shape function can be interpreted as: 


A A A 
Nixy) =, N(x, y)=—8, Nj (x,y) = 24% (15.304) 
Ai93 Ay93 Ain3 
3(x3,.V3) 
2(X,¥>) 
1(x1,9) 


Figure 15.18 Interpretation of shape functions in a triangular element 
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where the area of triangle ijk is denoted as Ajx (see Figure 15.18). By adopting the 
Galerkin method, we have 


[[vvnaa =0) (15.305) 

Applying integration by parts, we can rewrite it as: 
ON; ON, 
(Kh +e sida = [anNias (15.306) 
ox x 

where 

dy ON; 

ph ce 15.307 

Ox fa Ox m ( ) 


For the triangular element, the above integration can be carried analytically but for 
other more complicated element shapes, numerical integration should be used (like 
the Gauss quadrature integration). If there are a total of m nodes and m elements in 
the whole domain, we can assemble the element in a matrix form: 


[Ki {vi} =a} (15.308) 


The boundary conditions given in (15.298) and (15.299) have to be applied to 
(15.308). The matrix equation can be solved for the nodal unknowns of yj. 
The total error associated with FEM modelling can be expressed as: 
Error = E+E, +2, +E; (15.309) 


where £, is the continuity error, E, is boundary error, E, is the interpolation error, 
and £; is the integration error. The first three errors are caused by discretization. 
The continuity error depends on the highest derivative in the formulation and can 
be removed if the shape function used in the element is raised. The boundary errors 
can be reduced if the number of elements is increased and if the size of the 
elements is reduced. The interpolation error is reduced if the element size is 
reduced and the order of the interpolation function is increased. 


5.9 FINITE DIFFERENCE METHOD FOR LAPLACE EQUATION 


In this section, we will illustrate how to apply FDM to PDEs, and again the Laplace 
equation is selected for the sake of simplicity. In particular, for potential flow 
around a circular cylinder shown in Figure 15.17, we can formulate the problem as: 


ow Oy 
ax? dy" 
where y is the stream function. As shown in Figure 15.19, the domain of the 
quarter problem of the potential flow around a circular cylinder can be expressed in 
a finite difference scheme. The following notation is adopted to express the 
streamline function at different grid points: 

7 =YV(% +iAx, Yo + jAy) (15.311) 


Vy = =0 (15.310) 
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where Ax and Ay are the grid sizes along the x-direction and y-direction 


respectively. 
Using the finite difference scheme, differentiation of the streamline with 


respect to x can be approximated as 


oy _w(x+Axy)-vay)_ 1 
sg = 1 o-W,, 15.312 
Be Ax Ax Wis) Wij ) ( ) 
Similarly, differentiation of the streamline with respect to y can be expressed as 
ow ~ VOY + AV)—- WY) _ I (15.313) 


By re i (Wi jut Wii) 


Ay; 
Viwjo---ot tio Vist; 
Ax Ax 
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ae 


Figure 15.19 Grid network used in the finite difference scheme 


Repeat the same procedure once more, and the second derivative can be 
approximated as: 


Oy 1 w(x+Ax,y)-y(x,y) v(x.9) -WX— AX Y), 


i~] 


2 
Ox” Ax ax (15.314) 


1 
7 ce Wis, —2Vi,j +Vi-y) 


Similarly, the second derivative taken with respect to y can be approximated as: 


Ow l 
ro = ae (Wi 541 — 2Wi,j +Vi ja) (15.315) 
For further simplification, we can set the grid size in both directions as equal: 


Ay = Ax (15.316) 

Substitution of (15.314) and (15.315) into the Laplace equation and utilization of 
(15.316) leads to 

W541 — 2,5 + Vij t+ Via — 2,7 tYi; =9 (15.317) 


Solving for yi; we get 
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1 
Wii = gin +W; 4 +Wiarj +Vias) (15.318) 


As illustrated in Figure 15.19, the value of yi; depends on the neighboring points. 
Once the boundary values of y are known on the essential boundaries, we can use 
(15.318) to generate the rest. For the natural boundary conditions, we can use either 
forward difference or backward difference schemes, depending on the location of 
the natural boundaries. For example, for problems shown in Figure 15.17 and 
15.19, the inflow boundary on the left boundary is 


oy ow 1 
= = —1,-W,,)= 15.319 
on ox \ (Vist; Wij ) qx. ( ) 
whereas the inflow boundary on the right boundary is 
ow ow 1 
= = -.-W.,;,)=q. 15.320 
an ax W;; Wij ) qs. ( ) 


15.10 SUMMARY AND FURTHER READING 


Numerical methods employed in solving ODEs and PDEs are under rapid 
expansion in the last few decades. Many restrictions of the finite difference method 
and finite element method have been removed in these new methods. A number of 
them are summarized briefly here, namely, the finite volume method (FVM), 
smooth particle hydrodynamics (SPH), and the material point method (MPM). Full 
discussions of them are out of the scope of the present chapter. 

A numerical technique called the material point method (MPM) is 
particularly useful in modelling large deformation problems, such as landslides, 
runouts, or dynamic fragmentations. This formulation uses a dual description of the 
media by using Lagrangian material points and a Eulerian numerical mesh. The 
MPM is an extension of the particle-in-cell method (a method developed in Los 
Alamos National Laboratory in 1957) incomputational fluid dynamics to 
computational solid dynamics, and is a finite element method (FEM)-based particle 
method. It is primarily used for multiphase simulations, because of the ease of 
detecting contact without inter-penetration. It can also be used as an alternative to 
dynamic FEM methods in simulating large material deformations, because there is 
no re-meshing required by the MPM. It was originally proposed by Sulsky et al. 
(1995). 

The smoothed particle hydrodynamics (SPH) method belongs to mesh-free 
techniques that have been widely adopted in many areas of mechanics. Smoothed 
particle hydrodynamics (SPH) is a computational method used mainly for 
simulating fluid flows. It was developed by Gingold and Monaghan (1977) and 
Lucy (1977) initially for astrophysical problems. It is a mesh-free Lagrangian 
method in which the coordinates move with the fluid, and the resolution of the 
method can easily be adjusted with respect to variables such as density. This 
technique can handle very large deformations and is more suitable for post-failure 
analysis in the case of solid mechanics. 

In recent years, the finite volume method (FVM) has become a popular 
numerical method used in fluid mechanics. “Finite volume” refers to the small 
volume surrounding each node point on a mesh, resulting from discretization of the 
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body. In the finite volume method, a partial differential equation is converted 
to surface integrals. These integrals are then evaluated as fluxes at the surfaces of 
each finite volume. Because the flux entering a given volume is identical to that 
leaving the adjacent volume, these methods are conservative. FVM is best for 
solving conservation law in integral form and can solve for discontinuous solutions. 
The most fundamental hyperbolic wave problem with a jump discontinuity is called 
the Riemann problem (Le Veque, 2002). In fact, most of the current finite volume 
methods make use of the Riemann problem as the building block, and therefore 
FVM literally uses the Riemann solver. Most of the FVM solution schemes used 
nowadays are of the Godunov type (Le Veque, 2002). Another advantage of the 
finite volume method is that it is easily formulated to allow for unstructured 
meshes. 


15.11 PROBLEMS 


Problem 15.1 Consider the following third order Runge-Kutta method: 


Vn+ = Yn + hlaf (t, > Vn) + Of Oysii2> Kn) a Of (Gate ko) + O(h*) (15.321) 
where i, and kn»2 have been defined in (15.244) and (15.245). 


(i) Show that the coefficients satisfy the following system of equations: 


atb+c=l (15.322) 
ens (15.323) 
i ae 
c ol 
Bia 15.324 
rey ( ) 


(ii) Show the validity of (15.259). 


Problem 15.2 Consider the following fifth order Runge-Kutta method: 
Vn+l = Vn tr Ataf (t, ? Vid + OF (thst/2 ? Kn) + SP (trav ? kn) 


: (15.325) 
+f (tn129kn3)+ PF Crate kng 1+ OM ) 
(i) Show that the coefficients satisfy the following system of equations: 
a+b+c+d+e=l (15.326) 
2g Sees (15.327) 
2 2.2 2! 
ite (15.328) 
> 22) 3! 
Laas (15.329) 
2 2 4! 
e l 
aa = 3) (15.330) 


(ii) Show the validity of (15.260). 
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Problem 15.3 Consider the following sixth order Runge-Kutta method: 
Yas = Vn + ALP (tas Yn) + OF En sti2>%m) + SF Otnst2>Kn2) 


(15.331) 
+f (trat/2>kn3) +B tnatio» Kuna) + SF Cat Kns J+ Of ) 

(i) Show that the coefficients satisfy the following system of equations: 
at+tb+c+dt+et+g=l (15.332) 
bcde 1 

+—+—+-—+g= 15.333 
2 2 2. 2 2 2! ( ) 
c de gi |i 
SF oh ge gk (15.334) 

d e g 1 
go oe a (15.335) 
é 2) 
a =5 (15.336) 
J 
> ae (15.337) 


(ii) Show the validity of (15.261). 


Problem 15.4 Derive the formulas governing the coefficients in the k-th Runge- 
Kutta method given in (15.263). 


Problem 15.5 In view of (15.263), show the validity of formulas given in (15.264) 
to (15.267). 


Problem 15.6 Prove the following seventh order Runge-Kutta method: 


h 
Vast = Vn + aed ? Vn) + TTF thar ? Kn) +133 Ff (tys1/2> kn) 


+636 (thst > kn3) + 28 fh thar 2 kia) + 10f Gnst/2 2 kas) + 2f tt > king ) a O(n’) 
(15.338) 


Problem 15.7 Prove the following fifth order Adams-Bashforth formula: 
h 
Veet Vet mee —2774f,_;+2616f,. -1274f,_,+251f,_4] (15.339) 


Problem 15.8 Prove the following fifth order Adams-Moulton formula: 
h 
Vnet =n t+ mo 1f4) +046f, —264f,_,+106f,_,—-19f,-3] (15.340) 


APPENDICES 


Appendix A: Greek Letters 


For mathematical and engineering analysis, Greek alphabets are normally adopted 
as mathematical symbols or scientific symbols. Engineering and science students 
who do not know how to pronounce Greek letters will have trouble communicating 
with others. It is of paramount importance to recognize and to pronounce Greek 
letters. Here we compile a list for easy reference by readers. There are a total of 24 
letters in Greek and they are summarized below: 


Table A.1 Table of Greek letters 


Number | Capital lower Pronoun. | Number | Capital lower Pronoun. 
1 A (or Alpha 13 N v Nu 

2 B B Beta 14 >) a Xi 

3 r Y Gamma | 15 O oO Omicron 
+ A a) Delta 16 II ™ Pi 

5 E E Epsilon | 17 P p Rho 

6 Z C Zeta 18 x fe) Sigma 

7 H n Eta 19 T T Tau 

8 12) 0 Theta 20 Y b Upsilon 
9 I l Tota 21 ® o Phi 

10 K K Kappa 22 x x Chi 

11 A Xr Lambda | 23 WY W Psi 

12 M LU Mu 24 Q (0) Omega 


Note that the partial differentiation sign 0 (pronounced as “round’’) is not a Greek 
letter. 
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Appendix B: Nobel Prize and 
Mathematicians 


Alfred Bernhard Nobel (1833-1896) was a Swedish chemist and engineer. He 
invented dynamite in 1867, gelignite in 1875, and ballistite (a kind of smokeless 
gunpowder) in 1887. When Nobel passed away in 1896, he left all of his 
considerable fortune to a foundation that funds five Nobel Prizes annually in the 
areas of physics, physiology or medicine, chemistry, literature, and contributions to 
peace. The first prizes were awarded in 1901, five years after the death of Nobel. 
The award also comes with cash of about | million US dollars for each prize. It is, 
however, not the cash prize but the instant fame that makes the Nobel winners a 
household name and an iconic figure of his or her generation. Even today, it is safe 
to claim that it remains the highest recognition and reward that can be earned by a 
“scientist.” In view of the important role of mathematics in the development of 
scientific and technological breakthroughs, many people, mathematicians and 
scientists alike, question why there is no “mathematics” category of the Nobel 
Prize. The renowned Fields Medal in mathematics is awarded to a young 
mathematician once every 4 years, but is no match for the Nobel Prize. A more 
recent establishment of the Shaw Prize, named after the Hong Kong-based movie 
tycoon Mr. Shaw, does include a category of “mathematical science.” Clearly, this 
reflects the views of scientists and mathematicians on this issue. 

However, there is an unconfirmed rumor about why Nobel did not choose to 
award his prize in “mathematics.” When Alfred Nobel set up his Nobel Prize, 
mathematics was one of the potential subject areas to be awarded. However, when 
he knew that Mittag-Leffler was a potential candidate for the prize in mathematics, 
Nobel crossed out mathematics and no such prize has ever been awarded. This 
negative impact by Mittag-Leffler on mathematics is definitely more far-reaching 
than his mathematical contributions. 

Nevertheless, some mathematicians have been nominated to receive Nobel 
Prizes and they also have played an important role in promoting the success of 
Nobel Prizes. Ironically, it was S. Arrhenius and G. Mittag-Leffler (the Swedish 
mathematician that Nobel disliked) who helped to mobilize large numbers of 
scientists from various nations participating in the nomination process. Arrhenius 
received the Nobel Prize in chemistry in 1903, and, of course, Mittag-Leffler never 
received the prize as a mathematician. In 1974, the Nobel foundation and 
associated institutions agreed to open their archives for historical research on 
materials at least 50 years old. Crawford (1985, 1998, 2001) is among the 
forerunner on the analysis of the selection process of Nobel Prizes. 

According to the released data, the renowned French _ engineer, 
mathematician, and physicist Henri Poincare (pioneer in stability analysis of 
differential equations) was nominated 34 times in 1910. In fact, Poincare received a 
total of 51 nominations in 1904-1912 before he passed away in 1913. Poincare 
himself had made 10 nominations and four nominations were successful (including 
Lorentz in Physics 1902, Becquerel in Physics 1903, Lippmann in Physics 1908, 
and Curie in Chemistry 1911). Mittag-Leffler made 6 nominations with one 
successful case (Lorentz in Physics 1902). The famous German mathematician 
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David Hilbert was nominated 6 times between 1929 to 1933. He also nominated 
the successful winner Peter Debye (Chemistry in 1936), and, incidentally, Debye 
received 47 nominations in both Physics and Chemistry. More importantly, being a 
student of Sommerfeld, Debye made a major contribution in 1912 to the 
mathematical development of the integral representation of the Bessel function of 
the first kind of large order using the saddle point method (see Watson, 1944) when 
he considered the vibrations of large spheres. 

We should note the story of Sommerfeld (supervisor of Debye) before 
continuing our discussion of the other mathematicians and their role in the Nobel 
Prize. Sommerfeld was nominated a record 81 times for the Nobel Prize within a 
period of 34 years before he was run down by a car in 1951, and he never received 
it. Otto Stern also shared the honor with the highest number of nominations of 81, 
but Stern was awarded the Nobel Prize in 1943 for demonstrating the wavelike 
properties of elementary particles. Ironically, he himself had made two successful 
nominations, one to Albert Einstein (Physics in 1921) and one to Max Planck 
(Physics in 1918). Even ironically, four of his PhD students did receive Nobel 
Prizes (Heisenberg in 1932 (PhD in 1923); Bethe in 1967 (PhD in 1928); Pauli in 
1945 (PhD in 1921); and Debye in 1936 (PhD in 1908)). Three of his other 
postgraduate and post-doctoral students also received Nobel Prizes (including L. 
Pauling in Chemistry in 1954 and in Peace in 1962, I.I. Rabi in 1944, and M.T. von 
Laue in 1914). Sommerfeld had published a six-volume series of classic textbooks, 
and one is Partial Differential Equations in Physics. This is a very good book on 
PDEs even by today’s standard. One of his mathematical contributions is 
Sommerfeld’s wave radiation condition in wave equation analysis. Sommerfeld was 
also a pioneer in the mathematical theory of diffraction of waves. 

Another famous French mathematician Jacques Hadamard made 32 
nominations and his successful nominees include Einstein, Perrin, Richardson, 
Yukawa, and Blackett. French mathematician Emile Borel (a pioneer in game 
theory and a main contributor on the theory of real variables) also made 6 
nominations and Nobel laureate Perrin is among them (Physics in 1926). Recall in 
Chapter 11 on Volterra’s integral and integro-differential equations, the renowned 
Italian mathematician V. Volterra also made 9 nominations (1 in chemistry and 8 in 
physics). 

There is also a long list of renowned mathematicians and mechanicians, most 
of their names are mentioned in this book, who made nominations or had been 
nominated. They include G. Darboux (Darboux’s formula in series expansion in 
Chapter 1), O. Heaviside (Heaviside step function throughout the text), Lord 
Kelvin (method of stationary phase in Chapter 12), I. Fredholm (Fredholm integral 
equation in Chapter 11), O. Backlund (Backlund transform in soliton), P. Painlevé 
(differential equation with movable poles), T. Levi-Civita (permutation tensor in 
Chapter 1), P. Levy (solution on plate bending), A.E.H. Love (Love’s wave in half- 
space), G.I. Taylor (main contributor in stability of fluid), P.M. Morse (author of 
the classic book on Method of Theoretical Physics), van der Pol (van der Pol 
nonlinear oscillator), L. Fuchs (Fuchsian singular differential equation in Chapter 
4), H. Bateman (Bateman’s manuscript project), L. Prandtl (boundary layer in 
fluids in Chapter 12), J. Boussinesq (shallow wave equation), and L. Boltzmann 
(kinetic gas theory). Among them, Prandtl received 4 nominations in 1928 and 
1937 because of his boundary layer theory in fluids, and G.I. Taylor was nominated 
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in 1937 by H. Jeffreys (geophysicist and author of the book The Earth) and in 1945 
by Sir N.F. Mott (Nobel Prize in physics 1977). A master among mechanicians, 
Lord Kelvin was actually one of the eleven people being nominated to receive the 
Nobel Prize in physics in 1901 (the first year Nobel Prizes were awarded). The first 
recipient in physics in 1901 was Wilhelm Conrad Rontgen (for his discovery of X- 
ray) and in that year Rontgen actually nominated Lord Kelvin to get the Nobel 
Prize! Recall from Chapter 12 that Kelvin was the main inventor of the method of 
stationary phase and from Chapter | that Kelvin was the founder of the Kelvin- 
Stokes theorem. We can see that many Nobel Prize nominees and winners could be 
considered as applied mathematicians themselves and made significant 
contributions to the development of various branches of applied mathematics, 
especially related to the analysis of differential equations. 

In Crawford’s mind, there are certainly many well-deserved winners who 
ended up as losers. Among others, they include Arnold Sommerfeld, Henri 
Poincare, Oliver Heaviside, Ludwig Boltzmann, and Vilhelm Bjerknes (father of 
modern meteorology and proposed the theory of polar front or jet stream in weather 
systems). 


Appendices 905 


Appendix C: Proof of Ramanujan’s Master 
and Integral Theorems 


C.1 RAMANUJAN’S MASTER THEOREM 


To prove Ramanujan’s Master Theorem, let us first recall the following definition 
of gamma function I(x) proposed by Euler 


T(x) = [tera (C.1) 
Next, consider the Laplace transform of a power series: 

Lit") = ,e “4M d (C.2) 
Applying the change of variables of st = z, we obtain 

Lit") = [ere = = a [ecetae = — ae edt (C.3) 

The last equation in (C.3) can be rewritten as 

A [oeerat (C4) 
Applying another change of variable of s = r* gives: 

= ene" de (C.5) 


Multiplying both sides by the k-th derivative of a function f(x) evaluated at zero 
divided by & factorial as: 


(k) (k) 
f FO ire —r*t t” l= Pade 8) oa a (C.6) 
Consider an ape sum on both sides with wpe to the index k 
(Kk) (k) 
ee ire —rkt tf" lat = ye (0) I(n) (C.7) 
ki 
py 0 
The er function inside the integral is now expanded in Taylor series as 
=p = yee 4 —" i rx he pr x3 . (C8) 
exp(—r ky ~ i! rT aa : 


Substitute (C.8) into (C.7) and reverse the order of integration and summation to 
yield 


Be (k) 0 7k ym (kK) —kn 
{ ye wy! PX) ld = S1@)f Or" a (C9) 
k=0 . k= . 


We now further reverse the order of summation within the integral on the left hand 
side of (C.7) 
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(— ay" . f° O) yim x" 1 = n-l (— xy m 
LHS = is pe : yD 7 dx as f(r™dx (C.10) 
The last part ss follows from the Maclaurin series expansion the function 


Six) as: 


yy lox 
i) ar (C.11) 
Equation (C.11) can also be applied to the right hand side of (C.9) to get 
a oo kyr yk 
I xt LOY Tif) (C.12) 
0 rar k! 
Finally, we can rewrite using the following identifications: 
LO) & 9h), (C.13) 
we have 
[> “yr 3 Oe in PUN)" a. rem g(-n) (C.14) 


This completes the proof of Ramanujan’ s Master Theorem. This is a clever and 
original technique employed by Ramanujan. The main success of the proof rests on 
the expansion of the exponential function, the reverse of the order of integral and 
summation, and the reverse of the order of different summations. 


C.2 RAMANUJAN’S INTEGRAL THEOREM 


In this appendix, we consider Ramanujan’s integral theorem, which is a 
generalization of the Frullani-Cauchy integral theorem reported in Section 1.8. This 
integral theorem is expressed as: 


i HE gO} 1 EF(0)- re 24 fn ey) : (C.15) 


provided that a, = g(0) and f(0) = g(~) with a, b > 0. In addition, functions u and 
v are defined as the coefficients of the Borel theorem of functions fand g as: 


k k 
F(x) fl) = > WON" 9(0)-2(0)= — WC C16 
For the special case that “s = g, we have the last term in (C15) am and giving 
[ELON t= FO fe} IN) (C.17) 
x a 


which is the Frullani-Cauchy integral given in (1.202). Thus, the Frullani-Cauchy 
integral is recovered as a special case of Ramanujan’s integral theorem. 
To prove (C.15), we consider the following integral 
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1, = [x (sa) f)} ~tebx) - g(@)}) de 


=T(n) [2 (—n)- 9) (-n)] 
where we have applied (C.16) to get 
ea) a & < 
ee py ax) > ao “) a‘ -> feet oi (C19) 


! 
k=0 k! k=0 


(C.18) 


‘i xX 
ee ye bx) Sites yb Sak eos 


k=0 k=0 k! 
In other words, we have 
a (k)=u(kja*, y(k) =v(k)b* (C.21) 


Clearly, substitution of (C.19) and (C.20) into the first line of (C.18) and 
application of Ramanujan’s Master Theorem given in (C.14) results in the second 
line of (C.18). Using of the definition of @; and @2 given in (C.19) and (C.20) gives 


1, =T(n) [a"u(-n)—b"(-n) | Spy Ae, 459) 
n 
Now, we consider the limit of n — 0 for J, given in (C.21) as: 
lim J, = lim i a) a 
n->0 n->0 ; n ; : . (C.23) 
= fay b"u(-n)—a"v(-n) _ ia b"u(-n) —a"v(—n) 
n>0 a'b"n n->0 n 
In obtaining (C.23), we have taken the following limits 
lim(n+1)=T()=0!=1, lima"b” =a°b® =1 (C.24) 
n—>0 n>0 


Note that u(0) = v(0) because f(0) = g(0). Application of L’H6pital’s rule to the last 
part of (C.23) leads to 


lim J,, = lim {b"u( n)Inb—b"u'(—n)-—a"v(-n) Ina +a"v'(-n)} 


n->0 


=u(0)Inb-v(0)Ina+v‘(0) —w'(0) 
(C.25) 


=u(0) in(2) +v'(0)—u'(0) 


={/(0)= fleo)}In(*)+¥"0)—w'O) 


Finally, Ramanujan found a more compact form for the last two terms in the last 
part of (C.25) by noting 


d inp 24 _ u(s) a a vou (C.26) 
ds u(s) Ws) uu vou 
Taking the limit s 0, we have 


d v(s) v(0O) uw) 1, , 
(np) 0) u(0) woe -* O ee 
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To visualize the last part of (C.27), we have used the following identity 


S(0)— f(~) =u(0), g(0)— g(~) = v(0) (C.28) 
Since, we require f(0) = g(0) and /(«) = g(o«) thus we must have 
u(0) = v(0) (C.29) 


Combining (C.27) and (C.28) gives 


v'(0)—u'(0) = u(0) ue 
s=0 
(C.30) 


oy 4) 
={f(0)—f¢ ym) 


Finally, substitution of (C.30) into the last line of (C.25), we have Ramanujan’s 
integral theorem. 
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Appendix D: Jacobi Elliptic Functions 


D.1 JACOBI ELLIPTIC FUNCTIONS 


Jacobi elliptic functions are related to the solution of the following integral when wu 
and k are given: 


¢ do 
7 =| = F(,k) (D.1) 
° V1—k? sin? 0 
where F(@, k) is the elliptic integral of the first kind. If the evaluation of the 
amplitude @ of the integral can be done, we write its solution as: 


sn(u,k) =sing, cn(u,k)=cosd, dn(u,k)=V1-k?sin?¢=Agd (D.2) 
sn(u, k) 
cn(u,k) 


It can be seen that sn, cn, and tn closely resemble the circular functions sine, cosine 
and tangent. The amplitude function of the integral is denoted by am(u,k) or ¢ For 
a fixed value of k, the Jacobi elliptic sine, cosine, and tangent functions can be 
simplified as: 

sn(u,k)=sn(u), cn(u,k)=cn(u), tn(u,k)=tn(v), dn(u,k)=dn(u) (DA) 
Abramowitz and Stegun (1964) also defined additional types of Jacobi elliptic 
functions, namely, 


tn(u,k) = =tang, am(u,k)=4¢, (D.3) 


ne _ dn() ae 1 
cd(u) = ae de(u) = au ns(u) Gy" 
sn(w) 1 4) _ ale) 
sd(u) = a nc(u) = ont)’ ds(u) aCe. (D.5) 
nd(u) = —- sc(u) = a) cs(u) = a) 


dn(w) en(u)’ sn(u) 
The values of the integral u can be evaluated by using inverse functions that closely 
resemble arcsine, arccosine, etc. 


u=sn'(sing,k) u=cn'(cosg,k), u=tn'(tang,k), 
u=dn'(Ag,k), u=am'(¢,h), 
For example, the inverse elliptic function can be evaluated as 


mS, 3, F (30° \ 60") = 0.54222911 (D.7) 


(D.6) 


where F(¢,q) is the aa integral of the first kind expressed in terms of the 
parameter o instead of k as 


F(¢\a) -{" do 
0 Jl—sin? asin? 0 


The numerical value in (D.7) was looked up from Table 17.5 of Abramowitz and 
Stegun (1964). 


(D.8) 
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D.2 IDENTITIES OF JACOBI ELLIPTIC FUNCTIONS 


Some special values of Jacobi elliptic functions are: 
sn(0)=0, cn(0)=1, dn(0)=1, am(0)=0 (D.9) 
sn(u,0)=sinu, cn(u,0)=cosu, dn(u,0) =1, 


(D.10) 
sn(u,l)=tanhu, cn(u,l) =dn(u,1l) =sechu, 
A number of identities can be proved easily 
sn7u +cn7u = sin? ¢+cos’ ¢ =1 (D.11) 
dn?u—k?cn7u =1-k? =k”? (D.12) 
k’sn?u+dn7u =1 (D.13) 
sn(—u) =—snu (D.14) 
cn(—u) = cnu (D.15) 
dn(—u) = dnu (D.16) 
am(—u) = —amu (D.17) 
Jacobi elliptic functions are doubly periodic functions 
sn(u+2mK + 2nK‘i) = (-1)"snu (D.18) 
cn(u + 2mK + 2nK i) = (-1)""*" cnu (D.19) 
dn(u + 2mK + 2nK 1) = (-1)"dnu (D.20) 
where 
m/2 dé 
K =K(m)=[ (D.21) 
: P 1-msin* 0 
= K'(m) = i m,+m=1 (D.22) 
yl- m, sin 29° 
Thus, K and ik’ are called real and imaginary quarter-periods. 
We will ae a simple case of periodicity by considering 
i =-|; do i (ia do 
v= —— 
V1-k? sin? Vl-k’ sin’? @ °7 V1-k’ sin? 6 
(D.23) 


ae of do Pe do 
0 eer wI2h_ 42 sin29 °* ~Jl-k? sin? 6 
=K+(" ns 

72M? sin? 9°  S1—k? sin? @ 


For the second integral on the right of (D.23), we can apply the following change 
of variables 

O0=1-0 (D.24) 
Thus, we have 


(i d@ =[ —dp =" dp 
wrk gino =e Jl-k? sin?g °° fl" sin? p 


=K (D.25) 
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For the last integral on the right of (D.23), the following change of variables is 
applied 
0=1+9 (D.26) 


Thus, the final integral becomes 


d@ 7 iE dp _ 
® Wk sin? G 7° fl-K? sin’ ~ 
Substitution of (D.25) and (D.27) into (D.23) gives 
v=2K+u (D.28) 
By applying the Jacobi elliptic function to solve for the amplitude of the first 
integral of (D.23) gives 


(D.27) 


sn(v) =sn(2K +u) =sin(¢+z) =—sin ¢ =-sn(u) (D.29) 
Therefore, comparing the second and the fifth terms gives the required periodicity 
sn(2K +u) =—sn(u) (D.30) 


D.3 DIFFERENTIATION OF JACOBI ELLIPTIC FUNCTIONS 


Some formulas of differentiation for Jacobi elliptic functions are 


“sn =cnudnu (D.31) 
lu 
d 
Ph =—snudnu (D.32) 
u 
d 2 
ao =—k*snucnu (D.33) 
u 
@ 
ao = 2k?sn? u—(1+k*)snu (D.34) 
u 
d 
an = (2k? —lenu—2k?en? u (D.35) 
lu 
a 
ae =(2—k*)dnu—2dn? u (D.36) 
u 
d 
Pou =dnu (D.37) 
u 


To see the validity of these formulas, we first consider the differentiation of u with 
respect to ¢ 
du_d ( do _ 1 ll 
dp doo fix? sin29) ft—k? sin? g = dn 
In obtaining the above result, we have applied the Leibniz rule for differentiating 
integrals. Recalling that sn u = sing, we have 
d 
du 


(D.38) 


snu = q ipseosa =cnu dnu (D.39) 
du du 
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We have used (D.38) to obtain the last part of (D.39). Thus, the identity given in 
(D.31) is established. Similarly, we note that cn u = cos¢ and thus 
d 
du 
This is identical to (D.32). 
Note from (D.2) that dn uw = (1—-?sin?¢)!”, and we have 


é go 1-k? sin’ ¢ = : ( a sinpeosg 2 


—dn 
du du atk sin? ¢ du’ (D.41) 


=-k* singcos¢ = —k*snucnu 


cnu = d ep sasing andi (D.40) 
du du 


This proved (D.33). For the second derivative, we differentiate (D.31) one more 
time with respect to u to get 
2 


snu = dnu é Suc a (D.42) 
du? du du 
Substitution of (D.32) and (D.33) into (D.42) results in 


2 
a u=—snudn* u—k*snucn? u =-snu (dn? u+k? cn? u) (D.43) 
lu 
By virtue of (D.11) and (D.12), all Jacobi elliptic functions in the right hand side of 
(D.43) can be expressed in terms of a Jacobi elliptic sine function as: 
2 
~~ snu = -snu[l—k’sn?u+k?(1-sn? 
ae u u[ u ( u)] (D.44) 
=-snu[1+ k? —2k?sn? uj= 2k’sn? u— d+ k*)sn u 
To prove (D.35), we differentiate (D.32) one more time with respect to u to get 


cnu =—snu g dnu—dnu d snu (D.45) 
du? du du 
Substitution of (D.31) and (D.33) into (D.45) results in 
2 
ae =k’sn? ucnu—dn* ucnu (D.46) 
u 


Utilizing (D.11) and (D.12), (D.46) becomes 
2 
ea = k?(1-cn? u)enu—(L —k? +k? cn? u)cnu 
dw? (D.47) 
= (2k? -1)enu—2k*cn? u 
This establishes the formula given in (D.35). Differentiation of (D.33) with respect 
to u gives 
3 
anu =-k*(snu Sein +cnu ee) 
du du du 


=—k*(-sn? udnu + cn? udnu) (D.48) 
=—k*dnu(cn* u—sn? w) 


= (2—k?)dnu—2dn7u 
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In proving this formula, we have employed (D.12), (D.13), (D.31) and (D.32) in 
(D.48). Finally, the proof of (D.37) is more straightforward by noting that am u = @ 


e siidy Oe (D.49) 
du du 


D.4 INTEGRATION OF JACOBI ELLIPTIC FUNCTIONS 


Integration of Jacobi elliptic functions is less straightforward. Here are some 
formulas of integration: 


[snucdu =~-In(dnu—kenw) (D.50) 
fonwdu =—-cos-"(dnu) (D.51) 
[ane du = sin“ 1(snw) (D.52) 
| a paca (D.53) 
snu snu 
| 1 a 1 eo a8) (D.54) 
cnu k' cnu 
1 1 _1,cnu 
du = D.55 
lan 7 ke ae ( ) 
cnu 1-dnu 
| du = In( ) (D.56) 
snu snu 
[rt au= 1 all (D.57) 
cnu k' cnu 


[Pe au = in (D.58) 


cnu cnu 


where k' = (1—-k)'?. These formulas can be proved easily by directly differentiating 
both sides with respect to u. For example, (D.50) can be proved as 


© fsnudu=snu = oy : 52 dig waa 
du k dnu-kenu du du 
ti ! \[-k?snuenu+ksnudnu] (D.59) 


7 k dnu—kcnu 
=snu 
Similarly, all other formulas can be checked. 
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D.5 OTHER PROPERTIES OF JACOBI ELLIPTIC FUNCTIONS 


Maclaurin series expansion can be used to expand the Jacobi elliptic functions in a 
power series as 


sn(u,k) =u - SP re del4e? A yr (D.60) 


en(uk) =1— Su + (14 4k ut dk? + 16k a +. (D.61) 


dn(ugk) =1- $a 4+ P(A at P06 4d? 4 US (D.62) 


The sum rules for Jacobi elliptic functions are also somewhat similar to those for 
circular functions 
snucnvdnv+cnusnvdnu 


sn(u+v) = (D.63 
1—k’sn7u sn? v ) 
Ce (D.64) 

1—ksn“usn’ v 

dnudnv¥ k* 

din nudnvF ; a (D.65) 

1—k*sn“usn* v 

t +f 

FI i ee (D.66) 


1=tnutnvdnudny 


Davis (1962) provided a sizable introduction to elliptic integrals and Jacobi elliptic 
functions and their applications in nonlinear differential equations. One can refer to 
Abramowitz and Stegun (1964) and Olver et al. (2010) for comprehensive 
coverage of the properties of Jacobi elliptic functions. 
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Appendix E: Euler’s Constant 


The birth of Euler’s constant is related to the infinite series of 
. m ] 
lim — (E.1) 


00 
m ke k 


It was known to Euler and other mathematicians that this infinite series does not 
converge. For example, we find numerically that 


204 220 20220 
— = 3.60, — = 5.98, — =10.49 E.2 
Dural gaa 2a, E2) 
Although the initial convergence looks promising, but it never converges 
1 
>= > (E.3) 
ia * 


Thus, people were not interested in it, except Euler. It turns out that the difference 
between this infinite series and In m when m —> oo is finite 


moo 
k=1 


1 
= lim ——Inm |=0.5772157 E.4 
Mi bs ; (E.4) 

This finite value constant appears naturally in many mathematical analyses and thus 
becomes a very important mathematical constant. To start with, let us consider the 
area under a hyperbolic curve shown in Figure E. 1: 

x ‘x 2, 3 4 5 
In(l+.x) =| ks =| (1-t+2? = +...)dt =f (E.5) 

11l+¢ 41 2 3 4 #5 
Euler was an expert in playing around with infinite series (although at his time the 
convergence test for infinite series did not exist). He substituted x = I/n and 
expanded the left hand side using Taylor series expansion as 


ind++)=+ = z - (E.6) 
non 2n° 3n An 
This can be rearranged as 
1 n+] 1 1 1 
= In( y+ + + (E.7 
n n 2n> 3n> 4n* 
Taking different values of n, we have 
1 1 1 
1=In(2)+ +—+... E.8 
(2) a a (E.8) 
1 3. 1 ot. 
= In(—)+ +—+... E.9 
2 © 8 24 64 ee) 
: in) 4 ee + : theses (E.10) 
3 3° 18 81 324 


Summing all the left hand sides of (E.8) to (E.10), we have 


916 Theory of Differential Equations in Engineering and Mechanics 


Figure E.1 Area under a hyperbola 


1 n+l, 1 1 1 1 
—= In2+In Pes +...+In +—[1+—+=—4+..4— 
Di [ : (Jt slit gt gt tal 


(E.11) 


alles) | A fy 
8 27 n 4 16 81 n* 


The first sum in the square ong on the right of (E.11) can be simplified as 
[in2+In5 ia ae In) 


eee ee ee (E.12) 
=In(n +1) 
All the other terms on the right of (E.12) do converge and can be summed to 
approximately 0.5772157. Thus, we have the following approximation 


oo 


Yo> =In(n +1) 0.572157 (E.13) 
i=l 
For large n, we have the approximation 
lim In(z +1) = lim Inn (E.14) 
No no 
Using this condition, we finally arrive at Euler’s constant 
y = lim yi+-inm = 0.577215664901532860606512... (E.15) 
m—> co ‘el k 
There are many amazing properties of y, and here are some of them: 
20 ; 1 oO 2 
po I e™ Inxdx =— In(Inx)dx = —4 Jo e* xInxdx (E.16) 
0 0 
o @* e* 2] =e ix 
y= (A ae= [= de = | ( de (E17) 
0 |-—e” Bs 0x l+x 
y=1+| oe Ee (E.18) 
Ox x 1+x 0 Inx Il-x 


= lim Ya-4 


xolt Ty on 


dx 


1 1 1 © COsSx 
y= i} 


+ 
2-2! 4-4! 6-6! 


x 


y= 2 (e* -e*)dx 
Xx 


Ramanujan derived some amazing formulas for v 


_ 
1 
In2- 2n ———. 
- » 2 ‘Gh 
k= 


2) 
= In2 Ee ‘{ = - = - = 
3° -3 6-6 9-9 12°>-12 


1 1 1 
o( 7 er tits 
15°-15 18°-18 39° —39 


eae 0 


1 k 
1 (3/2) d 
i . 
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(E.19) 


(E.20) 


(E.21) 


(E.22) 


(E.23) 


(E.24) 


(E.25) 


(E.26) 


(E.27) 
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Appendix F: 7 


It has been known for thousands of year that the circumference of a circle depends 
on the diameter of the same circle. The ratio between them is known to be a 
constant and is normally denoted by the Greek alphabet 2, which was originally 
proposed by Euler. 
The symbol z is defined as: 
ee circumference (D.1) 
diameter 

Mathematically, x can be evaluated as the area of a unit circle 


wal yde= af ima (F.2) 


where x and y are defined in Figure F.1. 


a 
x dx 


Figure F.1 Area of a quarter of a circle 


Alternatively, we can also define 7 as 


1 | 4 
ages eres cla! x, (F.3) 


By employing Euler’s formula, Fagnano defined 7 as: 


iit? 
n= 4in| | (F.4) 
1+i 
This can be shown easily by observing that 
1-i=V2e""*, 147=V2e'7"4 (F.5) 


It turns out that it is transcendental, or it is not the solution of any algebraic 
equation (proved by Lindemann in 1882). It is also an irrational number, or it could 
not be written as the ratio of two integers (proved by Lambert and Legendre in 
1700s). In other words, there are infinite decimal digits for z. This is one of the 
most fundamental constant of mathematics. 

Mathematicians had struggled in the last few thousand years to give the most 
“accurate” result for this 2. In 2009, x had been computed to 2.7 trillion decimal 
digits. In fact, the speed of a supercomputer is normally gauged by measuring the 
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time that it takes to calculate the trillion digits of x (this apparently was advocated 
by John von Neumann). 
For practical purposes, these most “accurate” m’s normally do not have much 
significance. For example, the approximation by Zu proposed in 462 AD: 
355 
1 = — 
113 
This gives 3.14159292... compared to the more accurate result 3.14159265... If you 
draw a circle 10 km in diameter, this approximation gives a circumference of the 
circle of less than 4 mm. Thus, a 6-digit accuracy appears to be good enough for 
most engineering applications. It is, however, of fundamental importance and of 
theoretical interest to summarize some commonly used formulas and some less 
commonly known formulas of 7. 
It has been known for thousands of years that a first approximation in rational 
form is 


(F.6) 


24 

7 
This number and 3.14 are the z values that all primary school students are asked to 
remember. This approximation gives only 0.04% error. If you draw a circle of 10 
km in diameter, this approximation gives a circumference of the circle of about 3 
m. In 825, Archimedes gave the result as 


“feo — (F.8) 


1 (F.7) 


In fact, the number 355/113 is the most accurate fraction approximating 7 for a 
denominator of less than 16,586. To see this, if there is fraction more accurate than 
355/113, we must have 


359 _ 4) < 9<0,00000026677 (F.9) 
113. p 
This can be rewritten as 
a BS5P—1134| <2x0.00000026677 (F.10) 
Pp 


Since 355p—113q must be an integer larger than 1, we must have a less restrictive 
inequality: 
. 1 

(113 x 2 x 0.00000026677) 


Thus, this completes the proof. Other commonly adopted fractions for 2 include: 


1 328 3.141509..., 7 = Ts 3.14166..., 7 = IN 3.14655..., 
106 240 232 


> 16586 (F.11) 


P 


(F.12) 
pe 3 iG. ee 


62,351 67,440 
Historically, the number of significant digits was improved by cutting a circle 
geometrically into more segments. One such geometrically obtained formula is by 
Viete in 1615 (published posthumously): 


= 3.14159253... 
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2 V2 J24+V2 y2+y2+v2 eis) 
a ; " 


1 
This formula is of theoretical significance, since it is the first formula that extends 
to infinity. We outline the proof of the Viete’s formula briefly. 


Figure F.2 Viéte scheme for calculating the circumference of a circle 


Consider the case that O= 7/4 in Figure F.2; we have 
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1 2 


cos(7) aa (F.14) 


Next, we can use the sum rule for the cosine function as 


a a 2/0 
cos = cos(— + —) = 2cos 1 F.15 
( Z ( os ) ( a (F.15) 
Solving (F.15) and noting the result of (F.14) gives 


1 
eco 7 2+ 2 


2,0 
cos = F.16 
( 5 ) ; Fi (F.16) 
Therefore, we have 
a 2+ iz 
cos(—) = ———— F.17 
( : ) 5 (F.17) 
The process of taking half of the angle again leads to 
Ya x 2,0 
cos = cos(— + =2cos 1 F.18 
( P ) 5 ra 5 c Z (F.18) 


Thus, the solution of it and in view of (F.17) gives 


244/242 (F.19) 
5 . 


Clearly, we can repeat the process of halving the angle to infinitely small angles. 
This is the basis for Viéte’s formula going to infinity. For example, we can easily 


see that 
2+ 2+42+j2+y2+V2 


T 
Cos! = 
GP 


cos = F.20 
G a8) 5 (F.20) 
Or more generally, we can express the general form as: 
je 24+4/2+ 2424+ \2+...4V2 
re = 5 (F.21) 


In (F.21), there is a & square root sign on the right hand side. When k— o, we will 
have an infinitely long square root term on the right. 

Our main task is now to link the above identity of the cosine function to the 
circular arcs AC and AE shown in Figure F.2. First, we recall that 


seo) = /2 =sec0 (F.22) 
From right angle triangle OAB, the length of AB is 
AB Se (F.23) 
sec 0 


From right angle triangle OAZ, we see that AZ equals 


AZ =P +2 = V2 =secd (F.24) 


Thus, AB equals 
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AB= AZ = v2 (F.25) 
2 2 
Considering triangle ABC and assuming that ZBAC = 0/2 (this will be verified as 


true), we have 
AC asses oie” (F.26) 
2 2 2 
Noting (F.17) and (F.22) that 


2 
sec 0 = am sod = 


V2 + af 
Substitution of a“ into (F.26) gives 


1 oa pe | 
“EE Bee i 


Let us now use the Pythagoras theorem (independent of the assumption that angle 
ZBAC is 0/2) as 


2: 
AC = AB BC = |2.02=) = AN} pk (F.29) 


This result is consistent with (F.26) which is a result of the assumption that ZBAC 
= 9/2. Thus, we have proved that ZBAC is indeed 0/2. This also proves the validity 
of (F.26). 

However, taking the first approximation that arc AC equals straight line AC, 
we have the circumference as 


2n ~ 8AC = 82-2 (F.30) 


Inverting this formula, we have 


20 1 2 + V2 cao 2 y= 02 2+y? ) (F31) 
a * 32 V2 22- 22+ _ 


This is clearly the first two terms in the Viéte’s formula given in (F.13). 
Now, let us take the next approximation shown in the lower figure in Figure 
F.2. Now, let us first apply the Pythagoras theorem that 


AE = AD* + DE? = sin? S —)+(1-x)? = ine S ) +[1 —cos( SP 
(F.32) 
= ,2- 2cos(— 5) =\2-y2+W2 240f2 


In obtaining the te result of (F.32), we have employed the result of (F.17). 
Finally, we want to show that it can be expressed as: 


AE =2-J2+ V2 = Zsecdsee(S)see(T) (F.33) 


(F.27) 


In view of (F.19). the right hand side of (F.33) is 
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1 0 QO. 12 2 2 

RHS = —sec @sec(—) sec(—) = — —= —— ——— 
4 a 4° 442 Jo4ifo 2+ 24/2 
a A 1 2=s/2-4f2 


V2 242 Jos Jo J2 o-so+2 (F.34) 
2 qisazeala 

V2 Jo+ 22-2 

S044) 


Therefore, it equals the left hand side of (F.33). One half of the circumference is 
approximately given as: 


- ~4AC = sec Ose0(S) sect) (F.35) 


In view of (F.34), inversion of (F.35) gives 


0 f2 24/2 24/2472 (F.36) 
2 2 2 


0 
ff) = 
cos COs, Peon 


When we repeat the halving procedure infinitely, we have Viéte’s formula: 


2 J2 J2+J2 J2+y2+V2 37) 
a 5 a 


a 
Viete’s formula can actually be obtained by setting A = 7/2 in the following 
formula for the infinite product of cosines (discovered by Euler): 


sin A 
= F. 
A A (F.38) 


AA 
cos — cos —cos—... 
2 4 8 


But such technique is tedious and the best that can be done was only up to 40 digits 
in 1630 by Grienberger. Beyond this, calculus and infinite series are needed. 
A particularly useful formula in calculating 2 involves the arctangent. In 
particular, using calculus and Taylor series expansion it was found that 
30 8 


=i x x x 
tan (x) =x-—+ +... F.39 
(x) ss (F.39) 
Setting x = | in (F.39), Leibniz obtained the following result in 1673 
1 1 1 1 1 
—=tan” (1) =1--—+—-—4+... F.40 
4 @ a: 3D) ( ) 


With this value of the arctangent, John Machin in 1706 proposed a more refined 
formula that involves two arctangent functions: 
a -] 1 | 1 
= 4tan tan F.41 
4 ( 5) ( 3 5 (F.41) 
Using it, Machin obtained 101 digits of z. Euler in 1755 followed the same line of 
analysis and obtained 205 digits of x. Gauss proposed another form as: 
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“= Se — F.42 

4 =3tan- id _ C =) tan ‘e) ( ) 
Although these kinds of dors iles are more oles they converge faster. The 
human record on the digits of m was obtained by Levi Smith in 1949 with 1121 
digits. This is the end of human computation of digits of 2, but it also marks the 
emergence of computer calculation of 2. Actually, there are an infinite number of 
formulas similar to (F.41) and (F.42). To see how to generate them, we follow 
Casper Wessel’s (a Danish surveyor) approach of using complex numbers. Let us 
start with a simple example as: 

(2+i)(3+i) =5+i5 (F.43) 

Rewriting it in terms of the polar form we have 


Rene? =7relA® = 


=re'? (F.44) 
However, from the original complex numbers in (F.43), we have 


6, =tan"(), 6, =tan"(), Q=tan'()=7 (F.45) 


Equation (F.44) instantly gives another simple formula of the type of (F.41) and 
(F.42) as: 


1 _1,1 _1l 
rae tan CG) + tan ) (F.46) 
This analysis can be extended to more general cases 
(n+1+in)(2n+1+i) =2n? +2n+1+i(2n? +2n+1) (F.47) 
Using the same line of argument, we get 


T -l, -] 1 
—=tan (——)+tan (——— F.48 
4 ary, GaaD ( ) 


Alternatively, we find 
(n+2k+1+in)[2n+2k+1+i(2k+)] 


4 ‘ . 5 (F.49) 
=2n* +2n(2k +1) + (2k +1)° +:i[2n7 + 2n(2k +1) + (2k +1)°] 
Thus, we have 
a -| n -| 2k+1 
= tan + tan F.50 
4 ear one DkaD ( ) 


When k = 0 and n = 1, (F.46) is recovered. Clearly, (F.50) is true for any integer of 
n and k, therefore we have an infinite number of formulas of this type. 
Other similar forms are 


[4-10" +1+1(4-10” —1)][4-10” +i] =(4-10")? +14:[(4-10")? +1] (F.51) 
The corresponding arctangent formula is 
an 14-10" a, 1 


tan |( = tan ) (F.52) 
4 4-10" +1 4-10" 
A more elaborate formula involves the power of a complex number, for example 
(5+i)*(-239 +i) =-114244 — 1114244 (F.53) 


The author is encouraged to show the validity of the above formula. This gives 
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_ - 
j 4tan- a 5 tan Ss) (F.54) 


This is the Machin formula that we ee in (F.41). This kind of formula may not 
be easily obtained but there are an infinite number of them as illustrated in (F.50) 
and (F.52). The convergence of each formula varies from one to another. We report 
here some of the more well-known formulas from Euler, Gauss, and others: 


ae aie 
co G) tan (~) (F.55) 
Bf yay cap 
oO Gyan ory, (F.56) 
7 . tan""(5) ‘ tan“'(5) + tan!) (F.57) 
ae ‘© tan” C- =) + tan a (F.58) 
= alps 4 
a = 4tan™ 'G )-2tan™ Bre og) tan 8 (F.59) 
* =6tan (2) 2tan"(S =) tan"(— rT (F.60) 
i 7h =i ames 
4 12 tan oo oo Stan™ hy (F.61) 
1 1/1 ty./1 ~t, 21 
a F.62 
qo otan tan Ga)" Ge? mon) 


a =| 1 = 1 ef 1 af 1 =| 5 
—=tan (—)+tan (—)+tan (—)+tan (—)+tan “(— F.63 
3 @ ) ore Cy om (F.63) 


The steps to prove the identities involving three arctangent functions are somewhat 
similar, although it is more tedious. For example, to prove (F.62) we can consider 


(5+ 1)4(240-1)(57361—i) = (476 + 480i)(240 —i)(57361-i) 
= 6580568644(1+i) 


Thus, the identity of (F.62) is established by using the polar form of the complex 
numbers on both sides. 


(F.64) 


Other formulas for the evaluation of x include: 
John Wallis: 
A _ 2-4-4-6+++ 


lc i F.65 
4 3-3-5-5e ae 


Lord Brouncker: 
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2 1 
= = —_____ (F.66) 
= i 
2+ 2 
ae 49 
2+ 
Al-Kashi: 
ora64!6, 9, 8, 1,4, 51, 64, 0 Gey 
60 60 60° 60 60° 60 60° 60° 60 
Takano: 
i 1 1 
m = 48 tan '(—)+128tan~!(—) —20tan-!(—_) + 48 tan! F.68 
Gy ) (39° (04a3> ( ) 
Gregory-Leibniz: 
a 1. de de _ll (F.69) 
4 3 > “7 Yo ol 
Euler: 
xz |i 1 1 1 1 | 1 1 
7 + os + +... (F.70) 
A Bee 3 34° 363° 753" 
Simon Plouffe: 
—el 4 2 1 1 
=> —( ) (F.71) 
rear lls 8nt+1l 8n+4 8n+5 8n+6 


It would be incomplete in any discussion of a without mentioning the amazing 
formulas by the Indian mathematics genius Ramanujan. Some of them are the 
results from the analyses of the elliptic integral and modular equation. Here is one 
of his amazing formulas: 


ee ! 
1 2/2 py SLE. (F.72) 
x 9801S (n!)*3964" 
The first term in this series already gives 7 digits of 1 
1 a6 se 3.14159273.. (7 digits) (F.73) 
2206/2 


We can add eight correct digits with each additional term in the summation. This 
convergent rate is amazing. He also gave other very simple but peculiar forms of 
approximations of z, which are largely different from other approximations. Here 
are some of them (Ramanujan, 1910): 


3 
1 In(640320) = 3.141592654... (15 digits (F.74) 
V163 = 
3 
7 © —==1n(5280) = 3.141592653... (9 digits) (F.75) 
J67 


1 WC" 42n+5 
= x F.76 
1 Lge 16 oe) 
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= 4/9? + +1 =3.141592682. (9 digits) (F.77) 
n=4 = = 3.141592653.. (9 digits) (F.78) 
ae 2 eee (F.79) 
> Bo V2 
re alae ESE (F.80) 
" 
= In{(2/2 + J10)(3 +10) (F.81 
a { ) } ) 
= (975-2)! =3.14159273. (7 digits) (F.82) 
ms 17+ 155 y_ 3.14159265380... (11 digits) (F.83) 
* 25 7155 
355 0.0003 . 
1 = 3.141592744... (7 digits F.84 
113 ce, (7 digits) (F.84) 
1 1 


x 7 digits F.85 
3 Je 0112539839) | Bits) men) 


1, 103, 27493 11-3, , 59882 1:3, 3-5-7 
QnJ2 99% 998 27°42" gg! *2.4%° 42.9? 


ie in=G + $5)(2+V/2)[(5 +2J10) + (61+ 2010 J} (22 digits) 


)+..  (F.86) 


Bio 
(F.87) 
neh ney (JB 1b CAM) +E (31 digits) 
(F.88) 


An improved version of Ramanujan’s formula given in (F.76) is known as the 
aia aa formula 


! 
1 evy’ (- 1)" (6n)!(13591409 + 5451401347) 


n=0 (3n!)(n!)°640320°"*>/ 


This improved result will lead to 14 correct digits just by the first term. In 1998, 
this formula was used to calculate | billion decimals. 

There are also some very peculiar observations about x. Here are the first 38 
digits of x 


(F.89) 


1 © 3.1415926535897932384626433832795028841... (F.90) 
Amazingly, this number appears to have intimate relation with prime (i.e., an 
integer that cannot be written in terms of multiples of integers of other than 1 and 
itself). We see that the first digit is 3, which is a prime. The first two digits are 31, 
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which is also a prime. The first six digits are 314159 which is also a prime. More 
amazingly, the first 38 digits of 2 is also prime. That is, the following are primes: 


31 
314159 


31415926535897932384626433832795028841 
More interestingly, it was discovered that the reverse of the first three primes are 
also primes: 


(F.91) 


3 
13 (F.92) 


951413 

But, it has been checked up to the first 432 digits of m and no more primes were 
found. It is still not known whether there is another prime for more digits of 2. In 
addition, 314159 is a very peculiar prime. The complement number of this prime 
(each digit of the number is replaced by the difference of it with 10) is 796951, 
which is also a prime. If this prime is chopped into 3 two-digit numbers as 31, 41, 
and 59, we obtain another three primes. They are, by the way, twin primes. That is, 
29 and 31 are twin primes, 41 and 43 are twin primes, and finally 59 and 61 are 
also twin primes. The sum of them, i.e., 31+41+59 = 131, is also a prime, and the 
sum of the cube of them, ie., 31°+417+59> = 304091, is also a prime. In 
conclusion, 314159 is a very peculiar prime and is the first 6 digits of 7. 

There are too many coincidences between prime and x. There may exist a 
more in-depth relation between them, and it remains to be discovered. 


Problem F.1 Prove Euler’s formula given in (F.38) 


sin A 
A= ; 
al (F.93) 
cos — cos —cos—... 
2 4 8 
Hints: The following formulas are useful 
sin A = aa che (F.94) 
2 2 
sin —- 
li =1 F.95 
non A ee 
oF 


Problem F.2 Use the result of Problem F.1 to show the validity (F.13): 


2 2 J2+V2 y2+ 242 (F.96) 
; 5 ss é 


a 


Problem F.3 Prove that 


tan”!(a) + tan”1(b) = tan! pat (F.97) 
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Problem F.4 Use the result of Problem F.3 to show that 


| eee, +tan!( P Z ) = tan !( ; ) (F.98) 
pt p t+pqtil 72) 
Problem F.5 Use the result of Problem F.4 to show that 
a =A il 1 1 
—=tan (—)+tan(— F.99 
Fi ( a ( a (F.99) 
Problem F.6 Use the following formula 
tan“'(a)— tan !(b) = tan! [=| (F.100) 
1+ab 
to show that 
7 = tan — =) tan” ids (F.101) 
4 101 , 
Problem F.7 Prove that 
wa) = eg +tan! — (F.102) 
b a+b b+(b* +1)/a 
and subsequently show that 
77 = tan™ ron iu tan (=) (F.103) 
Problem F.8 Use Wessel’s complex ae ee to show that 
oa “A n -| k 
= tan + tan F.104 
4 © + aR os + ® ( ) 


and subsequently show that 


1s -] 1 =i 3 
—=tan  (—)+tan (—). F.105 
ri @ ) ( ) 
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Knowing the biographies of mathematicians and scientists is essential for readers to 
appreciate the significance of these mathematical theories. Many of these 
mathematical techniques in solving the associated differential equations have great 
influence on the technological development of our daily lives. Biographies of a 
number of mathematicians, scientists, and engineers whose works are covered or 
mentioned in this book are included here. It is sometimes difficult to distinguish 
applied mathematicians from scientists and engineers, especially theoretical 
scientists. The main references for this section are Jenkins-Jones (1996), Struik 
(1987), James (2002), Kline (1972), and Millar et al. (2002). This section will 
hopefully form a mini-Who’s Who in differential equations, and their applications 
to mechanics and engineering. 


Abel, N.H. (1802-1829) was a Norwegian mathematician who made major 
contributions to elliptic functions, integral equations, infinite series, binomial 
theorem, and group theory. He provided the first stringent proof of the binominal 
theorem. He revolutionized elliptic integrals. He discovered Abelian functions. He 
proved that no algebraic solution of the general fifth degree (quintic) equation 
exists, but ironically Gauss threw his proof away unread when Abel sent it to him. 
Abel was extremely poor throughout his life, and was unrecognized. In 1825, he 
visited France and Germany but was largely ignored by mathematicians like Gauss 
and Cauchy, except Leopold Crelle. He died at age 26 due to tuberculosis. Two 
days after his death, a letter arrived from Crelle offering him a professorship at 
Berlin. 


Adams, J.C. (1819-1892) was a British mathematician and astronomer. His most 
famous achievement was his prediction of the existence of Neptune using 
"perturbation theory" while he was still an undergraduate student, but his prediction 
was not followed up by G.B. Airy. He was the Lowndean Professor at University of 
Cambridge and a recipient of the Gold Medal of the Royal Astronomical Society in 
1866. He was also a foreign honorary member of the American Academy of Arts 
and Sciences. He was supposed to be knighted by Queen Victoria but declined. The 
Adams-Moulton formula in the finite difference method in Chapter 15 is named 
after him. 


Airy, G.B. (1801-1892) was a British mathematician, astronomer, and 
geophysicist. A special function called the Airy function was the result of a kind of 
water wave. In 2-D elasticity, the Airy stress function has been of great importance 
to the analysis of 2-D elasticity problems. He considered the bending of beams and 
published in 1862 the use of the stress function on a rectangular beam, but failed to 
consider the compatibility condition. He was also involved in laying the 
transatlantic telegraph cable, and the construction of the clock of Big Ben. Airy 
was, however, better known for serving as the Astronomer Royal for 46 years and 
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for measuring Greenwich mean time by stars crossing the meridian observed 
through his telescope. Airy was arrogant and perhaps best known for his failure to 
exploit Adam’s prediction of a new planet, Neptune. While still an undergraduate, 
Adam sent his prediction to Airy, but Airy was skeptical. He asked Adams for 
clarification but did not receive a complete reply from Adams. Nine months later 
Le Verrier of France made the same prediction, which led to the discovery of 
Neptune. 


Bateman, H. (1882-1946) was an English mathematician and moved to USA in 
1910 and took up his permanent position at the California Institute of Technology 
(Caltech) in 1917. He published his textbook Differential Equations in 1915, and 
Partial Differential Equations in Mathematical Physics in 1932. He was elected to 
the National Academy of Sciences in 1930 and to the Royal Society of London in 
1928. When he passed away in 1946, a huge number of drafts of manuscripts on 
special functions and mathematical tables for integral transforms were left behind 
at Caltech, and Erdelyi was recruited to form a team to finish his classical works of 
“Bateman Manuscript Project.” These are significant textbooks on differential 
equations and engineering mathematics. 


Beltrami, E. (1835-1900) was an Italian mathematician who made notable 
contributions to differential geometry, non-Euclidean geometry, and mathematical 
physics. He developed the singular decomposition theory for matrices. We have 
mentioned Beltrami equations in Chapter 7 when we talked about the canonical 
form of elliptic PDEs. 


Bernoulli, Daniel (1700-1782) was a Swiss mathematician and physicist. He was 
one of the main pioneers of differential equations through interactions with Euler. It 
was Bernoulli who found Euler a job at St. Petersburg. The Bernoulli equation in 
fluid mechanics was named after him and is the main principle that the airplane 
wing was based upon. 


Bernoulli, Jacob (1655-1705) was a Swiss mathematician and a brother of Johann 
Bernoulli. He studied theology first and then shifted to mathematics and astronomy. 
He was one of the main contributors to calculus, differential equations, and the 
calculus of variations. The Bernoulli equation in first order ODEs was named after 
him. He was initially a tutor of his younger brother Johann (who was major in 
medicine), but soon their rivalry turned sour. When he considered the isochrone 
problem (originally posed by Galileo), or the curve of constant descent along which 
a particle will descend under gravity from any point to the bottom in exactly the 
same time, he used the term integral for the first time. He also discovered the 
Bernoulli numbers in probability and logarithmic spirals (such a spiral even 
appeared on his gravestone). 


Bernoulli, Johann (1667-1748) was a Swiss mathematician and was one of the 
many prominent mathematicians in the Bernoulli family. He is known as John 
Bernoulli. He studied medicine at Basel University. He was one of the main 
contributors to differential equations and educated the youth Euler. He also solved 
the isochrone problem independent of Jacob. He and his brother Jacob are among 
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the first mathematicians using calculus to solve real problems. He was also the 
discoverer of the L’H6pital’s rule. 


Bessel, F.W. (1784-1846) was a German astronomer and mathematician. He was 
the first to measure a star’s distance by parallax. He studied the perturbation of 
planetary and stellar motions and he developed a mathematical function, now 
called the Bessel function. This function had wide applications in many other areas 
of mechanics. The Hankel transform for cylindrical coordinates is based on the 
Bessel function. Bessel made fundamental contributions to positional astronomy, 
geodesy, and calculating the sizes of stars, galaxies, and clusters of galaxies. Based 
on the irregularities of Uranus’s orbit, he predicted the existence of Neptune in 
1840, but died a few months before its discovery. 


Cauchy, A.L. (1789-1857) was a French civil engineer, mathematician, and 
mechanician who founded complex analysis and contour integration (on which the 
inverse Laplace transform is based). He also made major contributions in 
continuum mechanics and elasticity. He published 7 books and over 700 papers, on 
such topics as calculus, definite integrals, limits, probability, convergence of 
infinite series, mechanics, astronomy, geometry, wave modulation, and complex 
functions. There are 16 concepts and theorems named after him, the most of any 
mathematician. The story is told that when Cauchy presented his theory of 
convergence of series, Laplace rushed home and checked those series that he used 
in his books on celestial mechanics (luckily they all converged). He was a devoted 
teacher, was the most careful in citing other people’s works, and candidly admitted 
errors in his publications. He is also a founder of the matrix method for systems of 
first order ODEs. 


Cayley, A. (1821-1895) was a British mathematician who knew many languages, 
including Greek, French, German, and Italian. He worked as a lawyer for 14 years. 
He developed matrix algebra, algebraic invariants, and n-dimensional geometry. He 
was a recipient of the Royal medal and Copley medal of the Royal Society. He 
finished his undergraduate course by winning the place of Senior Wrangler and the 
first Smith’s Prize. He is among the most prolific researchers. A total of 967 papers 
were assembled in his 14 volumes of collected works published by Cambridge 
University Press. He worked in nearly all areas of mathematics. His hobbies 
include novel reading, painting, architecture, traveling, and hiking. He wrote many 
papers in French and was totally at home with German and Italian. He was popular 
among continental mathematicians. For example, Hermite compared him to Cauchy 
and others even compared him to Euler. There are also criticisms of him. Tait 
remarked "Is it not a pity that such an outstanding man puts his abilities to such 
entirely useless questions?" and when G.H. Hardy was asked whether he thought 
Cayley was a great mathematician, he just glared. Indeed, it is not too difficult to 
spot careless typos in his collected works with careful reading. Quality and quantity 
are not always compatible. The factorization of the ODEs discussed in Section 
3.5.10 was introduced by Cayley. 


Charpit, P. (??-1784) was a “young” French mathematician who died in 1784. 
His birth year is not known. The so-called Lagrange-Charpit method for nonlinear 
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Ist order PDEs (also referred to as the “Lagrange method” or “Charpit method” in 
the literature) was named after him. Paul Charpit was a nephew of Laplace, and had 
assisted in a course of Monge. There is a dispute in recent literatures as to whether 
Charpit contributed anything to the so-called Lagrange-Charpit method (e.g., Kline, 
1972, p 535). Charpit submitted a paper to the French Academy of Sciences in 
1784, the year of his death. The work was never published. His work was first 
reported and publicized by Sylvestre-Francois Lacroix in 1814 (Johnson, 2010). 
Kline (1972) reported that when Jacobi learned about the method from Lacroix, he 
expressed the wish that Charpit’s work be published. Kline (1972) further claimed 
that it was never done. Kline (1972) further questioned the reliability of Lacroix’s 
claim in his book, and he inferred that “Lagrange had done the full job and Charpit 
would have added nothing.” In fact, the assertion by Kline is not accurate. Grattan- 
Guinness and Engelsman (1982) did find the original manuscript of Charpit that 
Lacroix got at the Archives of the Academie des Sciences, Paris. A copy owned by 
Charpit’s friend Arbogast was also found at the Biblioteca Medicea-Lauenziana, 
Florence. According to Grattan-Guinness and Engelsman (1982), Charpit presented 
his paper on June 30, 1784, but died on December 28, 1784. When Charpit read his 
paper to the Academie des Sciences, Monge, Bossut, Condorcet, Cousin, Laplace, 
Vandermonde, and de Borda were present. The original paper was in Laplace’s 
hands for 9 years before he passed it to Lagrange on June 13, 1793. In September 
1973, Lagrange sent the text to Arbogast, who made a copy (which ended up at 
Florence’s library mentioned above). Then, Arbogast sent the original paper to 
Lacroix (which ended up at Paris’s Archives of the Academie des Sciences 
mentioned above). When H. Villat sorted documents in the Archives of the 
Academie des Sciences in 1928, he unearthed Lacroix’s copy of Charpit’s paper. 
He made a photocopy and sent it to N. Saltykow in Belgrade, who published the 
main content of Charpit’s paper in French in two separate papers (Saltykow, 1930, 
1937). The content of these papers did contain the main equations that we today 
call the Lagrange-Charpit method. Therefore, we disagree with Kline’s (1972) 
verdict that Charpit would have added nothing to the method. 


Clairaut, A.C. (1713-1765) was a French mathematician, astronomer, and 
geophysicist. He was a prodigy, published his first paper at the age of twelve, read 
in front of Academie of France at fourteen, and published his first book at eighteen 
in 1731. He found the exact condition for Ist order ODEs. The Clairaut equation 
was named after him, and in the process he found his singular solution of 
differential equation. His second book in 1743 was on the equilibrium of fluids and 
the attraction of ellipsoids of revolution. In the book, Clairaut’s theorem was 
applied to find Earth's ellipticity based on surface measurement of gravity. His 
third book in 1752 was on the three-body problem in astronomy. He calculated the 
path of Halley’s comet as affected by the perturbation of the planets. He was 
elected to fellow of the Royal Society at the age of twenty-four. 


Courant, R. (1888-1972) was one of the most influential applied mathematicians 
of our time, and was a student of Hilbert and Klein. He helped Hilbert to search 
literature and write lectures. As a result, he co-authored with Hilbert the book 
Methods of Mathematical Physics. It is a classic for both applied mathematicians 
and physicists (in fact Hilbert’s contribution to the writing of this book is limited). 
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His charisma made him an educational leader. The CFL stability criterion of the 
finite difference method for solving PDEs was named after him, Friedrichs, and 
Lewy. According to J.T. Oden (1987), Courant was also one of the founders of the 
present-day finite element method. He left Germany and joined New York 
University and pretty much single-handedly founded the Courant Institute of 
Mathematical Sciences in 1936. He recruited K.O. Friedrichs and J.J. Stoker, 
making “Courant Institute” the home of the next generation of applied 
mathematicians in the United States. He was a member of the National Academy of 
Sciences of USA. 


D’Alembert, J.L.R. (1717-1783) was a French mathematician, mechanician, 
physicist, philosopher, and music theorist. He studied law and medicine before he 
shifted to mathematics and mechanics. He worked on dynamics, celestial 
mechanics, and partial differential equation. The D’Alembert principle in dynamics 
was named after him. He was one of the founders of the three-body problem, 
together with Clairaut, Euler, Lagrange, and Laplace. The D’Alembert solution for 
1-D wave equations was named after him. He was the illegitimate son of writer 
Tencin and artillery officer Destouches, and was left on the steps of the Saint-Jean- 
le-Rond de Paris Church. His first name Jean-le-Rond came from the name of the 
church. He was put in an orphanage and was later adopted by the wife of a glazier. 
D’Alembert’s education was secretly supported by his father Destouches, who left 
a modest sum to support him when D’Alembert was only nine. He was elected to 
the Academie des Sciences in 1741, and he was the co-editor (with Denis Diderot) 
of 17 volumes of an encyclopedia. In his later years, he helped launch the careers 
of Lagrange and Laplace. He was the first man proposing the idea of 4-dimensional 
time-space. 


Darboux, G. (1842-1917) was a French mathematician. He made several 
important contributions to geometry and linear differential equation. His results on 
differential geometry of surfaces were compiled in four volumes of collected works 
from 1887 to 1896. In Darboux’s hand, differential geometry became connected 
with ordinary differential equations, partial differential equations, and mechanics. 
His influence in France was compared to that of Klein in Germany. He was a 
biographer of Henri Poincaré and edited the Selected Works of Joseph Fourier. In 
1884, he was elected to Academie des Sciences, elected as a fellow of the Royal 
Society in 1902, and received the Sylvester Medal in 1916. 


Debye, P.J.W. (1884-1966) was a Dutch-American physicist, physical chemist, 
and applied mathematician, and Nobel laureate in Chemistry (1936). His degree 
was in electrical engineering and he got his PhD following theoretical physicist 
Arnold Sommerfeld, who later claimed that his most important discovery was Peter 
Debye. In 1909, Debye obtained a new integral representation of the Bessel 
function of large order using the method of steepest descent of Riemann. He 
developed the theory for dipole moment to charge distribution in asymmetric 
molecules in 1912 (the units of molecular dipole moments are termed “Debyes” in 
honor of him). He derived the Planck radiation formula using a method which 
Plank agreed was simpler than his own. In 1913, he extended Bohr’s atomic 
structure, introducing elliptical orbits. In 1914-1915, he and Paul Scherrer 
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calculated the effect of temperature on X-ray diffraction of crystalline solids (the 
Debye-Waller factor). He and his assistant developed the Debye-Huckel equation 
to model the conductivity of an electrolyte solution in 1923. He also developed a 
theory to explain the Compton effect, the shifting of the frequency of X-rays when 
they interact with electrons. After serving as professor at the University of Zurich 
(succeeding Einstein in 1911), Utrecht, Gottingen, ETH Zurich, Leipzig, and 
finally Berlin (succeeding Einstein in 1934), Debye moved to Cornell University in 
1940 for good just before the Nazi invasion of the Netherlands (we will come back 
to this later). He was awarded the Rumford medal in 1930, the Lorentz medal in 
1935, the Nobel Prize in 1936, the Franklin medal in 1937, the Max Plank medal in 
1950, the Priestley medal in 1963, and the National Medal of Science in 1965. 
Actually, Debye was nominated 47 times to receive the Nobel Prize, both in 
Physics and Chemistry. Two of these nominations were made by the great 
mathematician David Hilbert. Debye’s name was never mentioned in Struik (1987) 
and Kline (1972), Jenkins-Jones (1996) listed him as physicist, and Millar et al. 
(2002) listed him as chemical-physicist. In Chapter 13, we mentioned the saddle 
point method by Debye. He is also known for the Debye potential in solving 
Maxwell equations for spherical waves. Therefore, Debye should at least be 
identified as an applied mathematician, as we did here. According to Beiser (2003), 
Heisenberg, a colleague of Debye for a time, thought him lazy (“I frequently see 
him walking around his garden and watering roses even during duty hours of the 
Institute’), but Debye published nearly 250 papers and received the Nobel Prize in 
chemistry in 1936. 

Forty years after death, a Dutch book written by Rispens accused Einstein of 
actively trying to prevent Debye from being appointed in the United States and 
further accused Debye of being a Nazi activist, and stirred an international debate 
(leading to the 2007 NIOD report and the 2008 Terlow report). This initially 
resulted in the “Debye Institute” at University of Utrecht being renamed in 2006 
but the name was reinstated in 2008 after the 2008 Terlow report, but otherwise the 
claims by Rispens were mainly dismissed. In 2010, a publication by Reiding 
asserted that Debye may have been an MI6 spy because of Debye’s close friend 
Rosbaud being a well-documented spy, and because of Debye’s timely departure to 
the United States on January 16, 1940, coinciding with the planned German 
invasion of the Netherlands a day later. 


Dirichlet, J.P.G.L. (1805-1859) was a German mathematician who made 
contributions to partial differential equations and number theory. He proved 
Fermat’s last theorem for the case of n = 5, an exceptional feat for a 20-year-old 
without a degree. He read the result at the French Academy of Sciences, and that 
brought him immediate fame. This put him in close contact with Fourier and 
Poisson. They later introduced him to German explorer and scientist Alexander von 
Humboldt who secured a recommendation from Gauss for Dirichlet and helped to 
secure a teaching position for Dirichlet at the University of Breslau. However, he 
failed to pass the doctoral dissertation submitted to the University of Bonn because 
of his poor Latin. But because of his work on the last Fermat theorem, the 
university bypassed the problem by awarding him an honorary doctorate in 1827. 
He became the youngest member of the Prussian Academy of Sciences at age 27 in 
1832. His Dirichlet theorem proves the existence of an infinite number of primes. 
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He was a friend of Gauss and Jacobi. He succeeded Gauss’s chair at Gottingen in 
1855. His students at Gottingen included R. Dedekind, B. Riemann, and M. Cantor. 
He found the sufficient conditions for Fourier series to converge in 1829. The 
Dirichlet boundary value problem in PDEs was named after him. His PhD students 
included L. Kronecker and R. Lipschitz. His collected work was edited by 
Kronecker and L. Fuchs, initiated by the Academy of Berlin. He was a foreign 
member of Royal Society and French Academy of Sciences. 


Du Bois-Reymond, P. (1831-1889) was a German mathematician who worked on 
mathematical physics, including the Sturm-Liouville theory, integral equations, 
variational calculus, and Fourier series. He coined the terms elliptic, parabolic, and 
hyperbolic for the classification of the 2nd order PDEs. 


Erdelyi, A. (1908-1977) was a Hungarian-born British mathematician who was a 
leading expert on special functions, especially hypergeometric functions. He got a 
degree in electrical engineering from Czechoslovakia. He got a DSc in 1940 and 
joined the University of Edinburgh with the help of E.T. Whittaker. In 1946, 
Whittaker recommended Erdelyi to the California Institute of Technology to take 
up the task of publishing Harry Bateman’s manuscripts: the Bateman manuscript 
project. He was elected fellow of the Royal Society of Edinburgh in 1945, and 
fellow of the Royal Society in 1975. 


Euler, L. (1707-1783) was a Swiss mathematician, physicist, and astronomer. He 
is recognized as the greatest mathematician genius of all time. He wrote almost 900 
papers, memoirs, books, and other works, and is one of the most prolific 
mathematicians ever. He made seminal contributions to differential equations. In 
terms of mechanics, he contributed to the principle of superposition, the principle 
of virtual work, the free-body and section principle, tidal theory, and the Laplace 
equation in potential flow. His investigation of the seven-bridge problem of 
Konigsberg marked the beginning of graph theory. Euler made major contributions 
to all areas in mathematics, engineering, and science, including calculus, 
differential equations, analytic and differential geometry of curves and surfaces, 
number theory, infinite series (such as Euler’s constant in infinite series), calculus 
of variations, optics, acoustics, light, and hydrodynamics. It was estimated that 
three-quarters of analytical mechanics consists of Euler’s contributions. He also 
contributed to the design of telescopes, microscopes, and ships. His solution of the 
three-body problem of Earth, Moon, and Sun improved navigational tables. He 
developed much of classical perturbation theory. In geometry, the beautiful Euler 
formula for polyhedron relates numbers of vertices, edges, and faces. Euler’s 
formula of e”+1 = 0 is considered by many to be the most famous and beautiful 
formula in mathematics. In structural mechanics, Euler’s buckling formula for 
columns remains a classical result today. He investigated the base of the natural 
logarithm e (Euler’s number). The Eulerian formulation for large deformations is 
named in honor of him. Most of our modern mathematical notations are those of 
Euler. After Euler lost one of his eyes in Russia, he said “now I have less 
distraction and can focus more.” Euler had a prodigious memory and could perform 
complex calculations in his head when he became blind in his old age. 
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Forsyth, A.R. (1858-1942) was a Scottish mathematician who worked on theory 
of functions and differential equations. He was elected a fellow of the Royal 
Society in 1886 and awarded the Royal medal in 1897. He was forced to resign his 
chair at the University of Liverpool as a result of his adultery with the wife of 
physicist Boys. His only student was E.T. Whittaker, whose biography is also 
covered in this section. His most important contribution to differential equations 
was his 6 volumes of Theory of Differential Equations. 


Fourier, J.B.J. (1768-1830) was a French mathematician and physicist. The 
Fourier series, Fourier transform, and Fourier law of heat conduction were named 
in his honor. He accompanied Napoleon on his Egyptian expedition. His name is 
inscribed on the Eiffel Tower. He also contributed to dimensional analysis, and the 
heat equation or diffusion equation in 2"! order PDEs. He was the first in 
discovering the greenhouse effect of the Earth’s atmosphere. He attended Ecole 
Normale in Paris and was taught by Lagrange, Laplace, and Monge. His advisor 
was J.L. Lagrange and his students included G. Dirichlet. He was elected to the 
Academie des Sciences in 1817. 


Fredholm, E.I. (1866-1927) was a Swedish mathematician who contributed to 
integral equations and the theory of Hilbert space. His teacher was Mittag-Leffler. 
The Fredholm integrals of the first and second kinds are named in honor of him. 
His Fredholm Alternative Theorem considered the existence of a solution to the 
nonhomogeneous differential as well as integral equations, and is a very important 
theorem in differential equations. 


Friedrichs, K.O. (1901-1982) was a German American mathematician. When he 
taught at Technische Hochschule, he fell in love with a young Jewish student, 
Nellie Bruell. With the anti-Semitic rules under Hitler’s regime, they managed to 
emigrate separately to New York City where they married. In New York, he joined 
his former teacher Courant, and became the co-founder of the Courant Institute at 
New York University. He received the National Medal of Science in 1977. He 
worked on partial differential equations, the finite difference method, differential 
operators in Hilbert space, and nonlinear buckling of plates. The CFL criteria for 
numerical stability in time step in finite difference is named after him, Courant, and 
Levy. He formalized the boundary layer analysis proposed by L. Prandtl to become 
the method of matched asymptotic expansion. 


Frobenius, F.G. (1848-1917) was a German mathematician who made 
contributions to differential equations, elliptic functions, number theory, and group 
theory. He gave the first proof of the Cayley-Hamilton theorem. His teachers 
include Kronecker, Kummer, and Weierstrass. He succeeded Kronecker at Berlin 
in 1891 upon the recommendation of Weierstrass. He was elected to the Prussian 
Academy of Sciences. 


Frullani, G. (1795-1834) was an Italian mathematician. He was a professor at the 
University of Pisa. He worked on definite integrals and trigonometric functions in 
series and in integrals. The Frullani integral discussed in Section 1.8 was named 
after him. 
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Fuchs, L. (1833-1902) was a German mathematician who worked on differential 
equations. He was a student of Weierstrass and a contemporary of Riemann. His 
works resulted in a Fuchsian theory of linear ODEs. He mainly considered 
solutions of linear differential equations with singular points, which include the 
hypergeometric equation as a special case. His work was a great influence on Henri 
Poincaré. The term adjoint of differential equation was coined by Fuchs in 1873. 


Galerkin, B.G. (1871-1945) was a _ Russian/Soviet structural engineer, 
mathematician, elastician, and engineer who made significant contributions to 
numerical methods for solving differential equations and to the theory of three- 
dimensional elasticity by extending Love’s potential to 3-D cases. He grew up in a 
poor family and went to work in the Russian Court as a calligrapher at the age of 
12. In his college years, he had to work as a private tutor and draftsman to support 
himself. His involvement in political activities when he worked as a railway 
engineer resulted in a 1.5-year jail sentence. It was the turning point in his life. He 
lost interest in politics and devoted himself to science and engineering. He wrote 
his first paper (130 pages) while in prison. In 1915, Galerkin published a paper on 
the approximate solution of differential equations applied to plate bending 
problems. This method is now known as the Galerkin method. This method forms 
the basis of the finite element method. He was a member of the Academy of 
Sciences (USSR). 


Gauss, C.F. (1777-1855) was a German mathematician considered by many to be 
one of the greatest of all mathematicians. He contributed to all areas of 
mathematics, especially number theory, statistics, and topology. In statistics, 
normal distribution is called Gaussian distribution. Gauss also originated the 
method of least squares for best fit curves among data points. In science, Gauss 
made contributions in geodesy, electric telegraph, crystallography, optics, 
mechanics, electricity, magnetism, and capillarity. His book on arithmetic is the 
basis of modern number theory. The Gauss theorem, introduced in Chapter 1, is of 
great importance in mechanics. 


Goursat, E. (1850-1936) was a French mathematician who contributed to 
complex analysis, differential equations, and hypergeometric series. The proof of 
the Cauchy theorem was extended by Goursat to the general situation that no 
Riemann-Cauchy relation is needed (Spiegel, 1964). It was therefore also known as 
the Cauchy-Goursat theorem (see Chapter 1). His doctoral thesis advisor was 
Gaston Darboux. 


Green, G. (1793-1841) was a British mathematical physicist who introduced 
Green’s theorem and Green’s function method for partial differential equations. 
The entire Chapter 8 discussed Green’s function method. These methods had huge 
impacts in applied mathematics and mechanics. His work on potential theory ran 
parallel to that of Gauss. Green’s story is remarkable in that he was almost entirely 
self-taught; he only had one year of formal education at the age of eight. The son of 
a baker, he worked his childhood years in a bakery, except for one year of formal 
schooling at Robert Goodacre Academy. He published his famous Green’s theorem 
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in “An Essay on the Application of Mathematical Analysis to the Theories of 
Electricity and Magnetism” at his own expense at the age of 35 in 1828. This work 
was considered by some to be one of the most significant mathematical works of all 
time. The way that he acquired his mathematical skill remains a mystery. He was 
encouraged by Sir Bromhead to enroll as an undergraduate at Cambridge 
University at the age of 40. He died before his work were discovered and 
publicized by Lord Kelvin (see biography of Lord Kelvin). His works were further 
developed by James Maxwell to formulate the electromagnetic theory. To 
commemorate the 200th anniversary of his birth in 1993, a plaque bearing Green’s 
name was placed in Westminster Abbey near Isaac Newton’s grave. Similar honors 
have been given to Michael Faraday, William Thomson (Lord Kelvin), and James 
Clerk Maxwell. 


Hadamard, J. (1865-1963) was a French mathematician who founded the area 
functional analysis. Hadamard was one of the most influential mathematicians of his 
time. He published over 300 papers containing novel and highly creative works. He 
made contributions to logic, complex analytic functions, number theory, geodesics, 
and hydrodynamics. He proved the prime number theorem (proposed by Gauss and 
Riemann) independently with Poussin that the number of prime numbers less than x 
approach x/Inx as x — 00. This remains perhaps the most important result in number 
theory. He published a book on psychology of mathematical minds and initiated the 
concept of “well posed” in differential equations. He was an acclaimed and inspiring 
lecturer. Hadamard’s method of descent for 2-D waves is discussed in Section 9.2.7. 


Hamilton, Sir W.R. (1805-1865) was an Irish physicist and mathematician who 
made major contributions to optics, mechanics and quaternions (an extension of the 
complex number to higher dimensions). His Hamilton principle is covered in Chapter 
14. In classical mechanics, the Hamiltonian is named after him. He was a fellow of the 
Royal Society of Edinburgh. 


Hankel, H. (1839-1873) was a German mathematician who made significant 
contributions to complex and hypercomplex numbers, and the theory of function. The 
Hankel functions provided a solution to the Bessel equation. The Hankel transform 
used in Chapter 11 bears his name. He originated the “measure” theory of point 
sets which are useful in probability, cybernetics, and electronic. 


Helmholtz, H. von (1821-1894) was a German physicist, mathematician, and 
physiologist. He discovered the law of conservation of energy, developed a theory 
on the nature of harmony and musical sound (he was a skillful musician), and 
invented the ophthalmoscope for viewing the human retina. Boltzmann was one of 
his students. Helmholtz was considered the most versatile scientist of his century. 
He has been called the last scholar whose work covered science, physiology, and 
the arts. Helmholtz believed that his diversified interests helped him adopt novel 
ideas in research. Together with Kirchhoff, he was one of the main contributors to 
mathematical physics in Germany in the 19th century. His work on Riemann’s 
quadratic measures led to the Lie—Helmholtz space problem which is important to 
Einstein’s relativity, group theory, and physiology (Struik, 1987). Helmholtz’s 
equation (a reduced wave equation) is discussed in Chapters 7 and 9. 
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Hilbert, D. (1862—1943) was a German mathematician who worked on geometry, 
logic, and functional analysis for differential and integral equations (see Chapter 
11). Hilbert space was named after him, and is important in the spectral theory of 
self-adjoint linear differential operators (like in quantum mechanics). The Einstein- 
Hilbert action for the field theory of gravitation was named after him. He also 
coauthored the famous textbook on mathematical physics with Courant. At 
Gottingen, one of his students was H. Weyl and one of his assistants was J. von 
Neumann. He listed 23 unsolved problems at the International Congress of 
Mathematics in Paris in 1900. He was a foreign member of the Royal Society. 


Hua, Loo-Keng, (1910-1985) was a Chinese mathematician who made important 
contributions to number theory. After three years of middle school, Hua attended 
the Chinese Vocational College in Shanghai but could not graduate due to the lack 
of funding. In 1927, he returned home in Jintan (Jiangsu Province) to help his 
father’s store and studied mathematics by himself. In 1929, Hua suffered from 
typhoid fever, resulting a partially paralyzed left leg. Hua did not receive a formal 
university education, although he was awarded several honorary doctorates (from 
the University of Nancy, the Chinese University of Hong Kong, and the University 
of Illinois). After reading his early paper, Prof. Xiong Qinglai invited Hua to study 
mathematics at Tsinghua University. Despite the lack of formal qualification, Hua 
was exceptionally hired by Tsinghua, initially at Library and eventually rose to the 
rank of lecturer because of his research papers. In 1935-36, Hua attended classes 
by visiting French mathematician Jacques Hadamard and American mathematician 
Norbert Wiener. It was reported that he was the only one who could follow through 
to the end of the lecture series. Wiener was impressed and mentioned Hua to G.I. 
Hardy at Cambridge. Hardy invited Hua to Cambridge to visit (probably 
envisioned another Ramanujan from China) for 2 years. Hua quickly established his 
name in the area of number theory. In 1938, in view of the full outbreak of the 
Sino-Japan war, Hua decided to return to Tsinghua, where Hua was hired as a full 
professor despite not having any degree. Due to Japanese occupation, Hua 
followed Tsinghua’s retreat to Yunnan. Despite the hardship of poverty, enemy 
bombing, and academic isolation, Hua continued to produce important 
mathematical papers. After the war, Hua visited Ivan Vinogradov in the Soviet 
Union for three months and then the Institute of Advanced Study at Princeton 
University in the United States. In the spring of 1948, Hua accepted the 
appointment of full professor at the University of Illinois at Urban-Champaign. In 
October 1949, he gave up his comfortable life in the United States and returned 
China with his wife and kids. Gradually, Hua moved from pure mathematics to 
applied mathematics, including linear programming, operation research, and 
multidimensional numerical integration. He was involved in solving all practical 
problems that new China faced using mathematics. His Chinese book An 
Introduction to Higher Mathematics was translated into English in 2009, with an 
excellent introduction to differential equations. He was elected a foreign associate 
of the National Academy of Sciences in 1982. Chapter 7 reported his solution for a 
circular domain governed by the biharmonic equation. 


942 Theory of Differential Equations in Engineering and Mechanics 


Huygens, C. (1629-1695) was a prominent Dutch mathematician and scientist. He 
made notable contributions to astronomy, physics, and probability. His father was a 
diplomat and was a friend of G. Galilei, M. Mersenne, and R. Descartes. Huygens 
was educated at home until the age of 16. He studied the rings of Saturn and 
discovered its moon, Titan. He invented the pendulum clock. The Huygens or 
Huygens-Fresnel principle in wave propagation was named after him. In 1646, he 
demonstrated that a catenary is not a parabola. Huygens was the first to derive the 
period of an ideal pendulum. In 1659, he derived the centripetal force of circular 
motions. He tutored the young diplomat Gottfried Leibniz in mathematics from 
1673. In 1673, Huygens found the curve down which a mass will slide under the 
influence of gravity in the same amount of time, regardless of its starting point; he 
solved this so-called tautochrone problem by geometric method. In 1675, Huygens 
patented a pocket watch. He was elected to the Royal Society in 1663. The 
Huygens principle for a 3-D wave is discussed in Chapter 9. 


Ince, E.L. (1891-1941) was a British mathematician who worked on differential 
equations with periodic coefficients, such as the Mathieu equation and Lame 
equation. He also introduced the Ince equation, which is a generalization of the 
Mathieu equation. He received the Smith Prize in 1918 and the Makdougall 
Brisbane Prize in 1938-1940. His famous book Ordinary Differential Equations in 
1956 remains a classic today. 


Jacobi, C.G.J. (1804-1851) was a German mathematician who made fundamental 
contributions to elliptic functions, dynamics, differential equations, and number 
theory. Jacobi elliptic functions formed the solution of the pendulum problem 
(Chapter 2 and Appendix D). In mapping, we check the existence of a nonzero 
Jacobian, which was named in his honor (see Chapters | and 6). For first order 
ODEs, we have discussed the Jacobi method in Chapter 3. For lst order PDEs, 
there is the Jacobi method (see Chapter 6). In celestial mechanics, Jacobi’s integral 
gave the first integral of the three-body problem (Moulton, 1914). 


Kelvin, Lord (Thomson, William) (1824-1907) was an Irish mathematician, 
physicist, and mechanician. Kelvin is probably best known for his introduction of 
the absolute temperature scale Kelvin. As a young man, he discovered Green’s 
work, then little known, and publicized it. Since then, Green’s method has become 
a powerful tool in mathematical physics. His work on the conservation of energy 
led to the second law of thermodynamics. He was an unusual scientist with 
unparalleled enthusiasm, energy, and talent. He invented the tide gauge, an 
improved compass, and a simpler method for fixing a ship’s position at sea. He 
investigated many different areas of science. He published 661 papers and many 
books and was the author of several patents. He coined the term “turbulence” in 
fluid mechanics. The Kelvin solution in elasticity remains one of the most 
fundamental contributions to applied mechanics. For his role in the Kelvin-Stokes 
theorem, see the biography of G.G. Stokes (see also Chapter 1). His stationary 
phase method with Stokes is discussed in Chapter 12. He directed the first 
successful project for a transatlantic cable telegraph, which became operational in 
1866, and brought him considerable wealth. The Cambridge Dictionary of 
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Scientists says he was “probably the first scientist to become wealthy through 
science” (Millar et al., 2002). 


Kirchhoff, G.R. (1824-1887) was a German physicist and a pioneer in 
spectroscopy. He also made major contributions to plate theory and elasticity. An 
early accident made him a wheelchair user but did not alter his cheerful character 
or hinder his scientific curiosity. He formulated Kirchhoff’s law for electrical 
networks. Kirchhoff and his lifelong friend and colleague Bunsen established 
spectroscopy as an analytical technique in chemical analysis. Using spectroscopy, 
they discovered the elements cesium and rubidium, and were able to analyze the 
chemical element present in the Sun’s atmosphere (see, however, the biography of 
Stokes for his role in the development of spectroscopy). The spectrometer, 
telescope, and microscope are the most dominant scientific instruments of our time. 
Kirchhoff’s formula for 3-D waves is discussed in Chapter 9. 


Kline, M. (1908-1992) was a physicist and applied mathematician who studied the 
history of mathematics and philosophy on physical sciences. He graduated from 
New York University (NYU) and continued to teach at NYU as an instructor. In 
1946-1966, he was the director of the division for electromagnetic research at the 
Courant Institute of Mathematical Sciences. He made significant contributions to 
mathematics teaching and was an advocator of changes. He published more than 12 
books and mathematics and sciences. His 3-volume series, Mathematical Thought 
from Ancient to Modern Times, is probably the best book on the historical 
development of differential and integral equations. 


Kovalevskaya, S. (1850-1891) was probably one of the most influential female 
mathematicians in the 19th century. She was the mathematical protégé of 
Weierstrass and Mittag-Leffler. She was also a novelist. But she died in her prime 
at the age of 40. She was known for her ability and originality. The Cauchy- 
Kovalevskaya theorem mentioned in Chapter 6 was published at the age of 25. She 
obtained the prestigious Bordin prize on her mathematical theory on the dynamics 
of top (now known as Kovalevskaya top). She was promoted to full professor at 
Hogskola because of her originally, charm, and mathematical knowledge. She was 
particularly popular among her students. 


Lagrange, J.L. (1736-1813) was an Italian mathematician. He is one of the most 
influential mathematicians on differential equations, in addition to his major 
contributions in number theory and algebra. Lagrange almost single-handedly 
developed the major part of the methods for solving first order PDEs (see Chapter 
6). The calculus of variations presented in Chapter 13 was also mainly formulated 
by Lagrange, including the use of the Lagrange multiplier method. We also 
discussed the Lagrange identity in Chapters 7 and 10 when we discussed the adjoint 
ODE problem. He also made major contributions to celestial mechanics, winning 
him prizes of the French Academy of Sciences many times. In classical mechanics, 
the Lagrangian formulation is the standard (although Hamiltonian may be more 
powerful for non-classical mechanics, like quantum mechanics). He was elected a 
fellow of the Royal Society of London, Royal Society of Edinburgh, foreign 
member of the Swedish Academy of Sciences, and member of the Berlin Academy. 
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Lambert, J.H. (1728-1777) was a Swiss mathematician, physicist, philosopher, 
and astronomer. He is best known for proving the irrationality of x. After he left 
school, he had been working as an assistant to his father (a tailor), a clerk, a private 
tutor, and a secretary to an editor. In 1758, he published first book on optics and 
cosmology, and this allowed him to start an academic career. After a few posts, he 
was invited to a position at the Prussian Academy of Sciences in Berlin. Lambert 
was the first to introduce hyperbolic functions. Lambert also derived a theorem on 
conic sections to make the calculations of the orbit of comets much simpler. He 
also derived hyperbolic triangles for a concave surface. Lambert was the first 
mathematician to consider the general properties of map projection. For analyzing 
3-D planes on a plane surface, Lambert invented the azimuthal equal-area 
projection and the equal-area stereonet, which finds application in rock slope 
stability analysis. 


Laplace, P.-S. (1749-1827) was a French mathematician, astronomer, and 
mathematical physicist. The story has often been told of how D’Alembert gave him 
difficult mathematical problems to test his ability, and found that Laplace was able 
to solve them overnight. Much impressed, D’Alembert helped secure Laplace a 
teaching job at the Ecole Militaire in Paris. He is one of the founders of probability, 
and he made his name in celestial mechanics by publishing a five-volume survey of 
celestial mechanics. He theorized that the solar system originated from a cloud of 
gas (called the nebular hypothesis). Laplace developed the concept of potential and 
the study of the Laplace equation. He was from a poor family, but he was appointed 
minister and later senator by Napoleon. The Laplace transform that bears his name 
is of fundamental importance for solving differential equations. Many considered 
Laplace as the most illustrious scientist of France’s golden age, and one of the most 
influential scientists of all time. Our current unit of length, the meter, was proposed 
by Laplace in 1790. Laplace is considered only second to Newton in scientific 
talent. He was known for his arrogance, and he frequently neglected to 
acknowledge the sources of his results. He was notorious for overusing the term “it 
is obvious” in mathematical derivations when it was far from obvious (James, 
2002). 


Legendre, A.M. (1752-1833) was a French mathematician who contributed to 
number theory, celestial mechanics, and elliptical functions. In celestial mechanics, 
he derived the Legendre equations and the Legendre polynomials (see Chapter 4). 
As shown in Chapter 9, it is closely related to the Laplace equation in spherical 
coordinates. 


Leibniz, G.W. (1646-1716) was a German mathematician, philosopher, and 
diplomat. His Bachelor and Master degrees are both in philosophy, and he had a 
doctoral degree in law. He could write articles in Latin, French, and German. He 
assisted redrafting of the legal code for electorate for the Elector of Mainz, and was 
involved in politics in proposing an Egyptian plan for German-speaking Europe to 
diverge France’s attention from them to Egypt. In Paris, he met Dutch physicist and 
mathematician Christiaan Huygens and Irish chemist Robert Boyle, and began a 
program of self-study with Huygens as his mentor. Eventually, Leibniz developed 
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differential and integral calculus independently of Newton, and his notations in 
calculus have been adopted today. He also contributed to the development of 
mechanical calculators and refined the binary number system for later development 
of the digital computer. When he visited the Royal Society, he demonstrated his 
calculating machine carrying all four arithmetical operations and finding square 
roots, and the Society quickly made him a foreign member. Leibniz was the first to 
see that the coefficients of a system of linear equations could be arranged into an 
array or matrix. His integral sign | represents an elongated S, from the Latin word 
summa and d representing differentiate or Latin differentia. Product and quotient 
tules of differentiation are due to Leibniz. The Leibniz rule for differentiation of 
integrals is named after him. In dynamics, he favored the conservation of energy 
instead of the conservation of momentum used by Newton. He also envisioned a 
universal language for all humans. 


L’H6pital, G.F.A. (1661-1704) was a French mathematician. The first calculus 
textbook was written by L’H6pital in 1696. The so-called L’H6pital’s rule in 
taking the limit of the indeterminate form of 0/0 and 0/co appeared in his book, but 
actually was discovered by Johann Bernoulli. He was elected to the French 
Academy of Sciences in 1693. 


Liouville, J. (1809-1882) was a French mathematician who made contributions to 
number theory, complex analysis, differential geometry and topography, linear 
differential equations, mathematical physics, and astronomy. His teachers at Ecole 
Polytechnique included A.M. Ampere. Liouville theorem in complex analysis is in 
honor of him. In mathematical physics, Sturm-Liouville problem resulted from 
collaboration with C.F. Sturm; it studied the eigenfunction expansions of self- 
adjoint type boundary value problems and found important application in physics. 
In Hamiltonian dynamics, the Liouville-Arnold theorem was named in honor of 
him. He proved the existence of transcendental numbers. According to Mittag- 
Leffler, Liouville’s greatest grief was meeting Abel, but failed to realize his 
mathematical talent. He was the first to recognize the importance of the 
unpublished works of Galois. 


Mittag-Leffler, G. (1846-1927) is regarded as the father of Swedish mathematics; 
he spent altogether 3 years in France and Germany. He became friends of Hermite, 
Poincaré, and Weierstrass. He considered himself as a disciple of Weierstrass, 
particularly following his power-series approach to function theory. The Mittag- 
Leffler expansion of trigonometrical functions was reported in Chapter 1. He also 
found a job for the extremely talented Sonya Kovalevskaya at Hogskola (see 
biography of Kovalevskaya). He was a member of King Oscar II’s circle (the king 
himself was a mathematician) and was an international celebrity, although he was 
not so popular among outsiders. When Alfred Nobel set up his Nobel Prize, 
mathematics was one of the potential subject areas to be awarded. However, when 
he knew that Mittag-Leffler was a potential candidate for the prize in mathematics, 
Nobel crossed out mathematics and no such prize has ever been awarded. This 
negative impact by Mittag-Leffler on mathematics is definitely more far-reaching 
than his mathematical contributions. 
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Monge, G. (1746-1818) was a French mathematician and the father of descriptive 
geometry (the mathematical basis for technical drawing) and of differential 
geometry. He was involved in the reform of the French education system during the 
French Revolution, helping to found Ecole Polytechnique. He devised a graphical 
method to optimize the defensive arrangement of fortification. He joined 
Napoleon’s expedition to Egypt. His name was inscribed on the base of the Eiffel 
Tower. His method in solving second order partial differential equations was 
reported in Chapter 7. 


Moulton, F.R. (1872-1952) was an American astronomer. He was a professor at 
the University of Chicago. He proposed that small satellites in the orbit around 
Jupiter were actually gravitationally captured planetesimals (small bodies that 
formed planets). This theory has become well accepted among astronomers. The 
Adams-Moulton method discussed in Chapter 15 for solving differential equation 
was named after him. 


Painlevé, P. (1863-1933) was a French mathematician and politician. Painlevé is 
best known for his studies of those nonlinear ODEs that can be transformed to 
linear ODEs or the so-called Painlevé transcendents. He was a student of Flex 
Klein. He served as French Prime Minister twice. 


Pfaff, J.F. (1765-1825) was a German mathematician who worked on series, 
integral calculus, and partial differential equations. The Pfaffian of Ist order PDEs 
was named after him. He was the PhD supervisor of C.F. Gauss, and also a teacher 
of A. Mobius. 


Picard, C.E. (1856-1941) was a French mathematician who made contributions to 
analytic functions and linear differential equations, including the Picard successive 
approximation for solving first order ODEs, and Painlevé transcendents for ODEs. 
He also worked on theories of telegraphy and elasticity. Charles Hermite was his 
father in law. 


Poincaré, H. (1854-1912) was a renowned French mathematician, mining 
engineer and theoretical physicist. He was a mining engineer by training, but got 
his PhD in mathematics under Charles Hermit. His thesis is on differential 
equations. Henri Poincaré, a pioneer in stability analysis of differential equations, 
was nominated to receive the Nobel Prize 34 times in 1910. In fact, Poincaré 
received a total of 51 nominations in 1904-1912 before he passed away in 1913, 
but he never received it. His Poincaré conjecture was one of the ten millennium 
problems of the Clay Mathematics Society. It was eventually “solved” by 
Perelman, but it was bigger news when he declined both the Fields medal and the 
Millennium Prize from the Clay Mathematics Institute. The New Yorker’s article 
stirs the controversy of Perelman with former Fields medalist S.T. Yau. 


Poisson, S.D. (1781-1840) was a French mathematician and physicist who made 
contributions to probability theory, elasticity, electricity, magnetism, heat, and 
sound. In probability, we have the Poisson distribution (the basis for modern 
hazard analysis) and in elasticity we have the Poisson ratio. In complex analysis, he 
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was the first to carry out path or contour integration of complex functions (called 
contour integration). The solutions of 2-D and 3-D wave equations are expressible 
in Poisson’s integral (Chapter 9). The nonhomogeneous Laplace equation is also 
called Poisson’s equation. 


Ramanujan, S. (1887-1920) was an Indian mathematician who made significant 
contributions to mathematical analysis, number theory, infinite series, and 
continued fractions. He received no formal training in mathematics but 
rediscovered known theorems and produced new ones. He was working as a clerk 
when he conducted his mathematical research. He was a prodigy discovered by 
G.H. Hardy, and Hardy compared Ramanujan with Jacobi or Euler. After 
Ramanujan sent some of his theorems to G.H. Hardy, Ramanujan was invited to 
work with him at Cambridge. He compiled a total of 3900 identities and equations, 
and many of them are highly original and unconventional, including the Ramanujan 
prime, and Ramanujan theta function. He became a fellow of the Royal Society and 
died at the age of 32. His 87 pages of lost notebook was rediscovered by George 
Andrews in 1976. The number 1729 was called the Ramanujan-Hardy number: 
Once Hardy took a taxi to visit Ramanujan in hospital, and found the cab number 
of 1729 rather dull and believed it was an omen. But when he told Ramanujan 
about this, Ramanujan told him that it is a very interesting number which is the 
smallest number that can be expressed as sum of two cubes in two different ways 
(i.e. 1729=13+123 = 93+103) (Kanigel, 1991). One of Ramanujan’s formulas has 
been adopted to generate a huge amount of digits of 2, and it became one way to 
test the speed of supercomputers. Ramanujan’s Master Theorem was introduced in 
Chapter 1 and Appendix C. Some of his amazing formulas on 1 were reported in 
Appendix F. The 2015 British biographical film The Man Who Knew Infinity was 
based on his biography written by Kanigel (1991). 


Rayleigh, Lord (Strutt, J.W.) (1842-1919) was a British mathematician and 
physicist and Nobel Prize winner for his work on gas density and on argon. 
Rayleigh made major contributions to sound, light, surface waves, and electricity. 
He wrote his classic book Theory of Sound partly on a boathouse on the Nile. He 
inherited the title Lord Rayleigh from his father, and succeeded Maxwell at 
Cambridge. Rayleigh explained the blue color of sky from the scattering of light by 
dust particles in the air. His enthusiasm on precise measurement led him to the 
standardization of electrical units in 1884: the ohm, ampere, and volt. The 
inconsistency of the Rayleigh-Jeans equation (published by Rayleigh in 1900), 
which describes the distribution of wavelengths in black-body radiation, led Planck 
to the formulation of quantum theory. In numerical analysis, the Rayleigh-Ritz 
method discussed in Chapter 14 is a powerful approximate method that bears his 
name. 


Riemann, G.F.B. (1826-1866) was a German mathematician, who originated 
Riemann geometry, which was used by Einstein in the theory of general relativity. 
He also made breakthroughs in conceptual understanding of the theory of 
functions, vector analysis, differential geometry, and topology. He was a student of 
Gauss. In complex variable theory, he developed the concept of the Riemann 
surface which separates multi-connected surfaces by branch cuts. The differentiable 
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condition for complex variables is now known as the Cauchy-Riemann relation. He 
defined the Riemann zeta function and formulated a Riemann hypothesis of this 
function. It remains one of the most important unsolved problems of number theory 
and analysis. The Clay Mathematical Institute of Cambridge offered US $1 million 
for its proof (Sabbagh, 2003). Riemann died at the age of 39 because of 
tuberculosis. 


Robin, G. (1855-1897) was a French mathematician whose method on single- and 
double-layer potentials boundary value problems of electrostatics is an important 
technique in potential theory (see Section 8.4 of Chapter 8). In short, the Dirichlet 
and Neumann boundary value problems were reduced to integral equations for 
electric densities and to power series expansions of potentials. The single-layer 
potentials were also known as Robin-Steklov potentials (Steklov was a student of 
Russian mathematician A. Lyapunov and Robin was a student of the well-known 
French mathematician Picard). The criterion for uniform convergence of his series 
is called the Robin principle. The Robin constant is associated with the logarithmic 
electric capacitor. His name has been associated with the third type of boundary 
value problem of potential theory (as the Robin boundary condition or Robin 
Problem), but Gustafson and Abe (1998a,b) concluded that Robin had actually 
never used or studied such a boundary condition (see Chapter 9). He was a 
recipient of the Francoeur Prize and Poncelet Prize. 


Sommerfeld, A.J.W. (1868-1951) was a German theoretical physicist who was a 
pioneer of quantum theory. He was nominated a record of 81 times for the Nobel 
Prize within a period of 34 years before his death, but he never received it. Very 
likely, this record will not be broken easily in the near future. Ironically, he made 
two nominations, one of Albert Einstein (Nobel laureate in 1921) and one of Max 
Planck (Nobel laureate in 1918), and thus, in a sense, his nominations had a 100% 
success rate. In addition, he served as PhD supervisor for more Nobel Prize 
winners in physics than any other supervisor before or since. His PhD students 
getting Nobel Prizes include W. Heisenberg in 1932 (PhD in 1923), H. Bethe in 
1967 (PhD in 1928), W. Pauli in 1945 (PhD in 1921), and P. Debye in 1936 (PhD 
in 1908). His other postgraduate and post-doctoral students getting Nobel Prizes 
include L. Pauling in Chemistry in 1954 and in Peace in 1962, II. Rabi in 1944, 
and M.T. von Laue in 1914. Many of his other students became famous in their 
own right. His professors include German mathematicians Lindemann, Hurwitz, 
and Hilbert, and after graduation Sommerfeld was an assistant of Felix Klein and 
wrote up his lecture notes for the reading room. This resulted in a six-volume series 
of classical textbooks of Lectures on Theoretical Physics, including Mechanics, 
Mechanics for Deformable Bodies, Electrodynamics, Optics, Mathematical Theory 
of Diffraction, and Partial Differential Equations in Physics. They are all 
influential and all have been translated into English. Albert Einstein also admired 
Sommerfeld for nurturing a large number of young talents. Sommerfeld was one of 
the pioneers of quantum theory. He was elected to the Royal Society of London, 
US National Academy of Sciences, Indian Academy of Sciences, and USSR 
Academy of Sciences. Example 2.7 in Chapter 2 presents Sommerfeld’s solution 
for the Thomas-Fermi equation. 
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Stokes, G.G. (1819-1903) was an Irish mathematician and physicist who made 
fundamental contributions to fluid dynamics. The most general governing equations 
for fluid dynamics are called the Navier-Stokes equations. Stokes described the 
phenomenon of fluorescence in 1852. His Stokes law for a sphere settling in a fluid 
also bears his name. He was the first to explain the fundamentals of spectroscopy. 
However, the Stokes theorem discussed in Chapter 1 was in fact discovered by Lord 
Kelvin and communicated to Stokes in 1850, and Stokes set the theorem as a question 
for the 1854 Smith’s prize examination, which led to the theorem bearing his name. 
Therefore, some mathematicians called it the Kelvin-Stokes theorem. Stokes served 
as president of the Royal Society. 


Sturm, J.C.F. (1803-1855) was a French mathematician and best known for the 
Sturm-Liouville problem discussed in Chapter 10. He pointed out the relevance of 
eigenvalue analysis for systems of first order ODEs to Cauchy in 1828 (see Chapter 
5). Related to his works on stability of ODEs, Sturm’s theorem is a basic result of 
finding and counting real roots of polynomials. He was a foreign member of the 
Royal Society of London. 


Taylor, B. (1685-1731) was an English mathematician who is known for his Taylor 
series expansion. He was elected fellow of the Royal Society in 1712. However, he 
failed to express his ideas fully and clearly. His Taylor series expansion was found 
important by Lagrange only in 1772. 


Titchmarsh, E.T. (1899-1963) was an English mathematician. He was a student 
of Hardy and succeeded his position at Oxford in 1931. He contributed on the 
entire function of complex variables, integral equations, Riemann zeta function, 
eigenfunction expansions of differential equations, and Fourier series. He was 
elected fellow of the Royal Society in 1931 and received the Sylvester Medal in 
1955. He also received the De Morgan Medal and the Berwick Prize from London 
Mathematical Society. The Titchmarsh contour integral is discussed in Chapter 1. 


Tricomi, F.G. (1897-1978) was an Italian mathematician famous for his studies of 
mixed type partial differential equations. His book on the mixed type second order 
partial differential equations in 1923 was translated to Russian, then from Russian 
to Chinese. He also authored /ntegral Equations. He spent two years at Caltech to 
work on the Bateman Manuscript Project with Erdelyi, Magnus, and Oberhettinger. 
He was the president of the Turin Academy of Sciences. The Tricomi equation is 
discussed in Section 7.7. 


Volterra, V. (1860-1940) was an Italian mathematician who contributed to 
functional theory, nonlinear integro-differential equations, biological and 
population growth, and dislocation theory. Volterra integral equations were named 
after him. His contribution to dislocation was introduced in Chapter 2. During 
World War I, he was involved in designing armaments, and he was also the first to 
propose the use of helium to replace hydrogen in airships. 


Watson, G.N. (1886-1965) was an English mathematician who was an expert in 
the application of complex variables to the theory of special functions, especially 
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Bessel functions. He co-authored, with his supervisor E.T. Whittaker, the 2nd 
edition of A Course of Modern Analysis (1915). His book Treatise on the Theory of 
Bessel Functions in 1922 is a classic even using today’s standard. In 1918, he 
proved Watson’s lemma, which has applications on the asymptotic behavior of 
exponential integrals. Convergence of certain infinite series can be improved by his 
Watson’s transform. Ramanujan’s lost notebook was in his hands before it was 
rediscovered by George E. Andrews in 1976 from Watons’s left-behind boxes at 
Wren Library at Cambridge. He was elected fellow of the Royal Society and 
received the Sylvester Medal from the Society. 


Weierstrass, K.T.W. (1815-1897) was a German mathematician who is often 
cited as the father of modern analysis. He studied law, economics, and finance at 
University of Bonn but dropped out because of its conflict with his hope to study 
mathematics. He then moved to the University of Munster to study mathematics. 
He formally laid down the rigorous foundation of calculus and clarified the 
concepts of uniform convergence, derivative, and continuity. He also paved the 
way for the modern study of the calculus of variations. The Weierstrass elliptic 
functions were named after him. This contribution to the gamma function was 
reported in Chapter 4. 


Whittaker, E.T. (1873-1956) was an English mathematician who contributed to 
applied mathematics, mathematical physics, and the theory of special functions. 
The Whittaker function in the theory of confluent hypergeometric functions (or 
Kummer functions) was named after him. He also derived Bessel functions in terms 
of integrals of Legendre functions. He married the daughter of the minister of the 
Presbyterian Church. His co-authored book A Course of Modern Analysis with his 
student G.N. Watson (Whittaker and Watson, 1927) is a classic text on 
transcendental complex functions. He also worked on the history of physics and 
celestial mechanics. As a historian of science, he wrote A History of the Theories of 
Aether and Electricity in 1910, which was revised and expanded to be two books in 
1951 and 1953. In this book, he attributed the discovery of special relativity more 
to Henri Poincaré and Lorentz, and less to Einstein. He also attributed the formula 
E=mc? to Poincaré. Whittaker’s view on this appears not in the main stream. In 
Bodanis’s (2000) biography on E=mc?, he did mention that Henri Poincaré made a 
presentation at the St. Louis World Fair entitled “theory of relativity” and came 
close to giving the famous formula which Einstein got a year later. Whittaker wrote 
The Calculus of Observations: A Treatise on Numerical Mathematics in 1924 and 
Treatise on the Analytical Dynamics of Particles and Rigid Bodies: With an 
Introduction to the Problem of Three Bodies in 1937. He received the Copley 
Medal and Sylvester Medal from the Society. The Whittaker equation is discussed 
in (3.584) in Chapter 3. 


Wronski, J.M.H. (1776-1853) was a Polish mathematician. The Wronskian 
repeatedly used in Chapter 3 was named after him by Thomas Muir in 1882. He 
was also a metaphysician and proposed to build a machine that could predict the 
future. 
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Taylor series expansion, 4, 30, 42, 
45-46, 51, 80, 147, 208, 212-213, 
243, 253, 258, 417, 449, 692, 715, 
718, 735, 737, 740, 746, 749, 750, 
754, 767, 852, 855, 873, 880-882, 
885, 915, 923 

Telegraph equation, 568 
Formulation, 569 
Solution, 570 

Tensor, 62 


Cartesian, 67 
Curl, 55 
Cylindrical, 68 
Divergence, 55 
e-6 identity, 66 
Gradient, 53 
Spherical, 71 
Terzaghi 1-D consolidation theory, 
575, 580 
Theory of distribution, 495, 513 
Thomas-Fermi equation, 95, 690 
Thomas-Fermi integral equation, 690 
Three-dimensional elasticity, 108, 
486 
Titchmarsh’s contour integral, 34 
Total differential, 12, 101 
Transonic flow in 2-D gas, 445 
Tricomi function, 300 
Tricomi’s equation, 449 
Solution, 450 
Tsunami, 767-768 
Two-dimensional steady gas flows, 
445 
Two-point boundary value problems, 
611 


Vv 


van der Pol equation, 93 
Vector, 21 
Vector calculus, 53 
Divergence, 55 
Curl, 55 
Gradient, 53 
Vector identities, 58 
Veubeke-Hu-Washizu principle, 817, 
822 
Vibrating spring, 540-541 
Vibrations of circular plates, 844 
Viéte’s formula, 920-923 
Virtual displacement principle, 819 
Virtual traction principle, 820 
Virtual work principle, 818 
Vlasov’s stress function, 112 
Volterra-Fredholm 
differential equation, 689 
Volterra integral equation 
First kind, 679 


integro- 
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Second kind, 679 
von Karman-Foppl equation, 111 
von Neumann test, 867 


Ww 


Wave equations 
2-D waves, 550 
3-D waves, 553 
3-D symmetric waves, 554 
D’Alembert solution, 540 
Domain of dependence, 546 
Hadamard’s method of descent, 
Say 
Influence zone, 546 
n-Dimensional waves, 560 
Poisson or Kirchhoff formula, 555 
Wavefront condition, 563 
Waveguide problem, 434 
Wedge problem, 661, 705 
Weighted residue method, 828 
Galerkin method, 829 
Least square method, 829 
Petrov-Galerkin method, 829 
Point collocation method, 829, 858 
Whittaker equation, 168, 171 
WKB/WKBJ, 730 
Wronskian, 148, 159, 182, 201, 338- 
339 


Y 


Yang Hui triangle, 3 
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